>_0004.000838_ 17739232 gi|17739232|gb|AAL41870.1| transcriptional regulator, LysR family [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:c856494-855730, Atu0856 MGIALVRRTTRSVSLTEAGQ QLYADVSPSIGAISQAAQAA SSLSGTVRGQLRLAVSSIAE RFLSGPLLASFADAHPDVQL DIVVTDEEFDIVAEQYDAGV RLGELIEQDMIAVPVSVPQR QLAVCSAEYRDRFGLPTQPR ELIDHRCIGWRARPGVAPYR WEFAENGREFAVAVQPDFTT NDMQLMIKLACAGAGITFGM EETFRPHLESGKLIAMLEDY SPVFAGFYLYYPSRRHIAPK LRAFIDHVRLSRER >_0083.003158_ NP_838579.1 gi|30064408|ref|NP_838579.1| putative LYSR-type transcriptional regulator [Shigella flexneri 2a str. 2457T] MLNSWPLAKDLQVLVEIVHS GSFSAAAATLGQTPAFVTKR IQILENTLATTLLNRSARGV ALTESGQRCYEHALEILTQY QRLVDDVTQIKTRPEGMIRI GCSFGFGRSHIAPAITELMR NYPELQVHFELFDRQIDLVQ DNIDLDIRINDEIPDYYIAH LLTKNKRILCAAPEYLQKYP QPQSLQELSRHDCLVTKERD MTHGIWELGNGQEKKSVKVS GHLSSNSGEIVLQWALEGKG IMLRSEWDVLPFLESGKLVQ VLPEYAQSANIWAVYRELLY RSMKFVSARNFWRHGASNGW ASPMKAIRSCRSTRNHSGSF SRFLLNFVQIFVS >_0018.003788_ BFUN_06OCT04_CONTIG481_REVISED_GENE3789 bfun_06oct04_Contig481_revised_gene3789 MRIPPLKAIVAFESVARTKS VNRAADELGLTPSAVSHQIA NLESMVGRPLFTRLGRGLVL TPTGQQYLSDVTGSLADLSR ATERASSQSGMEILRIHSSP SFGLMWLLPRLASFQEANGD IQLNLACSYEDVSFTSGYYD VDVRHGYAHWTDGEIKTLRN EFIAPLASAEYLGRHPRRSA MRWFNWLARRRAWSA >_0043.004057_ JANN_22DEC04_CONTIG27_REVISED_GENE4058 jann_22dec04_Contig27_revised_gene4058 MAVMTCSQRTANAIQSQSTS QSYMSNLHCPASALRVGFSL GYHPLFCYCRQKPSCIKKAG TMSYLDNIRTFVRVYELGSM SAAGRDLRISPAVTSSRISQ LEEHLSVRLFQRTTRSLTPT EHGKAFYSGAKEILESVENA EAQVIDITENPKGSLYVAAP LGVGRRLIAPEVPGFLDLYP EVRVRLRLTDRKVDLTTEGL DLAFFLGEPEDSTLRIRKIA DVERVLCAAPDYVAARGMPA GGAELTSGAHECLNLRFPGA TEFQWHLKTPDGPKRFRIQG RYESDDGDVLNDWALAGQGI VMKPIFEVAEHLKAGRLVAV AQETPPMPIQMACLFTHRRM QDPKTRLFMEFVINRIGDLV RQAEKLGQLPR >_0018.002719_ BFUN_06OCT04_CONTIG481_REVISED_GENE2720 bfun_06oct04_Contig481_revised_gene2720 MPNLRKKLPSANALFVFEAA ARCGNFTRAAQELYVSQPAV SRMLSRMEDHLGVRLFERVR GGIELTENGRILYRKISEGF NGIESAIREIEARATGVESV TLSVSTAFTTHWLMPRMSRL NQAFPNVDLRFQLISGRIGG PLVDVDLGMRFLREDEIGEN SVLVMPETLLPVCNRRYHEA AATEAGRKHGDTVIVMDDGE RGWHDRFAAFAAQGRHAASM LNFNDYAIVVQAALLGQGVA LGWLNVVSHWLLEGALLPAE EELIVTNRRCCLVWPENRPL RPVAADVRDWILDETRADVR AVDRQYPKLGLRRVLAQTGL AIGPARDEARSDAAARPVTP PGA >_0006.004341_ NP_890661.1 gi|33603101|ref|NP_890661.1| putative LysR-family transcriptional regulator [Bordetella bronchiseptica RB50] MSLPSLTALRTFEAAARYRS VKLAAAELHVTPTAVSHQIQ QLEDLLGVKLFERTGRGLVL TDAAVSCLPYLQQGFESLKV GVDKLRKHRGPDIITVNTSP SFASLWLFPRLHRFSLLYPD IDVRVTTRLRQAAQLRQEQQ SGVNNVQDWVQEADLVIAYG NGQFGGFEHEELIPLYIAPM CSPALLPKGEKQAMGSRLLE LPWLHDDRGTLYGSASFWQR WLAAAGLADGKPAKEMHFTH ALLALSAAADQLGVVVSTPV LAESLLRDGILYLPFDEQVR IDRSYYLVKDASTGNDSRLT IFKEWLRQEAALSNAKGTSA QRLA >_0013.003029_ NP_882021.1 gi|33594377|ref|NP_882021.1| LysR-family transcriptional regulator [Bordetella pertussis] MPNTIHQDEGRPIVNSNDHD SPAESRASGVLNLTHLRTLV AVVQEGHLTRAAERLRISQP AASHHLRSLEQQFGLPLFTR TPQGVVPTAAGLQLSEQAAR LLASSLELLSTASELRGSAS GRIAIGTIEDPSVHAALPSL IKWFHEHYPLIELSIESRNS SSIRQGILTGEINAGFYVSC TNEANLREYEIGQRELVVVA PQSWRERVATATWPELAKLP WVMTTTGSAHSEITAQLFRS HNITIRPALEVNTERLLRAM VSQGVGLGFTRREFAEAEQA RGAFFIVPISVHRTTMHFAY ARSAETDPLIQILSRGLATV LPVAERLIPSTAKPAEK >_0018.000912_ BFUN_06OCT04_CONTIG480_REVISED_GENE559 bfun_06oct04_Contig480_revised_gene559 MDTRQLKYFVAIVECGSMGK AAEKLYVAQPSLSQQMGRLE SEFGTSLLLRSQRGVTPTAA GQALYARSRAILRQMEQLKQ HVKEGASAESGTVAVGLPTT MVSVLAMPLIERVQQRYPGI HLQLIESMSGAITELLASAR LDLAILFRASDTLGVTAWPL LEEQLFVMGEPGDGVAAEAQ SCALSALNGVRMVAPGATNG LRLLLERVFARENLELNIVA DIDSLPTLLSIAESGHACTI LPVSALAQREAARCPKIRAI VAPELRRPASVCWSSTTMMS SATVAVCRMIVELVEDLTAS GVWKGISLPDEQSRRTLAEA LRAPERHA >_0086.000444_ ROSE_TM1040_30MAR04_CONTIG46_REVISED_GENE445 rose_tm1040_30mar04_Contig46_revised_gene445 MVLNIRHLAAHTQGRQTHSQ TLDFKNMTPPDQVHPRPHID PERLTREMNWNLLRTFVVLA ESGSITEAAERLRLKQPSVS VALKKLEDQLGQHLIDRSPG HFTLTKAGQMLYREAVNING SILRLATLMRTVTPEISGHV RIAVASHVLCPLFDSVIGDF YAAHPKASLAIDVITSGDAI TAVAAKRVSFALALVREQDP NLRYLPVYRETFGLFCGPRH PLFGRSDLTLEDLKGHAAVV FDNDRLQDALQDVTLLRARA ALAPEITGVSENLEEVRRMI MAGLGIGPLPKHVVARDIED GLLWQVPPFEDLPEVDVYLI WNPTTVRNRAEETLLTTLIE RLETVPFAERSYA >_0116.001030_ YP_193950.1 gi|58337365|ref|YP_193950.1| malolactic regulator [Lactobacillus acidophilus NCFM] MNIQDLRYFHELVNLKSYTK TAEKFGVSQPTITAAVKRLE NRFGGTFLIRDQPHKSIIIT RLGVQFDEHVQSILNEINIV EEEIKQNQNASIPFGLPPII GRNYFPKIVSQLFAKGLLHR LNVIENGSYDLYHLLLEGYI NFSMLGLTKVNVEPGIKLEV VKSYPMCIVVSKTHPLATKK AISFKEIANENFIGLSSDYI HTKALDRMLKKSKINLNTIY RSPDVTVVKNIVAQNLGISY LTTLSITENDDVISIPLLDK DQPKFILAAATRENHIMTDA EEEFWEILTQISLRRQVLTS TLPS >_0043.001348_ JANN_22DEC04_CONTIG22_REVISED_GENE1349 jann_22dec04_Contig22_revised_gene1349 MQTRALKTLSKIAQVGSFVQ SAEQLGMTLSAVSMQMKALE AELGVALFDRSVRPPRLTPV AAAVVTEAQALLLWEDRLLE LCRPSDTLVGQYKLGFVTTA AVRLLPDFLKTAQDMAPMAS FEVETGLSATLQDKVMTGQI DAAVVTDADGLPPRLSSRLL RTEPFVFAAHGDLMADGIEG LMTRDTFFHFMPDTGIGKLI ARAMLQQNRPGAARTIVLDD LEAIMECVASGLGFTLLPVP DVERYLTRQVRTVPAPAGLE RKLVLAVLRDGGMAPREAAL GALFDAAVDPTGPS >_0056.001752_ SARO_25NOV03_CONTIG28_REVISED_GENE1870 saro_25nov03_Contig28_revised_gene1870 MKRTHLPLNGLRVLDAAARH LSFTRAADELAVTPAAVGQQ IRALEDLLGVVLFRRTSKGL ELTDEASAGLDAIREGFLRF EEGVQAMQAGQSSHVYTIAC PRDFFAAWLSPRLADFRAGN PQMRFSLVGGDADVDFTEAN LDLAVRWAEGPGDLEGVSLG AATMITVAAPDAAPDSPWIG WPGDPTPEGGEAGFSVGDAG TAISAARAGLGRANVPFMLA EGPLSSGRIVALGEPQVSRR GYWLVAPPPQWRQKKVKALV AALSS >_0079.002604_ SBAL_17SEP04_CONTIG235_REVISED_GENE2607 sbal_17sep04_Contig235_revised_gene2607 MNIDVSENAIDIAFRVGEPK DPDWIARPLTTISFVLCAST SATQWHALQSIEALEQHPVI IAKPVKTWRLQHNITGQNFE FEPRGNIKLAVDDMAIASQA VVAGLGIGLLPISMASEQIQ SGKLVQILPEWQGIPRTAYL MYRDRDNLPLRVRLLIDFML ANPPEAY >_0043.000658_ JANN_22DEC04_CONTIG19_REVISED_GENE659 jann_22dec04_Contig19_revised_gene659 MHVGRENSSFQICNSSQDIR KSDILDAMSIRFRQLQAFHA TFETGTVTGAATLLGISQPG ISNLLAQLERETRFKLFERV KGRLIPTPEAGVLYQEVDTV VRGLDHVGQAVTDLQNKQGG QLQVASQHAMSFGFMPKLIA QFAKTRPDMSISFQSQYSSK VQEWVMSGLFEIGVCETPQL YDALDTHPFQVEMQLTLHPD NPLARHDILTPELCGAEPFI VMGPDHMTHRRTREAFHTAG VPWNTRVHTHLFKNMLSFVQ EDMGVAILDPFLLDHDESGS FVTRPFAPAIHMDMMVITSA TRPLSTLALDFLDLLLTGIA QSDRRRVSPSNAAPGRPPLP MAAR >_0081.002968_ SDEN_20JUL04_CONTIG81_REVISED_GENE2969 sden_20jul04_Contig81_revised_gene2969 MFELLERKRHYFSIISEYIG TIMSRAKSTLEQWRILQAVV DHGGYAQAAEKLNKSQSSLN HAVAKLQHQLGIPLLEVKGR KAYLTEQGEVMLRRSRHLTQ TVEELEQLAHNLEQGWEPSL TLGREIIYPMPLLVAALKAF LPHSRGTRVTILDTVLSGTN ELINAQSVDISICAVPPKGY ISEPLCEMEFYLVSHPSHPL AQLTQVDDDKQLAQHLQLVI KDTGTLGSSDTGWLKAEQRW TVANFHEAKEILNQEMGFCW MPRLLVEQDLTEGRLARIYL LGSQSRKAMMSLVIPNRDRQ GPAAKLLEQCILQQHKQAQS DIEQ >_0003.001302_ ARTH_26JUL04_CONTIG37_REVISED_GENE1305 arth_26jul04_Contig37_revised_gene1305 MLDVRRLRLLRELKIRGTLA EVADALQYSPSSVSQQLALL EKEAGVQLLRKTGRRVQLTP QAEVLVAHTAHLLETLEQAE ADLAASLTTVSGTVRIAVFQ SAALALMPGTLTRMAATYPE VRIEMVQREPETALHETWAR DFDLVIAEQYPGHAAPRYAE LDRLRLTGDAIRLAVPGADK GMPPVRSLEDTADLAWVMEP RGAASRHWAEQACRSAGFEP DVRYETADLQAQIRLIESGN AVGLMPDLVWTGRETSAQLL LLPGNPHRTVFTSVRRSSAK RPAILAAREVLAVTADSIAP AGPVPPEAKSA >_0014.005213_ YP_111906.1 gi|53722921|ref|YP_111906.1| putative LysR-family transcriptional regulator [Burkholderia pseudomallei K96243] MGSEIGWELYRSFLGVLREG SLSGAARALGLTQPTVGRHV AALEAALRVPLFTRSSSGLM PTDVALALRAHAEAMESTAD ALARAATSFGEDVRGVVRIS ASDVVGVEVLPPIVARLRQR HPALTVELALTNRVQDLLRR EADIAVRMTRPGQTQLIARH IGGIELGLHAHRDYLARCGT PRDAGELVRHALIGHDRPTA FIRQIAKSFPGFDRGAFALR TDSDLAQLALIRCGAGIGAC QAALAKRDPALVRVLPKAFA GRLDMWVTMHEDLRGSPRCR AAFDALAEGLDAYVDEQRAP AIARRRRPLPGTRSA >_0030.000120_ DHAF_12NOV03_CONTIG1007_REVISED_GENE141 dhaf_12nov03_Contig1007_revised_gene141 MKLEQLHYLKEAIRYRSLSI AARENFISQPSFSAAITGLE KELGVKLLNRSNRGCTPTDI CMEIMDRADTIFAMVEEIEL LASNSSYCETINIAVVLSIC EEILPQVLLEMEADKVFCKV AVASLEGEAIYPRVASGSSV LGIVAYIPKLLTADLKYTPL FEDEYVLYIGQHSPFWEQGS VSLEQMLSQPYIALGDDFAT PNSDWAKDILAFSMPKVESQ VSNLNTLKKMIINGPYVSLL PRFMVADDIYVKHNLMKPVR IDDIYLAAQFGYIENTRYKL TKNYTVFLDYFRKILTRLGY TVYKTV >_0020.002700_ CAUR_25MAY01_CONTIG925_REVISED_GENE4047 caur_25may01_Contig925_revised_gene4047 HLTTDPDLQPYSSFAQRLFY RLVESRYDGLSIAYADAVVC VSRFTQRMVEATYGRRDTVL IYDGIDTDVFVPPPGMQRRD DGLPPANGRIRLLFVGNRTR RKGFDLLPRIMDRLPEDYVL YYTGGFQGRDTGPPHPRMIP IGSPDRDGLVAAYQSCDILL FPSRLEGFGIAPAEALACGR PVVTTNVAALPEVVDDGENG FLVARDDVAGYAEKVQILGE DAALRRRFGEHGRAKVVTHF GYRPLGEGFRQLYGRLCGKG >_0018.007951_ BFUN_06OCT04_CONTIG482_REVISED_GENE7952 bfun_06oct04_Contig482_revised_gene7952 MIWRYPTARLTIGKAGNSDH FHRIYRIMSQRGFDLTQLRT FVAVAESGSVSAGAERVFLS QSSVSEQLKKLEERAGQPLF VRSKQGVSATHAGSRLLDHA RRIIAMSEAAFEDLQGRSLD GELRIAITDYYRPHDIARIL KTFSEQHPRLKLHVTVLPSA VIDSSAGDDASFDIGLSLRL VTGSARGAGRRSGAPGIVVR REKLLWVSAADSSPRPATPY PLVLLPSSCQLQRFVVKLLD EHKVPYVVSHSASGVAGLQL ALKAGLGISCLNESSIGGGV VACPASVGLPALPAVEFHLL PGRIGESERVSNARTALMRL FS >_0063.003749_ NP_251611.1 gi|15598117|ref|NP_251611.1| probable transcriptional regulator [Pseudomonas aeruginosa PA01] MMTTFRCHLGKPVKLHQLQA LVASADAGGIRAAARALGIS QAAVTRALRELETEQHLPLF VRTPSGLTFTEYGKALLIHA RLVLKQLEHAQVELDHLRGQ ASGRLCIGVTPWVALTFLSE AVQAFRERMPEVRLELFESL MAVAQPLLRDGSMDFAIGPL HGAQAAQEFACEKLLDYDTA VLVRQGHPLAGCGSIHELLE QDWALNYTGDGHDALMRELF WRHGALIDERRIVRAHSVAI LQTLVEQADMCTWVPAIIGA APPLLGRVVPLALRETFEPR RLGIITRRGGALSNPAQCFV ECLLQAIRRHARSAKKDDRR LFETLRLLV >_0048.002651_ YP_012925.1 gi|46906536|ref|YP_012925.1| transcriptional regulator, LysR family [Listeria monocytogenes str. 4b F2365] MDIENMKAFNKVAELKSISA AANELHHLQSNMSNKIKNIE KQFQTQLFFRHSNGVEPTKE GEKIYQQFKKMILLWEETID IINNEEETISIGITQSSLPM EFNTIIKEFYQQFPNKKLSI VSGSTSELIPKIANRELTIA YVAELEKENLFQDSQIISQT LSWDKLVFAGNTAGKSVQKI LAEERLYVFSKQCYSYRALA ALINDINIPNVSISEINIPE TLVEICNNELGIGIIPESIA LNYHFLNYETLPTEYASLRK TLIYHADHTISNGEKWLIEK SKPAFKS >_0018.000814_ BFUN_06OCT04_CONTIG480_REVISED_GENE470 bfun_06oct04_Contig480_revised_gene470 MTPTLPDRKQNPPSAPAATV SLRLLEIFMLVAQEGSVSAA ANRLNLTQAAVSQAITVLEQ ALDVKLLDRSVRPPLLTLRG STALQYAREILAKVHEFEDA MRYSGSGRVPLLRIGMLNSF ASTAGAFVLNQLRDIAAEWT VASGFRETCIQALLDRQSDV IITTDETPAPSEIEVFPIFS EPFVAVVPASFDGKTERLQD IADKLDFIHYGHDSHMGSKI TNYLKKIGTVPARRYQFDTT DAALHMVAGGFGWAIVTPLI YLKSRVEGSGVRVVPLTRLP IHRTFIVGMRRGEGSDIAQR VRTAAIITLRDVILPQIEAV LPEASKGVSVAEIEPKASTG KRKGSRRDSAGS >_0102.000714_ NP_111391.1 gi|13541703|ref|NP_111391.1| Predicted glycosyltransferase [Thermoplasma volcanium GSS1] MQNNESIKITAFGDYKLDEI PADIRNEIIYYYRLSRKVLL GLYNKSIIFVLPSIVEGMPS PPLEAMACGCAVVVTDNGGV NEYIKDGLNGIVCPVRDSYC LYQKVILLINNKALREQMIQ DGLETAKEFSYDNMNKNFIR LIEEVQRRKS >_0095.004355_ NP_459744.1 gi|16764129|ref|NP_459744.1| transcriptional regulator, lysR family [Salmonella typhimurium LT2] MYNATYINETHPMLIRLQLM QYRHHLELTWLEDCLALKET LNFSKASASRYVTQPAFSRR IQSLEEWVGTPLFERSKRGV TLTKAGEVFTDQLPELIHSL YTLKSDTLEAAGNKQPSLVF SATHALSFSFVPHLLKQSDK IAKFGSFRLLSDSLNACEKM MRQGDSQFLLCHHHPHMHLN LNKNNFMSIRLGFDTLIPFS KPDSETLKPLWNINNKIQFP YLSFSSQSGLGRIIANTASI NRITHNINVAFVADLAATLL AMVRSGDGVAWIPQSLARQD IEAKTIVTAAEKESNLWVPI EIRLYRPAKRMPPDAEELWE IFVEEQI >_0016.000662_ YP_440334.1 gi|83718200|ref|YP_440334.1| chromosome initiation inhibitor [Burkholderia thailandensis E264] MTIDPKQAAALVAVADTGSF EQAAARLHVTASAVTQRVRA LEMGLGTPLLLRTRPCRPTA AGQRVLQHLRRMALLQADLQ AELEAERGSTIAVAIALNSD SLGSWFLPALTSVLAGERML FELIVEDQDHTFALLESGMA IGCVTTQSKPMRGCFSTPLG TMRYRLIAAEEFAARWFPQG LTRESAREAPVVAHGRRDTL QSSFLRDKLGLPDGAFPCHY VPGTHAHFAAVRHGLGYAMV PELLLGAVPLAEQRLVDLAP AHSTDVSLYWHAWTVQSPKM ESLSSRVVEAARQLLAPLPR GAATAAAHAAPARVKRANAR NTR >_0006.004007_ NP_890102.1 gi|33602542|ref|NP_890102.1| LysR-family transcriptional regulator [Bordetella bronchiseptica RB50] MDRRVSLRHLRCFLAVVETR SFTLAASRMFLSQSSLTAAI QQFEEAVGLKLFERTTRHVE LTDAGRHFKAEAERVVHGFD ASIRDLKAFGQGGRMHVHVA SAPSVVQALLVPAIPHLKAS FPHITFTVRDADATRIERMV LNGEIDFALTSRHIAFDELE YVPLLRDQYGVVYRSSAFRL EGEGPVRWSSLAAEGYVQFT PDTALGAMLAALPAAARLYD EQRDAVSSSTTLYAMLELPG TYSIVGALSAGTGPFPEFDF RALVDPVLTREICLVTRRLR YMSTSARRILQALLEVFDRI ALPQGVELLRRDARSDPGSQ AATDGAGPSSPTL >_0115.003225_ YP_001336267.1 gi|152971158|ref|YP_001336267.1| putative transcriptional regulator (LysR family) [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] MKTLQYSFAQIEAFATIAET GSLSQAAIRLAKDRTTLRDL LDYLEDALGYRLFSREGRSL TLTAEGEQLFRQAHLLLRQA QAFESFAQTLPQTAGQALRL VYDPFVPREFLCALADNLAR RQIRLSCWSASRREAEQALS DGVAEMAICQANNRTLGSEM EWRALGTVDLRFYAADSLFH DAPRPLTLLNLSLTPQLVMH RRSDDQIARRLQISGHTLYM NEITLLRHALEQGRGWGFLP DHLRPGEWQGVSEIATEVGS QGLNVTMVMLWLPGMNKHRM LSDIVHEAPELWLRR >_0045.001254_ LBUL_20SEP02_SCAFFOLD73_REVISED_GENE1875 lbul_20sep02_Scaffold73_revised_gene1875 LSEMALSESGKLSIGVAEIV AHKFITDVIKDYRQVNPEYR VDFNLKRAVTGKLMDMLQSG ELDLAIVTTRPGEKIAGFDL VSMMHRKLELLLPAGHELAK QKSVSLKQIEQYPYLAFGKE HALAGIIADVFAKNDIHPQI SSVTDDVRTLVAIGEGVALL PKSKYNESAGDIAVPVTEDT SYTISLASRNLAQEPKVAQK MASFIVDFCQTRGLTWS >_0070.000759_ PPEN_30JUL02_SCAFFOLD2_REVISED_GENE587 ppen_30jul02_Scaffold2_revised_gene587 MQTKTEKIFSSKTLSYFLRL TDTMSYTQSAQLLGITQPAL TQQIKKLEHAVGAPLFYTLG KKLRLTDAGYTMLNATHEIN RILNHATDEIQQATSSGRGD ISVGILASLETRIFEDFIAN YYNNVPEINVTVHMLTRFEI WEGLESNRLDFAITYLPDAN IRSWKPYRARKIGTERLVFV HHDERLSKARGVSLKRAAAF DWVSYPEGYYLDDLITEVFK DAMVTKPKSVAYFTTPAQIL SFSNSTGIATALPESFVVAH QDEATDAYITKFDPKISYEI GFVYRKDKDKIPRMNAFLSA FFEYMDGEKYIDKVRNLTTK KDD >_0073.006185_ YP_298080.1 gi|73537713|ref|YP_298080.1| L-carnitine dehydratase/bile acid-inducible protein F [Ralstonia eutropha JMP134] MGRCSFGRRLKSISGRTGSA VRNQDCPRCAGLPLSKPFRL TRSGTDMTPPLAQTTFSRNA PQDRGPLAGVRILDMATVVA GPFSATLCGDMGAEVVKLEL PDGSDPLRSLAPVKDDVPLY WKVTNRGKRGITLDVRTEAG RELFLRMLGEFDVLVENFRT GTLARWGLDLATLHAANPRL IVLRLTGFGQTGPYAARPGF ARIFEAMSGLTNLAGTEESG PMHMNFPIGDMIAGLFGAFA ISTAIAERRANPELRGREID LAATEALFRLLEPLAVEHEQ LGVVRQRAGNRATYTAPSNM YRTADGVWMTLVASSDATFR RLAEAMDQPQLPLSPDFAVN AARIRNLERLDALIAAWFAA RDADAVSAALERCDVPFSKV FTIADVMADAQMQARSAVIR MPDPDVGSVPAPCVVPRFGG YQPPAPRTGPATGEHNDEFY TELGLGKDDLARLARDGVI >_0013.002134_ NP_880585.1 gi|33592941|ref|NP_880585.1| putative transcriptional regulator [Bordetella pertussis] MQETDSGAGPRTLRRGLMVL AALRDQGPRGLSVTDIARQT GIQRPTIYRLLAALLDAGLV VPLQGTKKYRTQLAADADLA APNPRVRQMLPVLRRLADRT GDAVFLVVRDGDDSVSLHRE IGSYPVQILATYAGKRQPLG VGSGSMALLAALPDEVAHAI VQRNSGRLDEYGGMTPQEMH RLIENTRARGYSVVGNHAVR GALGVGCALLDAQGAPVLAV SVTAIIDRMPAQRQREIAGW IGAELARLAPKA >_0030.003364_ DHAF_12NOV03_CONTIG875_REVISED_GENE3848 dhaf_12nov03_Contig875_revised_gene3848 MLARFGALHPGVALEVINDV RVFNLARREADLAFRFGSFA QENLIERRVGDVAYALYASE AYLAEHGRPDPADGFDGHAL VLMDHAAGAVAHEAWLPPLA PRARVALRANGLRAHLSAVR DGAAMAVLPCLLGEREPQLR RFDLAPQPVVRAVRVGFHSD MRQTPRLRALVDFAVAEFER QSERLCPPDLRR >_0093.002910_ NP_341669.1 gi|15897064|ref|NP_341669.1| Competence damage protein (cinA) [Sulfolobus solfataricus] MVSIVNIFMDYWFAEIITIG NEVLSGKTVNTNASHIGRRL TSLGFTVRRITAVMDEVDEI ASAFREAIDRKPRVIVSSGG LGPTWDDKTAEGLAKALGVN LELNKTAFDMILEKYMTRKI PITEERKKMAYMPYGAIPVE NNEGIAPGIYVYHNNIDILA TPGVPREMENVLENFINKML RNRSNLKYLEDFIYVENVME SSLAPYVKELVKKYDIYIKT HPKSYEMSHPILEIQIAGSG KQEEEIKVKIEKVKFELLDA IKKLNGIIRNSL >_0052.000186_ NP_103224.1 gi|13471658|ref|NP_103224.1| probable transcriptional regulator [Mesorhizobium loti] MLDAADASAPQEARTRINTM QDFRVRKAIQLMKANVCERI SFDDVARSVGLSRPHFFALF KEQTNLTPNVYWNTLRMEEA VRQLQWSQEPLISVACNLGF TTQGNFSRFFRDHVGVPPTL YREAARANA >_0030.002452_ DHAF_12NOV03_CONTIG1080_REVISED_GENE2791 dhaf_12nov03_Contig1080_revised_gene2791 MSVQTIEAMAKWVEDNIAKN PTLTEMSAYVGYSPYYCSAK FHEHIGMTYKQFLARCRLKA AAGDLANTNDKITEIAFRYG YSSSESLTRAFVAAFKCSPS QFRKSSPDIPVSEG >_0113.000641_ YP_001087995.1 gi|126699098|ref|YP_001087995.1| putative transcriptional regulator [Clostridium difficile 630] MMVLNKDIGAKIKQLRTQKQ MTLKDMSEKTNLSIGFLSQL ERGLTSVATDSLGKIASVLD VELTYFFMKPKEHKRAVLRS YEKEVFDVENSTFIHYHLSS SLKEKTMLPRLIEILPSKSS EEICCYVHEGEEFVYVLEGT LTVFLGDEQIEMYPGDTIHY NSEKNNHNWVNYTNKVVKIL VVSIPNPFEKSDAVKEA >_0012.001992_ NP_882815.1 gi|33595172|ref|NP_882815.1| putative dehydratase/racemase [Bordetella parapertussis 12822] MNLDFLKGVRVIESSAFIAA PLAGLTLAQFGAEVIRLDMT GGGIDYERMPRMPDGTSLYW TGLNKQKRSVALDLRKPEGR DLARKLVCAPGPDAGILLTN IGVPWLSHAALAEGRPDVIT CTIEGNADGSSAVDYTVNCA TGYPHATGDGREPVNSPLPA WDACCGYQAAMAVVSAVLRR RQTGQGAELRLALSDVAFAL MSHLGTLAQAELLGEDREPL GNHLYGAFGRDFVTRDGNRV MVAAISKGQWQSLVRTCGLA EAVAAIEARTGAKLAEEAQR FAHRDAIAAACEPWFRARTL AQAKTALDDGGVCWGLYQTA TQMLARDGRAGSANPLFEWI ATAGAGEHWALGTPVREPRA TRQPTQGASRLGQHTDQVLG ELLGLSARQLSDLHAAGVVA GPSGDPRGAH >_0118.002178_ NP_266533.1 gi|15672359|ref|NP_266533.1| LysR family transcription regulator [Lactococcus lactis subsp. lactis Il1403] MFSLFETFIVVYETKSFTLA GKYLFISQPTVTVRIKKLEE ELKSTLFLRGKHQEIIPTEA AILFYPKAIAYLKKWEEDQA EVQKKSLAKHPFKIGVSHSA ALSIMPGIFKVFEGELEHLD VEINMYDSEKVFELVANHDL HFGIIERLLISDQTETFPLF LDELVLAGNTDSETFFTRES GSGIGYYIKRYLKSAPSAQR NIVRMNSNEMIISHIKAGLG SSLISKSFLTDDIPYQELGS NYHREFLGVSFDEEKDPMIQ KLISKIKKETEIH >_0018.006957_ BFUN_06OCT04_CONTIG482_REVISED_GENE6958 bfun_06oct04_Contig482_revised_gene6958 MRRSTPSHTAADPVKEKDGS GEEVTALARGLTVLRAVAAA DAPLSNRDLTELTGIPKPTV SRITATLVGAGFLFRLPDSE RFVLTSSVLELSNGFLRNFD IRARARPFLIELAERTALSV HLAVRDRLDMVVIDAIRPRS AVLVSRLEIGSRMNLSRTAI GRAYLAALEAPEREKLLIGL QAAEGDDWGHVGNRLDSALQ ETIERGFAIATGEWYDGLNA IALGFTGPSGERYAVNCGGS ADQCPRDWLITRAAPALLEC VANIVLEIGGTPGRRLDT >_0011.004359_ YP_106473.1 gi|53716160|ref|YP_106473.1| transcriptional regulator, IclR family [Burkholderia mallei ATCC 23344] MTNQTDGGVAAVNRALAALV AFGEAPGGLTLAQVSEASGV NMSTLLRMFESLERFRFIKR LNDGRYVLGPAVFQLGMMYR ESFQLREHVMPVLDALGAET GETTAFYVREGDQRVCLFRV HPRRAVRTYLREGDRYPLDV GAAGRVLLAFSGARGGSFDK TAAQGYAVSLGERDPDSAAI ACPVFGVGRALIGALSLGVP RFRFNKKVQADYLPRVQAAA NALTHALGGDLPPAAGVAQV VDLFGAAHE >_0056.000414_ SARO_25NOV03_CONTIG24_REVISED_GENE445 saro_25nov03_Contig24_revised_gene445 LRIREVSCLAQCLCKNARHL NGNWVRHMEWSDLEVFLAAV RTGSYTAAGRQLGINRTTIG RRVEALEKSLGLPLFEKNPL GYAPNAAGARLLATAEAVER EVAAMLHDIGGAARQSAPVR IASSGGIASEFLPEIAAFRR ANPDVPVELLGELDPLDAVT QRRADLGIALVRSLPLRLAG TQVATLSQAPYGRRHAGALQ SLGWGYEFDAALPGGPWSAN PAGEAAQAAGLVTFNAWPQL KQAVMAGIGKATLWCFAADA EEHLERLAPPDPRHDCPLWL VHRAKAPPGPGLARLIAFLD NAIAARCDGKRAATP >_0120.002791_ YP_001303560.1 gi|150008817|ref|YP_001303560.1| glycosyltransferase family 4 [Parabacteroides distasonis ATCC 8503] MITCFFRKKREGVNSIEMVF STIESLLPLHTSIQLPYEGA SPKVLFGNILFAHRNKAKIN HITGDAHYIALGLGRNTVLT VHDVQSALQINNPLKRLYVK LFWFWLPALMVRRITVISEF TKNELSKIIPFAKNKILVVH NAFNPTIKYVKKIKDNRPVI LHMGTKPNKNLERVVEALKG MDCLLIIVGKMSEKQLLLLE SSTIDYENHYDVGYEEIVRC YQRCDIVSFPSVYEGFGVPI LEANAAGRPIIAGDIPVLHE VANDAACFVNPYSVDSIRSG FVKVIEHDEYRKELIAKGLK NIERFSPKAIAEKYNEVYKE LSDE >_0078.003353_ NP_828217.1 gi|29833583|ref|NP_828217.1| putative transcriptional regulatory protein [Streptomyces avermitilis MA-4680] MNSVIQNIGRNVSDLDLLTQ SLARNVKRWRTERGFTLDTL AARAGVSRGMLIQIEQARTN PSIGTVVKIGDALGVSVTTL LDYEQGPKVRIVPADQAVRL WHTDAGSYNRLLAGTEAPGP LEMWDWLLMPGEGSPSDPHP NGSVELVHVTAGELTLTVDG VVHHVPTGASVSFEANVPHR YANSGDAPLEMIMTVSVPPV R >_0099.002226_ TFUS_04MAR05_CONTIG93_REVISED_GENE3001 tfus_04mar05_Contig93_revised_gene3001 VHLAHAVHDHRFGETAVKRE LRAAGITDPQLCAAYLHCRT LHAHHGRTYYLATLTLPPQR RPAIHALYGFARWVDDVIDA PRTALPVEARAALLSEVGTE LTAALAGGEDVHPVVRAVAD TARRYEIGAELFTAFLRSMR MDLSTTDYASFEDLRGYMYG SAAVIGLQVLPVLGTVVPLP RAAPHAAALGEAFQLTNFLR DVAEDLDRGRVYLPADILAR YGVDRDLLLWCRAHRRGHPR VRDALAHLVALNREIYRTAA PGIAMLDPVSRPCVATAFTL YQGILDVIEASGFDVWSGRC TVPQWRRVQVAVPAFARAML RRAVSGRRANSGVAPRMSGA RSYGGN >_0035.002793_ NP_816744.1 gi|29377590|ref|NP_816744.1| PTS system, IIA component [Enterococcus faecalis V583] MLGIVIATHGALSDGAKDAA TVIMGATENIETVNLNSGDD VQALGGQIKTAIENVQQGDG VLVMVDLLSASPYNQAVLVI NELEPALQKKIFVVSGTNLP MVLEAINHQLLGTPIAEAAQ AIVAQGKESVQAWDISMTSF EDEEDEDDDF >_0085.001193_ SHEW_20DEC04_CONTIG138_REVISED_GENE1194 shew_20dec04_Contig138_revised_gene1194 MDVKVFKTFLEVARTRHFGR AAENLYITQAAVSARIKQLE SFFDSALFVRNRNSIQLTTS GERLVPYAEVMVSTLEQAKN ELALTNHKALQLTMAGTPNI WDAYLQHCLSKVTDAFGGYG FLAEALSREQLNRNLLERTL DMAFAFDPLKSEELVCKQVA DLILVLVSTRPCDQSEALSD KYVYVDWGTRFASEHAERHY RMPPPYLRTSTGRIALDFIL DKAGAAYLPLSIVEPFLQSK QLYLVEGVEPWNRPIYLSYR KDSGSLEAIIKIEELVKEID PLTAFSLQQIGQLG >_0109.000794_ RER070207000795 REr070207000795 MKVLNLLSAGGIGGIEQLCS NIAKYANYDNTFCFMFEDGQ IYKEMKKSGADVISFAECSS KKISKKRWESLCELAEKADI IVTHHCTIALHIYYCALKKR FKNKKFVMTIHSCFDPELNY NYGSWIKNKIAKWNIEEALK ISDKIIFVSEAGRRSYLDNF NIDVTKTKVIYNGVEVSAIS IDSIKLENNYTRITYIGRVE KIKGLDLFVCALKKLLNDDS NIKVWIIGDGSFRNELEMLV QKLDLTGVIEFTGAKRNIGD YLRKTDLFIYPSTCQEVFGI SIVEAMSYGVPCVANNVGGI PEIIENDYNGFITSKTNDDE LYKCMYKFIKLDPKLITQMR NNCLLTAEKFSINCTIDNLE KTLKGMM >_0073.002805_ YP_296195.1 gi|73541675|ref|YP_296195.1| regulatory protein, TetR [Ralstonia eutropha JMP134] MSQTDSGRTESARTESGICP PKTPRGQKTRESLLRAAEKV FGEKGYYVASISEITQEAKV AMGTFYLYFKDKEDVFRALV QHMLELLRTHLRKHVAPATS QIEAERLGLKAFLSFVSRHK NLYRIVLDSYSVDETIYSGY FQVFADLYSRRLSRAAEQGE FRPGDAEVRAWCLIGISNFL GMRYALWKRPASMEKVVDAA FDMITHGLQAR >_0072.001618_ PROC_21JUN05_CONTIG39_REVISED_GENEPMN12A1623 proc_21jun05_Contig39_revised_genePMN12a1623 VKTSKEITINKKNKFGAEIL CIGSEILLGNIVNTNSQWIA AQLAILGIPHFRQTVIGDNP ARLEEAILEASNRSEILITT GGLGPTPDDIKTKVIADTFK TPLEQRNDILIDLRNKSKDK VSKLSESQKKQSLVPKGAKI INNYSGTAPGIFWSPKENFT ILTFPGVPSELKEMWAKEAS KLLISNNLSKEVISSKVLHF AGITESLLADKIQHLLISKN PTVATYASTGSVKVRITARG KSSEKTNRLIEPIKKELTQI TGLKCFGLDNETLEEIVFKL LLKRKETIAVAESCTGGGIG SKLTKIPGSSQIFHGGVIAY NNSIKQRLLGVPEEIINTHG AVSKQVVESMARGVQIKFKV NWAISVSGIAGPTGGSKSKP VGLVNFCIKGPKTLITWEEN FGSNKTREDIQKLSVLNALD RLRLSIIMAN >_0061.003249_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR0003 npun_22dec03_Contig1_revised_geneNpR0003 MKRPGFIERHHLWTEIQKEA SEKIKTIVKEQDLLLIRTAW SDQHGIVRSKSLLPQAFFSA LENGMQISTGTFLLDTGGAI VFNPFVPGGSLDMPLMTGAP NLVAVPDPDTFKIVPWAKRT GFILCDEYFQNGQPMPFSSR GILRQSLIDLHQRGLEHIVG LEVEWYLAKLEDPMLAIANV GSSGKPGEPAKVSAVDHSFQ YLLESHNPEIQDLLGLLAEN LVEMKLPLRSVENEGGVGQF EFTFDPIPALQAADTMMIFR MATKQICRQQGYIASFMCRP GLQGSSSNGWHLHQSLVDLT TGENAFMSANSQTLTSDLAK HFVGGLLKHANAASVFTTPT INGYKRFRPYSLAPDRAGWG LENRGAMIRLQGGFDDPSTH IENRVGEPAANPYLYMASQL ISGLDGVDCKLDPGSPTEEP YTTENPILPKSLSEAISYLS QSELFPQKMGQQFINFIIKM KESEINRFLQSVADQPPEEY LKQVTNWEQREYFELF >_0086.000526_ ROSE_TM1040_30MAR04_CONTIG46_REVISED_GENE527 rose_tm1040_30mar04_Contig46_revised_gene527 MHTFTQIDLASGLWEIFGKI RKNMSETTGKESSFSADTHS AVLDEGEKQRIRLELGRRVK GLRSAAGMTLEQAAERTRLA VSTIYKIENGKVSPSFENLL RLARGYGVGLEKLIAEPEDE VHTTRLTVTRAGQGRSVEGK VYDYEVLCNGLTGKKIIPLI GTICAKGEGVPAHLDRHDGE ELLLVLEGEVELRVEHYEPV ILAAGDCAYYDSTLRHGVRS TGPDKARVFWACTHLGALS >_0003.001719_ ARTH_26JUL04_CONTIG41_REVISED_GENE1724 arth_26jul04_Contig41_revised_gene1724 MADSRASRTSDGLGSASPAP AVTRAAAVLEALAASATGRL TLSDLSRELGIPKSSTSNLL LALEEARLINRQGADFTLGR KLVELGAAYLSRLDEVQEFY RFCEQAPTLSGETVRIAMLD GTNVIYLARYEGHPAVRLTS NIGDKMPVSLCAVGKALIAR LHDHDIDELFPDDAELPVLT PKSLRTGAEFKKQLPVIREQ GYAFEDEESTTGVVCLAVSV PTRGAHGPSLGLSVTALKAT YSQEQGAMMVKELKELARSL GNPMG >_0110.001926_ YP_001297727.1 gi|150002983|ref|YP_001297727.1| putative RNA polymerase ECF-type sigma factor [Bacteroides vulgatus ATCC 8482] MNQTTHKMTDPIQIKHYKED ITSFNQLYKEFQRRFVRFAN TYVRDLTTAEDITIEAMMYY WENRQSLSEDSNIPAYILTI IKNKCLNYLRHQQIHEEYSD KIKDYYEWELNTRIATLQAC EPYELFISEIQELVQQTLTD MPEKTRTIFMLSRYENKSYK EIAVLMNITPKGVDFHINKA LKMLQTNLKDYFPLFLYFFM KCH >_0088.003349_ NP_718851.1 gi|24374808|ref|NP_718851.1| transcriptional regulator, LysR family [Shewanella oneidensis MR-1] MISSSTVHAFYFSAKYSSFS RAAEILETTQPNISGQIIKL EKDLDTILFIRKQGRVSLSK EGEKLYLVAEKIVNSYKEFD SVVEYFFEEEPIVIVTQPRL YTKYVAPYLTDDLINNNLFH IKTGELTAIKSWLDKEDADI VITEGLFDSHSRYINEGVID VLKFSWAKKVNEQFDSPCPT LVHSKVWDHWHDLSSALNRN KEFKRKLIIDTPDFAERLIE SGVGIGLLPNVMISNNANLE VCHGPTRNNVLGPICLYSKK YSQHKNLRSLVNMFKKESSD S >_0054.001352_ NP_987476.1 gi|45357919|ref|NP_987476.1| Glycosyl transferase, group 1 [Methanococcus maripaludis S2] MLKKRILVIAPHYNSFVKGQ VDEISKSVEHIDLLIKYNPL TEISNYIPHKNSKHFQNFRK KNLATFKDKPSNVSVHLIPT VYFKPDGKNKKLGDMLFKKF DKYIQKNNLNFDLIHAHFTW PYGYVAMKLKEKYNKKVILT VHENRNWLIKEYESKNKKFI TTWKNSDVIIRVNKKDIPLL KRYNENTVNIPNGYDEIIFK KIENIDKIKEDLNIPKNKKI ILNVANYVIPHKNQLNLVKA VYELQKKRKDFILYLIGNYT GDEKKIINLIDELNLKDVVK VLGPKPHDEIPLWMNVADLF VFPSYSESFGVVNIEALACA TPVISTINGGSEEIITSEEY GFIYTNPEDYEKLAELIDKG LNKKWDSAKILEYSKEFTWE NISEEILNLYSTDI >_0084.003399_ SFRI_16AUG04_CONTIG89_REVISED_GENE3402 sfri_16aug04_Contig89_revised_gene3402 MKVAIVIHDLKGGGAEKMMM RLANAISKQHHEVDLVLLTN GGTNKDLLDNGVNLIELNSL RTASSVPRLRQYLKINSPDR VLSALTHVNVITFIACLSLG WLSRLHCSERNAFSFDKDVN KSPLIKFAYFLAPFLYRISP NPVIAVSLGVALDLIETTVV CPKNVINLPNPTLNDNYKTH IFLAPSHPWLSDKTKPVIVG LGRLAQQKGFSDLINAFALV REKIDSRLIIFGEGELRIKL QTQIDNLGLTDSVSLFGYVR APMDEVHSADVFVLSSLFEG SPNALVEAMASRCKVVSTRC PCGPDEILIQGALGILVPVK SPNKLAEAIICSLLDADYDF ENQLDKIERFTANNSASAYL SAMGIHGV >_0084.002641_ SFRI_16AUG04_CONTIG85_REVISED_GENE2644 sfri_16aug04_Contig85_revised_gene2644 MQHLNYNHLYYFWMVQKKGS VTKAAEALCLAPQTITGQIR ALEERLKGTLFKRVGRNLVA TELGELVFRYADKMFSLSYE MLDLLNYQKDKSLLFEVGIA DALSKALVSRVLLTVIPDDS SVHLACYESTHESLIERLRE HKLDMILSDCAGGSLKFPEI LSKKLGECGVSFFSAETISV PFPACLEQRKLLIPGKRTSL GQQLHGWFAEKNLNVSILGE FDDAEMMKAFGYFNRGIFVA PSIYRHDILSQGMVLLGETT DIKEEYHVMFAERMIQHPAV KSLLATDFSDLFAGRDLQVQ DFENRIS >_0046.002543_ NP_472028.1 gi|16801760|ref|NP_472028.1| hypothetical protein lin2699 [Listeria innocua Clip11262] MIKLTMLSSAEKVKGQGVAS AYRELVNLLEERYTNEIDMK INSFEKSDITHYHTVDFRFF LSTFFKKKRGVRVGYVHFLP ETMEGSLKLPWIARVVFYKY LISFYKRMDEIVVVNPSFIP KLTAYDIPEEKIHYIPNFVS KKTFFPISTAEKKLAREKYG IPADKFTVIGIGQVQHRKGV LDFIEVAKQLPHIQFVWAGG FSFGKITSGYEELKKIYDNP PANVKFIGIVDRSEMNACIN MADLFFMPSYNELFPMAILE AMSADVPILLRNLELYEEIL TGYYVKEVDNPGFIRAIERL EHDTDYYNEMLQAAKEGATY YSEDRLAKIWFAFYQGLLTK E >_0043.000072_ JANN_22DEC04_CONTIG12_REVISED_GENE100 jann_22dec04_Contig12_revised_gene100 MNTRLRAVFCDHLSIMRGKY LPGSKIGDDDTRFCRSVFGV HYDKDLLPAPGSMMMEGLPD MELRWREEDIRDSWEADTRI VIGDLFDTDGTRLPLCPRGA LKRTVADWQARGLTPKVGIE LEAFAFVHDADGKLIPYDSF GGVVYGTGAFTDPRGFNDAI WEVADALGFRLDMITAEYDS PQFEYTLTFDDAVQAVDDIV LFRQMAREVALGEGVILSFL PKPIAAAGGNGMHINFSFTD EAGANALSEGGQSGPDHLNY LAAGCVAGLIHHHKGLAGLI APSGNSYDRLQPASLSGYWQ NWGGDHRNVTTRVSSEGGAK ARLEHRMADAAANPYTAVAA VLQAARLGVEHGYELPPKEA GDGFEHTAAEAGVAPDLATA LEDLSADTLLAEAVGQGLVE NHVFMKTAEVEKTAGLEPEA LRDFYIPYL >_0038.000321_ NP_280449.1 gi|15790625|ref|NP_280449.1| phytoene synthase; CrtB2 [Halobacterium sp. NRC-1] MVEKTHITTSKAIQRRTGKT FHLATRLLPTRVRHATYVLY AFFRVADDVVDTTADRDPGV QREELEGIRAAALGARDPES VAADTEVLAAFRELATRHGI SDEDIHTFIDAMQADLEKTR YESHAELEEYMRGSAVAVGY MMMDVMEVAEPETAAPHAAA LAEAFQLSNFLRDVAEDVHE YDRVYLPAESRADHGVTVEQ LRARTVDAGFREAMREELAY TERKYRTGVAGIEYLPEDCQ FAVLVSAVLYADHHRAIRER DCDVLTATPSLSTPRKLWLV AKTRALWALNSSPEAVFYRA TGLAETGDSRRRHGDPQPTP SR >_0034.003428_ NP_416752.1 gi|16130184|ref|NP_416752.1| orf, hypothetical protein [Escherichia coli K12] MLKVEMLSTGDEVLHGQIVD TNAAWLADFFFHQGLPLSRR NTVGDNLDDLVTILRERSQH ADVLIVNGGLGPTSDDLSAL AAATAKGEGLVLHEAWLKEM ERYFHERGRVMAPSNRKQAE LPASAEFINNPVGTACGFAV QLNRCLMFFTPGVPSEFKVM VEHEILPRLRERFSLPQPPV CLRLTTFGRSESDLAQSLDT LQLPPGVTMGYRSSMPIIEL KLTGPASEQQAMEKLWLDVK RVAGQSVIFEGTEGLPAQIS RELQNRQFSLTLSEQFTGGL LALQLSRAGAPLLACEVVPS QEETLAQTAHWITERRANHF AGLALAVSGFENEHLNFALA TPDGTFALRVRFSTTRYSLA IRQEVCAMMALNMLRRWLNG QDIASEHGWIEVVESMTLSV >_0024.003333_ CHUT_08NOV04_CONTIG199_REVISED_GENE822 chut_08nov04_Contig199_revised_gene822 LNIAVNTRLLLEDRLEGIGW FTYETLKRITEQNPQHTFHF FFDRPYNDKFVFGKNVVPHV LFPQARHPFLWYIFFEWSIP FMLRKVKADAFISTDGYMPK SSKVKVLNVIHDINFEHRPQ DLPKRVANYYKKNMPLFAQK ATRLATVSEFSKQDLVKTYN IPADKIDVVYNGCNAAFKPI PEAEQVKVRYKHSAGRPFFL YIGSMHPRKNILNLMKAFEV FKKMTNCDMKLLLVGKAMWS NKDIQSLYHTLIYRHDIHFL GHIKTAELARIMASAHALTF VPYFEGFGIPILEALNCGVP VITSNTTSLPEVAGKAALLV NPESSEAIANAMIQIYKAPH IREKLLAQGVIQRQKFSWDK TAALLWASFEKMMR >_0021.001260_ NP_421916.1 gi|16127352|ref|NP_421916.1| Rieske 2Fe-2S family protein [Caulobacter crescentus CB15] MLGASQPYDLGPGIRRGSRG KGGAMTQIPNHDPLSDWGLP GWIYTSERFFKEEQDKVFRP SWQIVCHLNDIPKAGDFHTF DFIGESLVVVRSKDGGVRAF ANVCRHRGARLLDGPVGRCG GRIVCPYHAWTYDLEGRLIG VPMRDDYPALDMAKEGLATI EVEVWRGFVFVRLEGDGPSV ATMMAPYEDEVAHYRFEELQ PFGRVTLRPRAVNWKNISDN YSDGLHIPVAHPGLTRLFGK GYGVEAEAYVDKMWGQLIDE PSESPSERLYQDILPDVAHL PGDRKRLWTYFKLWPNFAFD IYPDQVDFMQFIPISAEQTM IREIAYALPDDRREMKAARY LNWRINRQVNTEDTELVARV QQGMASRTFTAGPLATSEVS LRSFGRKMRALIPESRMPRP PEGW >_0016.000389_ YP_439806.1 gi|83717696|ref|YP_439806.1| transcriptional regulator, IclR family [Burkholderia thailandensis E264] MTAYFPYPCFHPTHPIDNMN ARKPRPAGPADEPQDLDDAD RYRAPALDKGLDILELLSER KDGLTRTEITKELGRNASEI YRMLERLVARQYVIRSPGGD RYSLSLKLYALAHRHPPMHR LIAEALPPMQRFADAAEQSC HLSVYDRGNLLVIAQVDGPG IWGMSVKLGSRVGLVDTASG HAMLAFQSAEQRAHMLAEHT KVKGEQPLAARNLDARLDAI RAAGHVQQDSRQMFGVTDVT HPILGPAGHAIAVLTCPYIR RIDAYVAPPLDAVVALLHDT AAGLSMVGEAAV >_0108.0001294_ YP_460432.1 gi|85858230|ref|YP_460432.1| glycosyltransferase [Syntrophus aciditrophicus SB] MMRLLCFIDYLGSGGAQRQL TFLARHLKKSGVDVEVLTYH ESDFFIPVLAEAGINVETVR GGGRFTKVFSLRKSIRARKF DVLLAFLNAPALYAEIAALP RRRWGLVVSERLAVPGSSMG FARFRRYLHGLADYVTTNSH TNRLLIEKAVPGLTGKVVTI YNALDLEYFSPLQGPVPAGD GRLRFVVLASHQLKKNLLGL VEAAHQVVQAAPELDFTIEW FGRFDAGKNGPGDTGPFEQA KRRIETFRIQDRFIFSEPTS DVAAVYRRADALILPSFYEG LPNVVCEAMACGRPVLMSNV CDAENLVREGDNGFLFDPRD PADMARAILRFAALSGDDRK LMGEKSRKRAVILFNPERFA AHYRRVLESAMRREQTIIPH WCESIPQSAIRMIENEK >_0074.003512_ RRUB_10JAN05_CONTIG98_REVISED_GENE760 rrub_10jan05_Contig98_revised_gene760 MGIRVSTISRCIGHLEDEIG VALFLRRPNGVTLTFAGEQF LVRARRAMTEVRHAICEAGI AGTGRNGFVRLGLLSSLAAP FPAMLIKIYCAAYPGVRILY SEGGAADHRAALQHHRLDLA FLPHPEKAEGCDALGLWEER LFAAMAQDDPLTRHDVVTWD DLRERHFIFSEVPPGPELLD YLTERLVGFGFLLKADLLTV YRDTVMRVIANGSDVTVIGE ARVSHPIQGVACRPITGESL FFHAVWLPTNDNPAFRRFLS LAKVLSKQCSVCELKEGLAG PSGEDHSVMDETSASLERKS RLLKIVSERCATCLLKKAIA D >_0052.002647_ NP_107697.1 gi|13476127|ref|NP_107697.1| transcriptional regulator [Mesorhizobium loti] MMTDGVVERSRPRDRILETA RDMFHKHGIKGVGVDAITEA AGTNKMTLYRHFESKDELIV ECLRANAAKAGAMWDAFEAE FPGDKLAQLHAWVRKAAAML NADGRGCDMANAAAELTEPD HPARLVIKELKEAQRERLVT LCRGAGIGQAELLADTLSLL FEGARVSVQTVGAEGPSTQF VRMAEGLIVSFRGTAAG >_0034.004713_ NP_415136.1 gi|16128586|ref|NP_415136.1| putative transcriptional regulator LYSR-type [Escherichia coli K12] MANLYDLKKFDLNLLVIFEC IYQHLSISKAAESLYITPSA VSQSLQRLRAQFNDPLFIRS GKGIAPTTTGLNLHHHLEKN LRGLEQTINIVNKSELKKNF IIYGPQLISCSNNSMLIRCL RQDSSVEIECHDILMSAENA EELLVHRKADLVITQMPVIS RSVICMPLHTIRNTLICSNR HPRITDNSTYEQIMAEEFTQ LISKSAGVDDIQMEIDERFM NRKISFRGSSLLTIINSIAV TDLLGIVPYELYNSYRDFLN LKEIKLEHPLPSIKLYISYN KSSLNNLVFSRFIDRLNESF >_0116.000991_ YP_193868.1 gi|58337283|ref|YP_193868.1| galactose mutarotase related enzyme [Lactobacillus acidophilus NCFM] MDYTIENNMIKVVISDHGAE IQSVKSAHTDEEFMWQANPE IWGRHAPVLFPIVGRLKNDE YTYKGKTYHLGQHGFARNAD FEVENHTKESITFLLKDNEE TRKVYPFKFEFRVNYNLMNN LLEENFSVVNKSDETMIFGV GGHPGFNLPTDHGENKEDFY FDMHPSVTRVRIPLKDASLD WNNRSLAPTDSLIALSDDLF KDDALIYELRGNDNKVSLRT DKNKFHVNVWTRDAPFVGIW SQYPKTDNYVCIEPWWGIAD RDDADGDLEHKYGMNHLKPG KEFQAGFSMTYHSTTDEVK >_0083.003067_ NP_838400.1 gi|30064229|ref|NP_838400.1| putative LYSR-type transcriptional regulator [Shigella flexneri 2a str. 2457T] MDIFISKKMRNFILLAQTNN IARAAEKIHMTASPFGKSIA ALEEQIGYTLFTRKDNNISL NKAGQELYQKLFPVYQRLSA IDNEIHNSGRRSREIVIGID NTYPTIIFDQLISLGDKYEG VTAQPVEFSENGVIDNLFDR QLDFIISPQHVSARVQELEN LTISELPPLRLGFLVSRRYE ERQEQELLQELPWLQMRFQN RANFEAMIDANMRPCGINPT IIYRPYSFMAKISAVERGHF LTVIPHFAWRLVNPATLKYF DAPHRPMYMQEYLYSIRNHR YTATIFSILLKIVTGQTINP ASSRLQLNYGVSRRRG >_0057.000943_ NP_840675.1 gi|30248605|ref|NP_840675.1| Uroporphyrinogen III synthase HEM4 [Nitrosomonas europaea ATCC 19718] MDLSSNRLAGKSILITRPLH QAGGLATWVRELGGEPWLFP VLEISDSENKQPLLDLIARL DEFDLAVFVSPNAVEKVIPL VQVSHSWPRHVLVATVGKGS ARVLERYGITNVIVPEEGSD SEALLRMPQFQVMQGRHVVI FRGNDGRRLLGDTLRERGAS VEYIECYRRHKPEADPLPLL KHWRDDGIQAVIISSSEGLD NLFDMIGETGQQLLKATPVF TAHERIERKARELGIRKIYR TLLGDEGTVQGLLEYFEKM >_0050.001908_ MFLA_01DEC03_CONTIG129_REVISED_GENE2121 mfla_01dec03_Contig129_revised_gene2121 MASTQELSNFLAQIERRAFK QTAYAVRDDHAAMDIVQDAM LKLAEKYAARPVEEYPMLFQ RILQNTMKDYWRRQKVRNIW TTLLSSLGVSPQDEEEHDPL ETMASDHAYENPEMQYEQQE TIAIIEAAIKNLPKRQREAF ILRYWEDMDVAETAAVMGCS EGSVKTHCSRAVHALAAALG QYGFAKEMLDRAGDEQ >_0117.002121_ YP_795076.1 gi|116333549|ref|YP_795076.1| Transcriptional regulator [Lactobacillus brevis ATCC 367] MSQRAVSKRISALEAEIGAT LFDRQKNKINLTAAGKHFLT RATELLNTMRMTTYEVQQFT QQAKEQFSVGYFSPFEGALL RMALLDLPTTTNFLIEEAGI EHLISDVLLKKIDCAVIIDN PLFNAKIDQTNLDSVTLVTD HMTLGLSPDLISDQTDSPAD YLAKFPVIYYSSEESTYLEE VFKSSIGQLADTFNARRVNS YEQMQLLVGMGKAISFYPTE LIKYMATPYDHIAYLSLDGV TTAHSEFKLIYHHDNHQPLI KQIQRYFDTHQF >_0013.002061_ NP_880389.1 gi|33592745|ref|NP_880389.1| probable LysR-family transcriptional regulator [Bordetella pertussis] MKLPTPVQIETICTVDRLRT FTAAAAYLHTTQSAISARVR EVEEMLGVILFKRNGRNVET TLDGRRFVEATAPLQQRLAD FMGTFLEPHAISGRIRLAVG NSSMGRVSSMLSAIEKVLPN ICFDLEVMYAAEILRDLEAG RTDIGVFHTPSRLDSKLFIH TSIGTEPTQWLMSGALRADF ERRSPGFGLQTLLDHCQIWC VPKPAFYFDQAIESIASHGG KIRRLSTSGNMPATVDILLA HGGIGLITDHLSQAHCQNGS LVAAFDGISPPGFDYVLACA RSRQSHLLGTVMEIATAAVH AGAAAPALRPVPAR >_0003.001688_ ARTH_26JUL04_CONTIG40_REVISED_GENE1693 arth_26jul04_Contig40_revised_gene1693 MRRPGRQVRLPVKKTLSDRS VYGKDPCGPADDLRGAQTDW SVYHRVMTTPAPAQQAVRTP AGPAKEKILATAFRLFYAHG LRAAGIDTIIAESGISKATF YKYFPAKDELILAYLDKVDA IWTGQLHAAAEAAGPDPAAQ LAGLFDALASACRRDGYRGC AFINAAAESASGTRVHDRTV AHKKAVLAWMQGLAAEAGAA RPDRLARSLSLILDGGLASG VLDGDPEAAITAREAASQLI TASLGNPE >_0116.001254_ YP_194266.1 gi|58337681|ref|YP_194266.1| transcriptional regulator [Lactobacillus acidophilus NCFM] MYNNELNTFLLVAKNGSFSG AAKEMYVSKNAVMQQINLLE SHLDLTLFNRTTHGVTLTNA GRVFVEEAQKILELSEQIDQ NLARYKNTIFVGSGFLNPPI LIKKIWHQFLKINPKSRIDF IEIADYENLSRNIDLFEGCY AEKPFLRQGFSFCKCTTSKL VAIMPPSDPLVKRNELRISD LKERQVVIPDDSAYKSMHEI EKFLQDKISNIKLEKYNSLT TAVVNNAQIRDEILVVPDYL GTLANPNLVKKVDWDLKIDY GLYYRPDSSELCQKLIETIK REFN >_0091.000233_ SPUT_CN32_28JUL04_CONTIG113_REVISED_GENE234 sput_cn32_28jul04_Contig113_revised_gene234 MLGAMKTETQSTRQHILDIG YKLIVRKGFSCVGLSLLLQE AEVPKGSFYHYFKSKEQFGE ALITDYFEKYQLDLDTLFNN STLTGYERLMQYWQQWLHVQ TDGCVDQKCLVVKLSAEVAD LSEAMRLALLQGSAGIIDRL TTCVQVGITDKSIAEQDPQS TAEMLYHMWLGASLMNKLGH SPAALERALVTTEAILTPKT VP >_0088.003732_ NP_719550.1 gi|24375507|ref|NP_719550.1| transcriptional regulator, LysR family [Shewanella oneidensis MR-1] MSKSLSRLDLNLLFTFQLLS QERSVSKAAKKLNVTPSTVS KSLAKLRDWFDDPLFIKTPQ GLQLTPLAQSMEHDLADWLQ MGKQLMGKRGDDMTKGLSFE LMLESPLSLIMLNQLTQAIY QHYPDAKVKVRNWDYDSLEA IIRGEADIGFTGRESHPRSK ESLDLLPYFIDFEVLFTDLP QVYLRRNHPALQEEWSIDTF LKYPHINILWEKSETWALDD VLIELGLSRNIVLTLASFEQ SLFVAAEPHHSMMAIAPQYC EQYARQLHPDLVTRPIPISG EYLNKLAIPFTLIWHKRNSH NPKITWLRSKIKAIYQPRAR D >_0081.003627_ SDEN_20JUL04_CONTIG98_REVISED_GENE3629 sden_20jul04_Contig98_revised_gene3629 MWHLFLECAFKIFLFEHSFK YFLNNFSKLGRLIIPSLKKT QIMRNAEFDRKAVLCAAMSV FTAKGYAKTSMQDLTKATGL HPGSIYCAFENKKGLLIAAI GQYQDERHQQFVRLFDNDAP CLSNLSNYLTEIVDECLSQD MSKACLLTKTLNEVGSQDEE IKTLITANLTAWQAALTAVF SLAQKNNEIDSQANSQELAQ YLMMGIYGLRTFAHTHKDGQ QLAQLAKKLLADVSR >_0063.002619_ NP_249392.1 gi|15595898|ref|NP_249392.1| probable transcriptional regulator [Pseudomonas aeruginosa PA01] MALSESAGQRPALLRPAQGR DDRAYWHELFGSLEVVHFFL VTARCGCFMQAARSLDVKPT LLRKTLARLEERLGLHLFVH EGNALSLTREGRIVQAAGQR LVEDSQSHADLHRQQPLVRL AVAESVLHDVLSRELWGYLR KNANLRVALAELREGWPESA GPAEIAIWIADPGQPHPAME GHFAAPARIAELEYQPHIGK RYSRERTRPASEDELDDYLL AQLHGHAASAALAPWNRRVA ARQSGVIEVQSHDLLLQAIL WGACIGLLPHYAGRLERNLA ALPQVFDEPMRREVWMSVQP EAENRVEVRALLDLIEHAFD DRRDWFGR >_0061.000710_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF1377 npun_22dec03_Contig1_revised_geneNpF1377 MDKKSSVSPLPTKALSDDKN SPITVWHIGGEDIRLRIPLL LKLREKGFNVGAVGSEDGNA FADNQIPYFRYTLERGINPW ADIYTRSQLSALFRQHQPDI VHSFDTKPAMIAPIVAMKTG IPGRVRTITGMGYVFSSTST LALALRPIYRHLQRQAAIAT GITIFQNPDDREYFCKHKMV LDGRDDLVLGSGIDVEGLMK NSAEPEKLAATRRELGLEGQ LVVTMISRLVISKGVREYLQ VASIVCQQMKNVTFLLIGSV SSEGGQAIPIQEIHQQAGVV RYLGPRNDIPTLLNLSDVFV LPSYYREGVPRVLLEAATME LPLITTDMPGCKEVVKDGWN GLLVPPRDTKALATAILKLL NSPEQRNLMGKRSRVHVQTN FSLNQVADAYADIYNRVLKL PKTLK >_0030.001489_ DHAF_12NOV03_CONTIG1061_REVISED_GENE1691 dhaf_12nov03_Contig1061_revised_gene1691 VEKQSLRADDCEEKIIKQYS DLVYRLAFARMGTRHDADEI FQEVFLRFIKKKPVFHEEEH RKAWFIRVTINCSKSFWSSS WFKNVQPIDDDIAFETKEAM DLYYELQKLPPKYRGVIYLF YYEDMSIEEISKALNRKNST VRTQLTRARAALKAVIKEDD YV >_0006.002473_ NP_887266.1 gi|33599706|ref|NP_887266.1| putative isochorismatase [Bordetella bronchiseptica RB50] MKLINGVEVRDTLEELIAPA HTALVVVDVQNDFCHPDGHF ARHGKNIDTIAGMLPALVPF VNAAQDMGIFTVFVQQLTLP HGRSDSPAWLRLKCRDGKSP EYTMVGSWGAQLVDGLQPRA GDVMVQKFRPDAFVRTPLDG ILRAQGIESLVIVGTTTEGC VESTVRGASYHDYYVIPVTD LITGPIAQLHANSLAFMRAR YPAAESAQVLQTWRAARGAA AA >_0107.001678_ AFE_1726 AFE_1726 glycosyl transferase, group 1 family protein {Acidithiobacillus ferrooxidans ATCC 23270} MTETGLKAEVLPTGVAVTSG PHTPRVLFVNQACVLGGGEL SLLDIARLLPFQIRVALFKD GLLRERLKQAGVTVDVLDPT GNGLSKIHKDSGWREAFGSL TQLVRITWRLQGLAKNADVM YANSQKAMVVSALVCITARK PLIWHLRDILSAEHFSPTMR RVAVTVANARASAVIVNSHA TGEAFVAAGGRRKLVRVIHN GIDPKPFDGITAHEAVLARA ELRPLDNSFLIGVFGRLAPW KGQHVVLEALCSLPGVCAVF VGDALFGETDFVHVLHKRAE REDLRERVRFLGFRNDIPRL MRAVDVVVHSSVNPEPFGRV IVEGMLARRPVVASAAGGVL EIIEDGDTGLLYPPGDGLAL RAQIERLRNDPALCERLGAS GYKKAQEYFSIPAMIDGVNS VITEVSSPRRRSV >_0035.002091_ NP_815929.1 gi|29376775|ref|NP_815929.1| PTS system, IIB component [Enterococcus faecalis V583] MQKPNVKMVRVDERLIHGQG QLWIKSLGVNLVICANDKAA EDSLQQTLMKTVVPKETNIR FWTIERTAKVIWKAAPSQTI FVVVGNLHDALELCKLGFPM EQLNIGNIHADEGKEKISQF IYLGKEDKQALCLMRDNYGV TFNTKTSPLSNDGSQYLDVL MEKISN >_0109.003022_ RER070207003023 REr070207003023 MTIVKNQHQAEEITQNTFYK AMTAKKAYAGKSSEQTWLCS IARNLAMDECRKSTKFTELD EEQLEQPDNMVKSLENKDTA LQIHLILHELDEPYKEVFQL RIFGELPFSQIGMIFGKTEN WARVTYHRARLKIKERMDRN E >_0081.003176_ SDEN_20JUL04_CONTIG88_REVISED_GENE3177 sden_20jul04_Contig88_revised_gene3177 MISKEKAPLTVYAPASMGNV GVGFDLLGAALAPIDGSLLG DKVTISQASNSTASPASLHA SGEIHFSQTGIWSHKLPTVP EDNIVYQCAQFFLDKLGVKT GIALNLEKNLPVGSGLGSSA SSVVAALYGLNEYFDTPFEP QVLLQLMGEFEGKISGSVHY DNVAPSYLGGMQLMLDSPSE LCAAIPHFKHWYWLVAYPGI SLSTAKMRALLPAQYDKAVT IDFGRHLSAFVHASHSQNPK LAIEVLKDVLAEPYRADAIP GYKQASGALNQLGMLTTGIS GSGPTLFSITDDIGLAEKAK AWLTDNYVTQDGGFVHICHI DEQGARRV >_0079.001981_ SBAL_17SEP04_CONTIG224_REVISED_GENE1988 sbal_17sep04_Contig224_revised_gene1988 MDKQQLWVLDGGMGRELARR GAPFRQPEWSALALIEAPQT VTEVHQAYVASGAKVITTNS YALVPFHIGDERFAAEGEAL AALAGKLARDVADEHANAVR VAGSLPPLFGSYRADLFEAA RVSELALPLIRALSPSVDLW LAETMSLIAEPLAIKALLPE DGKPFWVSFTLEDETLGSEP TLRSGERVADAIDALVAVGV DAILFNCCQPEVIEAALQVA SDRLSALGRADIRLGAYANA FPPQPKEATANDGLDEIRAD LGPLDYLGWAERWRAAGASL IGGCCGIGPEHIQALSTRLR >_0062.000906_ OOEN_16SEP02_SCAFFOLD30_REVISED_GENE1229 ooen_16sep02_Scaffold30_revised_gene1229 MAKNTRDKIIETTISIIENK GINFVNMRDLGAQIGLSRGA VYRHFKNKDDLLVTIAIQSF VKLSEHMSKTVRDNSKEQLI NLLNYYYNFGTKHPSLYDLM FQKKWTTSEYQNLHAIATQP LEILRKFIPNTIDSATVLAF IHGLIELTNSGHVEPEKGLD DPQLLISTFITKIYE >_0062.000769_ OOEN_16SEP02_SCAFFOLD2_REVISED_GENE642 ooen_16sep02_Scaffold2_revised_gene642 MKIVKFGGSSLSSGVQIRKV FQIVKSDPERRIVVVSAPGK RFKGDVKITDLLLKLAEAIL NEEKTAQIYEQIFLRYQAIG DFFKIPRSGIEQLKEHLYSV AQADYPSNDFFRAALSAQGE NMSAHLITLIFKQLGLKARL LTPKEVGLTVSGEARKAQIL PGSYAKIAKTKFTDNEVLIF PGFFGITKNGLINTFSRGGS DITGAILARGFGADLYENFT DVDSIYAVNPGLVADPAPIK EMTYNEMRELSYAGFAVFHD EAILPAIQGKIPICVKNTNR PEMPGTKIVPRDKVSHKNKI TGIASSHHFQAIYLHRYLIN REVGFTAKILKIMADLNISY EHMPSGIDDLTIILDKNQLT NGRKEKLTQRIKDEIQPDDL QWRNDYAIIMVVGEGLVNRV GAMADIVDPIRDAGISLTMV NQGSSEISIMLGVRPEDEQK AVKAIYNDHFENGHPKKELQ IPRASFKSVSKSLIANFFV >_0060.001399_ NMUL_10JAN05_CONTIG15_REVISED_GENE1400 nmul_10jan05_Contig15_revised_gene1400 MQQLLCMLVTGRIGLLHVHM SSRASTWRKSLFLLMGMVFR VPYLIHLHSPNFVDFFEHEC GERRKRLIRFLLSRALYVIA LSQGWAKDIKKISPAARTVI LFNSVPLPAARLREKEMEES SNALCDPPLILFLGHVGKRK GTFDLIRAVALLSENFRLII GGDGELQRAQMLSEELGVSD KILFAGWLGKAEKDHLLARA AIFVLPSYHEGVPMAILEAM SWGIPIVTTPVGGIPEVVTE GQEGLLVNSGDIVGLAHALA RLLAAPSLRREFGERGRHKI ESKFSIKVLQPQLEQLWIDS GVSEPEPRPEPELVMQRGEV PAPEIQPKV >_0051.000870_ NP_248053.1 gi|15669248|ref|NP_248053.1| capsular polysaccharide biosynthsis protein M [Methanococcus jannaschii] MSNKKKQLTVMGTVWDFWSV LKMFDKLYESKYISFYEPWL KGEIDKEKIILFNEKSKNPL LWPFKILKRTYKILKIIREF KPDLVITHHDDANVSIIPVI LLNKIFKISNNTKFILWVRN NPIESYKEGLYSKIIILAYK YFYKYADIIIVQTQENKKII ESHFKSLKNKTKIVPNVYEI DKLQQLSNEPLEKQYRNIFK DSFVFINIGRLTEQKGQWFL IRSFKRVTEKYPNAKLIILG DGELKNKLQELINKLNLQNN VYLLGMQKNPFKFLKHSNCF VFSSLWEGLPNTVIEALSLN LPVISTDCKTGPREILCPEL NISDKIDYPYYGKYGILTKP FSREFIWQDLNEKPLIEEEK MLADLMIKMIEDEDLRKRYS NGLERAKDFDIEKIIKEWKL LIEGTI >_0049.002450_ NP_786344.1 gi|28379452|ref|NP_786344.1| transcription regulator [Lactobacillus plantarum WCFS1] MDFNQLQTFLRVSEYGSFTK AGEQSFISGTAVMKQINRLE AELNLKLFVRTATGVQLTPQ GKKFQPYVQQLLDLLNTAIE ETRRVRSDDKQLILLGTSLL HPADAFMSLWKELAPKMPKF QIRLVQLQEDLNSRNREYAM LGRSSDLIVGTFDSTTLKQS FSAIQLGAYHFGIAVRSDNP LAQLDEITYSDLAHRKVLMV STGISEKNDLVRSEMLAAEP SIQPIDTSGRYDINTFNETV EENIAMISLTPWKRIHPNLV TVPLKTSVTVPYGLLSTKFP GKKTADFLHEFTKLVPTETQ SHS >_0024.001517_ CHUT_08NOV04_CONTIG199_REVISED_GENE2364 chut_08nov04_Contig199_revised_gene2364 LTLESKYHKKPEEIEEEQLW IEKAKQNPQHFAVLYEKYYK TIFLYLFRKVNDMDVAGDLC SDVFSKALSAIQSYEYKGVP YSAWLYRIAANEANMYFRKH NKREMICIDDTSIHLLNEDL QESNEENIYLTFLPLCLERL KPDEVILVQWRFFENKAFKE VGEIMNMTENNAKVKTYRIL EKIRKWMLEMKGKSNE >_0020.002795_ CAUR_25MAY01_CONTIG939_REVISED_GENE4193 caur_25may01_Contig939_revised_gene4193 MAIIRAINRVAPIPLPLRIV EEFAVQIVINGSFWSQPTVG IGQYLHHLLPWLHRLAPQHR YLMVVPAGTKTPALPVGVEG ITVKIGGPRQIAKVIFEQIA IPVITQRLARNGEPTVIFVP YFAPPLRARQPVVTTIGDLI PLLLPAYRGSWAVRTYMALV RRAALRSAHVLTFSTFSRST ILNYLAIPSDRVTVSYLAAG DQYRPAADVHAAQALVAARY GVQPPFIYYVGGLDERKNLG TLLRAFALVHGRHPHCTLAI AGRALGRDPRLFPDIDQLIR DLDLTKAVRRIDVPVDDGPL LYQACTIFTYPSRYEGFGLP PLEAMACGAPVIVSDASSLP EVVGAAALRIAPDDVAGWAA AINRLLSDEALRSDLRTRGL AQAASFSYRHTATITLNVLE QVAKAAPVVRHR >_0009.003585_ NP_244492.1 gi|15616187|ref|NP_244492.1| late competence protein required for DNA uptake [Bacillus halodurans] MKKTAEGDHCLDCKRWLPEM QTIEKNRALFEYNPFLKNVL TQLKFRGDVKLAEAFHPLLK KLYQKEFKFDVIVPIPLSKD RLIERGFNQVEALLAKWCTY EDVLARKPGRKQSKKSRIER IQQRTTPFMLKGEAEAVEGR SIVLVDDVYTTGATIRQAAT VLQAHGAARVRSMTIAR >_0069.000630_ NP_895791.1 gi|33864231|ref|NP_895791.1| putative uroporphyrinogen III synthase [Prochlorococcus marinus str. MIT 9313] MSSKPIAPLHGRTIIMTRAQ EQQSEARSQLHALGSNVLDL PALVIGPPDDWQPLDDALAD IKTFHWLVFSSANGVRAVEE RLQRIGQSLANRPKGLKLAA VGRKTAQYLEHLGAAVDFVP PNFVADSLINHFPVSGFGLK MLLPRVQSGGRTILGEAFRE SGAHVIEVAAYESRCPEAIP DDTATALANSNVDAIAFSSG KTAAHTASLLSHRFGSDWLQ QLETVKVISIGPQTSLSCEQ HFGRVDQEADPHNLEGLISA CVKACRSEAETCPPGSR >_0062.000677_ OOEN_16SEP02_SCAFFOLD26_REVISED_GENE976 ooen_16sep02_Scaffold26_revised_gene976 LMMQLTKEKNNYNFLFSSWV KASSKNEIMLNENSNLADKV AFIFKTFEIDKLEMMQMFAG QDSDLKNKVINTYDDLIHYC YQVAGTVGCMIFPILSKNNN LSSIRNKVIDIGIAMQLTNI LRDIHEDAIRNRVFIPDQLL VLFKVNKEELKGRKTKRNLK KLIAFLSKKALFFYESELEV IQTVDSFSAKFSLKLAIATY KKILEKIIDSDFEVLEGRIY VTNLEKAKILDNIILKL >_0105.000358_ YP_054863.1 gi|50841636|ref|YP_054863.1| putative glycosyl transferase [Propionibacterium acnes KPA171202] MRILIVGAASSIHTVRWVNG LVKRGHEVHLASVHPVGRHS IDSRVRIHLAPHGGKAKYVV NAGWLRSVAAGVQPDIVNVH YATGYGLLARLAHIDAPTLL SVWGSDVYDSPRANPLMRHM VRSNLVSATRIASTSHCMAR VTRDLVNKPISITPFGVDTE ILTPPDRRRDANDGDGVVCI GTIKALHSKYGIGELIRAFS RVHDERPNTVLHIWGGGPDE NPLKVLARRLVPDGSVEFRG AIDHSEVRDALGSLDIFAAL STLDSESFGVAIIEAGACGL PAVVSDADGPAEVVEDGVTG LIVPRGDVIASATALMQLVD DVELRRRMGGAGRHHVVETY SWERSLDLMELAYRDTIDDA ARQHCSGRHGG >_0083.002215_ NP_837031.1 gi|30062860|ref|NP_837031.1| hypothetical protein S1430 [Shigella flexneri 2a str. 2457T] MGVKMSLELPWPNDERLQQL CENLLNNQGYLPTLDNLADK INVSSRTLMRLFVKETGLTF RHWVQQMHVISAVTLLDDGY SLTKIAHRLGYASAESFGNM FKRRTGYSPGKFTRRLTMHN YAITRQMI >_0074.002095_ RRUB_10JAN05_CONTIG98_REVISED_GENE2885 rrub_10jan05_Contig98_revised_gene2885 MSDAPQPAWKTAFCSFYYQD APPPDDPLLEAGKLALLVID MQNVYVSRPDRASLDDAGKR AYDAWTPFHQRMGETVIPTI ARLQAAFRAGGHPVLFARIA CQTTDGRDRSLSQKLPGWNN LLLPKDAPASQILAELSPLG DEIVVTKTTDSALTGTNLRL ILTNLGVTQVVCCGIFTDQC VSSTVRSLADESVSVIVVED GCAAATDALHRQELAIINRI YCAVMTADDVLGYLP >_0062.001145_ OOEN_16SEP02_SCAFFOLD3_REVISED_GENE1178 ooen_16sep02_Scaffold3_revised_gene1178 MMAVLPKNHHLANLRKISIS QLEKEFFLLEPKGSRPYNLC VNLCKSSGFLPNVVYTDRQI ENIVDLVADGMGISLLMSKL VPKESSRIVAIPVEPTVSTS ISLCYMSDENLTDLKRKFIA YIKNR >_0053.004173_ MMAG_12JAN01_CONTIG3880_REVISED_GENE4195 mmag_12jan01_Contig3880_revised_gene4195 LRILHTIPGRNWGGMEHRTL EQVRWLKSHGHDVWLASPQD GESYKRAQAAGMPVVDFDFD RPWKPATVRSFRKLLIEKSV EVVDTHVTRDAKTACACLDL VAVVRSRHVNQPLKGGAIRR LQWRMGADHIITVAECTRSQ LLGVGLADAKRSVSIGGWAD ERFFDLPDPVATRARLRAEL NIPAEAYAWVCVGMIRPDKG QDHLLAALALLKARGLSPML TIVGSATSECADYERGLHAQ LSAAGLAGQVVFTGYREDVS ELMQMGDAVVIPSLTEAQPR VAVQAFAVGKPVVASAVGGV PEIVFDGETGLLVPAADPAR LAEAMARIMTDHDATARMAA NARQMAEKDMRFDNRMNQTL EVYRTAQAHARKRFLPKFRG VGA >_0035.001743_ NP_815513.1 gi|29376359|ref|NP_815513.1| transcriptional regulator, LysR family, putative [Enterococcus faecalis V583] MTIEKLEYFYTIAKYNSISK AASELHVSKSTLSASLKDLE SELGHLLFNRNGNSLTLNSY GDKIVQSVYIILNEAKKMKL NLHEMIENPVMRLGFGNTSL MYKVTENEDQLNRFWECYHG SSFELLNKLENHELDFVITS ADVNSPVLKKEQLIELKMYL CVSREIKQEIEEEGFSCLTN YPFLFLPHHLDHLEATKSVL EMLQLTSPLVCCYDTLMLTR LIEKSKGVYAVISLRKEQLQ EIDSKLFFLPIERKQKFYLY RNVSSSVFVQPGQIKATLQK LLET >_0016.000804_ YP_438494.1 gi|83717366|ref|YP_438494.1| transcriptional regulator, TetR family, putative [Burkholderia thailandensis E264] MSAIPQATRKRGRPVKGESA TLRDELILKSAKLFRTQGYE RTTVRDIAAAAGVQAGSWFY YFKTKQDILVAVMEQGMSNA LARIEALDVENLPARDAFRA LVRTHLQTLVSPDHDFIPVL LYEWKSLDEAMRSKVLKLKD RYEAVWDGVIERLQAAGDWP APTPIDRLLTFGALNWVAQW YKPDGALGLDALAEHAVRFL LRTGAMPAQAPAAASAAKRR RAAKAG >_0120.002810_ YP_001303579.1 gi|150008836|ref|YP_001303579.1| putative transcriptional regulator UpxY-like protein [Parabacteroides distasonis ATCC 8503] MGMSFSTAEEKKQVRWFVMR AYKNEKMAEDRLKDKEYGLE YFIPKHYAVRTYHGVKSTRL VPVIPSLVFVHASHSQITEF KKRYNFLQFAMWEKSTGMEY ITVPDDQMDSFIQIASLHEK DTAYYKPDEIDVRKGTRVCI HGGKLDGVKGVFMRVKGKRN RRVVVMLEGIMGISAEVHPD LIEVIS >_0088.003584_ NP_719260.1 gi|24375217|ref|NP_719260.1| hypothetical protein SO3720 [Shewanella oneidensis MR-1] MVIADIKAVPAMESFDKIII GASIRHGKHNPALYEFIQKH QQILTQKVSGFFSVSLVARK PEKNTPETNPYMQAFLSKTT WRPKLLQVFGGNLNYQGYNA FDRNIIRFIMWLTKGPTDPV TNVEYTDWQKVQEFGLQIHQ A >_0085.000952_ SHEW_20DEC04_CONTIG133_REVISED_GENE953 shew_20dec04_Contig133_revised_gene953 MSLKMKLIYVVESSVPYGAN KCLIEIIDRLDKDKHEVMVV GADEGALSSWLRERNVSYVS LNHRLSIYPVLNSRKDYLLW LPRLFRRVALNVIAMVKFFK ICKRFKPDIVHTNVGPCSLG YFVATYLGIKHVWHVREYQD LDFGMSFFPNRKAFLKRLKR SDAVICITNSIAAHFHLTDN ENLAVINDGVISDEKEIDII QKENYFLFAGRLEAAKGIEE AIESFFEFCETDSSGISFYV AGDGNFNYLKKLKEKVSKSN FSNRVRFLGFRDDIFALMRM AKALVVASRCEGFGLITAEA MYQGTLVIGRDTGGTSEILK DSKGIHYGFLFNSIDELTKL MHEVVDLSPDEYTKMARSAQ RRTLDKYTINRNFREISDVY NTLM >_0076.002035_ SAMA_14OCT04_CONTIG76_REVISED_GENE2038 sama_14oct04_Contig76_revised_gene2038 VGTGRGHVVWAARNLYPVGA FYPHFLENLMTHTLILYSTV DGQTRKICERIKARCEAAGE QVTMADIAEADALLETADKV LVGASIRYGKHRPALFEFAT RYGAVLGTKINGFFTVNVVA RKPEKNTPATNPYMQKFLQL SQWQPQQLGVFAGKIDYPRY GLFDRTMIRFIMWITKGPTD IKGTFEFTDWDKVDGFADTF AARKAP >_0074.003356_ RRUB_10JAN05_CONTIG98_REVISED_GENE62 rrub_10jan05_Contig98_revised_gene62 MAAPDTPPDSPPPISAVEMP SGKDAGTENFPVGSVLLPAR LRPHVARFYAFARAIDDIAD SPDLDAAAKIDRLQGFEEAI TGRDTTDPAYAKGHALRVSL EATGVPAIHGVELIAAFKQD AVKGRYATWAEMMDYCRLSA APVGRYLIDLHGGSRAGYAS SDALCVALQVINHLQDCQDD FRVLDRVYLPADWLAAEGAS VSDLDRAACTPGLRRVLDRC LEGTRRLLDAARPLPGDIID RRLGLEAAVIVSIAQTLTGR LARQDPLAQRVKLGKAATAF CAARACAAYLMVPGAYR >_0011.004506_ YP_105134.1 gi|53716446|ref|YP_105134.1| iron-sulfur cluster-binding protein, rieske family [Burkholderia mallei ATCC 23344] MSNLSDALQLKSAHSQLPVT AYFDEALLAREIETLFKKGP RYVGHELMVPEAGDYFALPS EDEGRVLVRNQASQIELLSN VCRHRQAIMLNGRGRTQNIV CPLHRWTYDLEGQLLGAPHF PDKPCLNLHATPLQHWQGLL FEAEGRDVAHDLAQLGTKHH FDFSDYLFDHVEIHECNYNW KTFIEVYLEDYHVVPFHPGL GSFVSCDDLKWEFGDWYSVQ TVGVHNALAKPGSPTYQKWH DQVLRYRNGVPPEFGAIWMV YYPGLMIEWYPHVLVVSWLI PRGPQKTTNIVEFYYPEEIA LFEREFVEAERAAYMETAIE DDEIAWRMDAGRRALMERGE SQVGPYQSPMEDGMQHFHEF LRRQLGAI >_0107.002228_ AFE_2288 AFE_2288 GTP cyclohydrolase I family protein {Acidithiobacillus ferrooxidans ATCC 23270} MPSQPSRELERFSNPHPERD YVVHMDLPEFTCLCPLTGQP DFAHFMLDFIPDQHNVELKS LKLYLWSFRDEGAFHEAMTN RIADDLIGLINPRYLRLLGR WYVRGGITTDVLIEHRQPGW QNPDILGQLPTVRWAQHQPG H >_0104.002382_ NP_636884.1 gi|21230967|ref|NP_636884.1| 5-methyltetrahydrofolate-homocysteine methyl transferase [Xanthomonas campestris pv. campestris str. ATCC 33913] MTHLPIPSAESSIPFSLPWL HPERAAKLTAALRERILIID GAMGTMIQRHDLQESDYRGT RFAEGYDSAQGHVHGAGCDH APQGHDLKGNNDLLLLSSPE IIAGIHRAYLDAGADLLETN TFNATSVSQADYHLEHLVYE LNKAGAQVARACCDAVEELT PQKPRFVIGVLGPTSRTASI SPDVNDPGYRNTSFDALRET YREAIDGLIDGGADTLMVET IFDTLNAKAALYAIEEVFEA RGGRLPVMISGTITDASGRT LSGQTAEAFYASVAHGKPLS VGLNCALGAKELRPHVETLS QIADAYVSAHPNAGLPNAFG EYDETPAEMAETLREFAKSG LLNLVGGCCGTTPDHIRAIA EAVADLPPRQLPNALEQAA >_0093.002836_ NP_342328.1 gi|15897723|ref|NP_342328.1| Hypothetical protein [Sulfolobus solfataricus] MFFARLIPLKGVLELPFIVK EVITMSGYKELKIVVMGKFP DDNLKAIFFEIVKRLQLEDN IIYKGYLTSREELFNMVSEA RCMIYPTHEDSFSLAILESI TVGTPVVAYDIPGPKSVYSG LSAVKFVEEYNIKLMATEVT KILTMNDDEYNSLIFNDKMD KFIEKHTDWDLVVERYYRDL MSLF >_0076.003088_ SAMA_14OCT04_CONTIG96_REVISED_GENE3092 sama_14oct04_Contig96_revised_gene3092 LSGHDKVQTPKRNRSEIKRE AIMLAAKELFQTQGVQGTSM DELARVAEVSKRTVYNHFAS KEALVLELVAELWQSANADI KVCFVKDHPVHPQLLEVLNA EIAIMTNPDYLELVRVAIGH FMFHPGALKCELESRVTHES ALRRWLSEGIKSGALPNLDV DEVECTLHGMIKGVCFWPSL MQLCEPLPKGELERLGADIA AFFEFKYLSGRN >_0061.003750_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR1064 npun_22dec03_Contig1_revised_geneNpR1064 MKLCIVTHKIKKGDGQGRVN YEVANEAIRRGHQLTLLASE VAPELEANSQVNWIPIPVKD YPTEFVRNFVFAQKSTDWLR KHRSEIDLVKVNGAINLAAA DVNAVHFVHSSWLRSPVHIS RNRRDFYGFYQWLFTAFNAR WEKQAFQKAQVVVAVSEKVA HELVNIGVPRSRIRVIVNGV DLDEFTPGESDRQKLGLPEN VTLALFAGDIRTPRKNLDTV LHALVKVPDLHLVVVGHTQN SPFPQLAASLGLSKRVHFVG FRRDIPQIMQAVDLFVFPSR YEACSLVLLEALSSGLPVIT ATATGGGELVTPECGIVLSN SDDSDALALALLTLVSSPTL IKQMGKAARSVAEQHSWTTM AQTYVDLFEELSKNAEHRSD TNLSPSTRPITLPFSATEAN >_0055.001807_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA1177 rgel_26jun05_Contig562_revised_geneMpeA1177 MSIKSDKWIRRMAEQHGMIE PFEPGQVRNAPDGHRIVSYG TSSYGYDIRCAPEFKVFTNI YSTVVDPKAFDEKSFVDIDA EVCIIPPNSFALARTVEYFR IPRNVLTICLGKSTYARCGI >_0043.002625_ JANN_22DEC04_CONTIG27_REVISED_GENE2626 jann_22dec04_Contig27_revised_gene2626 VDVDFRKNGDPAHLDVLVPN FKKRLSGVTSTIVRLVPLQA QEMALAAVSPDLPAHVPQIR LSQLLTMPRSGPHGARVWHA RRNTEMLGGLALKYLLRKRL KLVFTSASQRRHTGYTKWLI GQMDHIIATSQKTASYLERP ADVILHGIDVDSFTPATDKA ALRARLGLPDATLIGCYGRI RAQKGTGDFVDAMIPLLRAN EGVHGLIMGRATEKYVEYER GLHAKIAEAGLTDRLHILPE VPVHDMADWYRVLDLFVAPQ RWEGFGLTPLEAMACGVPCV ATTVGAFPELITDQVGMLVS PQDTPAMIRAITHYTDDAHM RLGQGAAARAHVEQNFRIER EAEAIMAIYRRLLATGQARR >_0038.001636_ NP_444240.1 gi|16554516|ref|NP_444240.1| Aspartokinase II alpha subunit [Halobacterium sp. NRC-1] MRVVTKFGGTSLGSGDRVER AADSIADAVAAGHEIAVVAS AMGNTTDELLDDITFDADEP DRAEIVSMGERTSVRMLKAA LAARGVDATFLEPGTTDWPI VTNERGEVDADATAAGVDRL AGRLGDTVPVITGFLAEDPQ GNVTTLGRGGSDTTAVMLGR YLDADEVVIVTDVEGIMTGD PQVVEGARNVGRITVDELRN LSFRGAEVIAPSALSFKDDA LDVRVIHYQHGDLLSGGTRI EGTFESMIDMRDSPLACLTV AGRAIRNRPGITSALSTALS DSDINVDAVASGMDSMTFYV DESVAERAENILHQEVIAVS ELSSVTVTDDIAAIRVLGGE LPNRPGVLCRIIDPLADRNI NVIDIISSATSVAVFVSWAD REPALDVVQNGFGS >_0013.001363_ NP_882213.1 gi|33594569|ref|NP_882213.1| TetR-family transcriptional regulator [Bordetella pertussis] MHPVRTTLPADAVLLTPELE NWVGIDYDDMPAVQRKLLDA AAKAFTTYGFAATSIDVIAS QIGATKGSVYYHYRSKTDLF FAVHKCAMVMNLKAQVPVAF DASLDPRAKLYRMAYLHAML MMDSLYYQRVTVQGVELHQS VSTTPMEREALAEVIAMRDV YEGLFSQVVRDGMASGHFAE ADHSIAAKGILGILNWITVW YRPRETESPGFRQRVATQLA TQATQAIQGIARADTRG >_0109.000793_ RER070207000794 REr070207000794 MLRPLFNLLYTDALACGNEA GLWLFGNRKFKVLKNGRNVK KYSFSLEKRNEMRKRLDISD CIAIGHVGGFFEQKNHKFLI KIFREVLNRKPNAKLFLIGD GPLKDEIMKNVSDIRKSVIF VGTVDNVNDYMQAMDIMVLP SLFEGLPLVAIEWQINGLPS LLSNTITEDCNITNMVRFES LEEAPYIWTNDILEMLEKEN RLENSQKAISLVRKNGFDIY DNAKILEKIYKS >_0084.000956_ SFRI_16AUG04_CONTIG69_REVISED_GENE958 sfri_16aug04_Contig69_revised_gene958 MTAEQLVAQYSKSTASRDDS VKTDLTFIGPTVNGQEEVFN SGAVAFLDSLCAKFVDEVPE LLAKRKQKQARIDNGELPDF LPETRAIRDGNWTIRGMPED LTDRRVEITGPVDRKMIINA LNANVKVFMADFEDSLAPSW QKIVEGQINLRDAVRGDIEL TVPETGKHYSLNPDPAVLIA RVRGLHLIEKHIEYKGKPIP GGLVDFAMYFYHNYRQLLAK KSGPYFYIPKLESHIEARWW AKMFAFVEERFCLQPGTIKC TCLIETLPAVFEMEEILYEL RSNIVALNCGRWDYIFSYIK TLKNYPDRVLPDRQAVTMDT KFLSAYSRLLIKTCHKRGAL AMGGMAAFIPAKDEATNELV LQKVRGDKELEARNGHDGTW VAHPGLADTAMKIFNDYIGG DRTNQLHITRDVDAPILASE LLEPCQGERTEAGMRLNIRI ALQYIEAWIQGNGCVPIYGL MEDAATAEISRTSIWQWIQH GKSLSNGKLVTKALFKDMLK EETANVKKELGAERFNAGEF AKAAELFEQITTSDELVDFL TLPGYELLTA >_0079.002232_ SBAL_17SEP04_CONTIG229_REVISED_GENE2239 sbal_17sep04_Contig229_revised_gene2239 MSRFVKRIWPPSAWRIVCLG YQLSTRWLAGSLPNRCLLCH QSIPIPETGICIVCLQSGLY HGPICLGCGKSMQIEVDYCG ECQKRQPRKVVAPCSYHQGL GAWIGAIKYQGQLAALPVLC RALVARIKLLEQQGLIMLPQ AIVPVPLHPKRLQQRGFNQA WLIAHELSQLLQLPLVSEGL TRQQDTRPQAGLSGAQRRRN LHDAFMLADDFAFQRIALVD DVVTTGTTVSEIARLFEARY VHVQVWCLARAEAPGLLDDL DNE >_0062.001561_ OOEN_16SEP02_SCAFFOLD6_REVISED_GENE1882 ooen_16sep02_Scaffold6_revised_gene1882 MKYLLLVSHGDFSKGLKSSL AMFASASMSSVIAVGLKPDE SADTFGERFQKLLKTIPKDS QFIVLADIIGGSPLTTVCNV LSSHGKLDCKLIPYRYLLKF >_0034.004179_ NP_414880.1 gi|16128331|ref|NP_414880.1| transcriptional regulator for mhp operon [Escherichia coli K12] MIFYCALSIGRVFSATIKTC PNVHQVHHVVLTIEMSINMQ NNEQTEYKTVRGLTRGLMLL NMLNKLDGGASVGLLAELSG LHRTTVRRLLETLQEEGYVR RSPSDDSFRLTIKVRQLSEG FRDEQWISALAAPLLGDLLR EVVWPTDVSTLDVDAMVVRE TTHRFSRLSFHRAMVGRRLP LLKTASGLTWLAFCPEQDRK ELIEMLASRPGDDYQLAREP LKLEAILARARKEGYGQNYR GWDQEEKIASIAVPLRSEQR VIGCLNLVYMASAMTIEQAA EKHLPALQRVAKQIEEGVES QAILVAGRRSGMHLR >_0034.000900_ NP_414671.1 gi|16128122|ref|NP_414671.1| putative PTS enzyme II B component [Escherichia coli K12] MLGWVITCHDDRAQEILDAL EKKHGALLQCRAVNFWRGLS SNMLSRMMCDALHEADSGEG VIFLTDIAGAPPYRVASLLS HKHSRCEVISGVTLPLIEQM MACRETMTSSEFRERIVELG APEVSSLWHQQQKNPPFVLK HNLYEY >_0025.001200_ NP_282850.1 gi|15793027|ref|NP_282850.1| hypothetical protein Cj1724c [Campylobacter jejuni] MRYGEKEIKEFDVENMEIWP NDAKNDYIIKITLPEFMCCC PRSGYPDFATIYLEYMPDKF VVELKAIKLYINTFMYRNVS HEASINEIYNTLKDKLKPKW IKVVGDFNPRGNVHTVIECR SDMVVPK >_0018.000789_ BFUN_06OCT04_CONTIG480_REVISED_GENE448 bfun_06oct04_Contig480_revised_gene448 MSFDLETAANPKGLRRVAID PVSRVEGHGKVTILLDEQQR VQQVRLHIVEFRGFEKFIEG RPYWEVPVMVQRLCGICPVS HQLAASKAMDRVVGARPVTR SAEKIRRLMHYGQVMQSHAL HFFYLASPDLLFGFDSEVDL RNIVGVAQAYPDIAKQGILL RKFGQELIRATSGKRIHGTG SIPGGMNRYVSQADRDMLYR DVDQMTGWAADAVDIAKQLH AQNPALYDSFGSFRSNMLSL VRADGAMDLYDGVLRARDAD GGIIFDGASDQDYMSLIEEE TRPWTYMKFPHLRSLGRDTG WYRVGPLARVQNCDFIPSPL AEAQRKEFVDWGKGSPVHAT LAYHWARMIEVLHAVEVIKD LLDDPDILQGELMASGERRA GGVGIIEAPRGTLIHDYRVN ADDLVTHCNLIVSTTHNNQA MNEAVRSVAREYLDGQRLTE GLLNRIEIAIRAFDPCLSCA THALGRMPLDVVLLGPGGEP FDHMTKSHDGEVMRHSPAGM GAMA >_0011.002150_ YP_103966.1 gi|53725750|ref|YP_103966.1| glycolate oxidase, subunit GlcE [Burkholderia mallei ATCC 23344] MEEDDIVAGWAARIRDAAAS GRALRIRGGGTKDWYGQALD GEILDTRAHHGIVSYDPAEL VVTARAGTSLAELEATLAER GQMLPFEPPHFGRGATLGGA VAAGLAGPRRATTGAPRDFV LGVAILNGRGDRLRFGGQVV KNVAGYDVSRLMAGSLGTLG LMLELSVKVLPVPAAELTLK FDMSATDAVRKLNEWAGRPF PLSASAWRYGTLVLRLSGAE AAVKSAKTVLGGEAVDAVEA ERFWEGVREQNDPFFSSLAP GHALWRLSLPSITEPMHLPG TQMMEWGGAQRWWITDADAQ TVRMSAKQAGGHATLFRASE SYDRSAGVFTPLPAPLMKIH RGLKTAFDPARIFNRGRLYP DL >_0005.004289_ YP_323176.1 gi|75908880|ref|YP_323176.1| Deoxycytidine triphosphate deaminase [Anabaena variabilis ATCC 29413] MIKNDIWITEMAQKGMIAPF ESSLIRKIPKDNLVAAQPVI SYGLSSYGYDIRLSSAEFRI FRHIPGTVVDPKNFNPQNLE PTPLHTDKDGSYFILPAHSY GLGVALEKLEVPNNITVICI GKSTYARCGIIANLTPAEAA WRGHLTLEFSNSSSADCRIY ADEGVVQLLFLEGEPCAISY ETRQGKYQDQLEKVTLAKV >_0004.002498_ 17741041 gi|17741041|gb|AAL43530.1| transcriptional regulator, TetR family [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:2520075-2520734, Atu2549 MNKTIDQVRKGDRKSDLPVR RRPRRSAEETRRDILAKAEE LFRERGFNAVAIADIASALN MSPANVFKHFSSKNALVDAI GFGQIGVFERQICPLDKSHA PLDRLRHLARNLMEQHHQDL NKNPYVFEMILMTAKQDMKC GDYYKSVIAKLLAEIIRDGV EAGLYIATDIPVLAETVLHA LTSVIHPVLIAQEDIGNLAT RCDQLVDLIDAGLRNPLAK >_0000.000755_ ADEH_23AUG04_CONTIG59_REVISED_GENE757 adeh_23aug04_Contig59_revised_gene757 VTEHAQHAVDRVEKAIAYLR AGKMVILTDDEDRENEGDLC LAAEKVTPAAVNFMARYGRG LVCLSLTEEKVRQLRLPLMV DETANTSGFGTAFTVSIEAR QGVSTGISAKDRAHTILTAV ADGCRPEDLARPGHVFPLRA RKGGVLVRAGQTEGSVDLAR LAGLKPAGVICEVMNDDGTM ARMPELEKLSRAHDLPIVSV ADLISYRMMKDTLVRRAAEA PLPTEHGEYRAVAYENDVDR HQHVALVKGKWRSNEAVLVR VHSKCLTGDVFGSERCDCGP QLHAALDQIDRAGKGVLLYL DQEGRGIGLANKLKAYNLQD EGYDTAEANVRLGFKPDLRD YGIGAQILRDLGVRKMRLLT NNPKKIIGLEGYGLEVVERV PIEMPATRRNRAYLITKRDK MGHLLTLAPVAAAVARAPAA PARAAKRRRPAPSRKGKGRR >_0088.001510_ NP_719406.1 gi|24375363|ref|NP_719406.1| transcriptional regulator, LysR family [Shewanella oneidensis MR-1] MIINNNIKELSIAHLQLIVC LIKHGNSCVVSDELGISQSS ISYHLRRLRVIFADEMFIRT GKGLKPTERCIQIGHFAQDL INRVEEELIHANDFVPKQMK RELTLIAADTACGWFSTLFS DMQKTLPRVTLCARPWNLKS MEDLDSGAVDFGIHIIENSK KGIYDMDIAPCYRLCVVRDG HPLISKGEVTLKDLEQYPVL INDLGGWNNDGNSLMQRVLE KHGLKLNIAGRLGYINSIFT ALHTSNAITYTSAASIPQSI PGLTLLRGPKEVNEVDCFYR LYISRMRYGSQETNYLIDFI YDSFKHFLTQQYNRIDIAAV INK >_0077.002641_ NP_371252.1 gi|15923718|ref|NP_371252.1| similar to GTP cyclohydrolase I [Staphylococcus aureus subsp. aureus Mu50] MAHGRQQDELQDITLLGNQD NTYNFDYRPDVLESFDNKHQ GRDYFVKFNCPEFTSLCPIT GQPDFATIYISYIPNVKMVE SKSLKLYLFSFRNHGDFHED CMNIIMNDLIELMDPHYIEV WGKFTPRGGISIDPYTNYGR PNSKYEKMAEHRLMNHDLYP EKIDNR >_0073.006373_ YP_298412.1 gi|73538045|ref|YP_298412.1| regulatory protein, IclR [Ralstonia eutropha JMP134] MTTQAAPNDDKAADGGKPQR GIQSLDSTGQLLAALVAAGR PLPLRDLAQAAGMAPAKAFP HLVSLQKTGLLARDAAGNFL CGPLSLELGLIALQRLSPTR EAEPEIIELAEATGLSVAMA VLGPLGPTVVRLEESARPQH VSLRVGTVLSLVNTAIGRTM AAYLPENVLAGLLERDDLRM AGVRRADVLLDGGGLVPDYA DRLAHVRAGQVDNALSRPVP GIDTLAAPVFDHTGSLALVV AVMGSTGSFDSSTQGATADL VRHAAHRLSWRFGAVQTPR >_0030.000170_ DHAF_12NOV03_CONTIG1010_REVISED_GENE203 dhaf_12nov03_Contig1010_revised_gene203 MDTIDEKDRAYIFSIAKNTA IDIGRKRSRQDSSLDEIENL VGDEAVSIEDDIINREMFDI LQKKIDELPNAYGDIILLKY IYGLPDQELAEMLGISLENT RMRLSRARRKLKEMLTEGKE APDHE >_0116.001179_ YP_194174.1 gi|58337589|ref|YP_194174.1| putative pyrophosphokinase [Lactobacillus acidophilus NCFM] MKAYALVGGPTDLWLHDIKK QLTEAKQNNDLIFGVDRGAL FLEELGIIPDVAIGDFDSLQ AKDLSRIEKTVKDIRYSNPI KDWTDSEIMVQTVFKDYLAD KLIILGASGGRIDHFLINLL MWLNPPINQFAQRVEIIDNH NSIVFFNPGVHIIKKKPDYP YIGFATLSETEDFNISGARY DLNDYSSTYPRVFSSNEFLP NSDYFEISLKKGMIAAIYSK DINRFHNL >_0104.001832_ NP_635814.1 gi|21229897|ref|NP_635814.1| transcriptional regulator tetR/acrR family [Xanthomonas campestris pv. campestris str. ATCC 33913] MRMNKASSYPVSGRGPADHD VRDQIVNAATEHFRRYGYEK TAVSDLAKSIGFSKAYIYKF FESKQAIGEMICTNCLRQIE DEVRAAVDETDSPPEKFRRM FKVIVDASLRLFFEDRKLYE IAASAATERWQSVLAYEERV LALLQEILQQGRQGGDFERK TPLDEATRALYVLIRPYTNP VLMQHSLDVIDEVPGLLSGL VLRSLSP >_0095.004261_ NP_459569.1 gi|16763954|ref|NP_459569.1| putative transport protein, PTS system [Salmonella typhimurium LT2] MRHVYVASHGPFARGLINSL SLLIGDEHGVTPVCAYDGDI VTTEQLEQTLENLIAQANGE EVVVFTDLLGGSINNSAAKV LMRHRHVFVVAGVNMTLLLE FLLCEEESTDAAITYATNAA RESIVFINTLITQPSADLQG ESHDQISAH >_0078.001791_ NP_825698.1 gi|29831064|ref|NP_825698.1| hypothetical protein SAV4521 [Streptomyces avermitilis MA-4680] MSNDEITLTAGDAEVALVPG SGGRVRSLRVGGVELLRQGE RYGCFPMVPWCGRIRDGKFR NGATLHQMPLNSPPHAIHGT ARDGAWRVARTGKNEAVLTY DLVEPWPYPGRVTQIAALTE DSLTLTMSVEAYDSSFPAQI GWHPWFNRNLGQDDARVDFT PAWQEERGADHLPTGNRVAP LPGPWDDCFGMPDGVHVTLS WPGQLELTVASREEWVVVYD EQPEAVCVEPQTGPPDGLNT RPRLVTPIEPLEATTTWTWR TL >_0076.002207_ SAMA_14OCT04_CONTIG82_REVISED_GENE2210 sama_14oct04_Contig82_revised_gene2210 MKLETIDYRSPDAAAQFVQS LRDTGFGVLSNHPIQQSLVE AIYKDWYEFFQSGAKEEFRF NPETQDGFFPADVSETAKGH SVKDIKEYYHVYPWGRIPES LRANILAYYDHANQLAAELL SWVEAHSPDEVKALFTEPLP NMIDGSHKTLLRVLHYPPMK GDEEPGAIRAAAHEDINLLT VLPAANEPGLQVKAKDGTWL DVPSDFGNIIINIGDMLQEA SGGYFPSTSHRVINPEGMDK TKSRISLPLFLHPRPDVVLS ERYTADSYLMERLRELGVI >_0066.000372_ NP_904499.1 gi|34540020|ref|NP_904499.1| competence protein F-related protein [Porphyromonas gingivalis W83] MRKTANGTGRHIRLLIRKVL DLFFPRYCPVCDSLLAETEI GVCPRCMVRMPRYIEGMQYG LDRLNGDVYIDALYSLFIFK EDGGVRPMIHALKYGGYSEI GEMLGRMAGRSYPFLSKDYD LIVPVPLHPRKQRKRGYNQA LLIAQGLSRVTGIPVQEGLR RKVYTDSQTGQSYSERKSAM KGKFALSPNTRVAGIRVLLV DDVLTTGATVQAAAEPLAEA FAAKIGVLVAAVTKRPSNWS HYSDER >_0056.003317_ SARO_25NOV03_CONTIG30_REVISED_GENE3523 saro_25nov03_Contig30_revised_gene3523 MPGRPCRPTHRWKRRNSAFA RQTVRLSHDPMTDESRIWTA ALVVVGDEILSGRTQDKNIA QVATWLGVQGIRLREVRVVP DDMDAIVEAVNTLRARNDYL FTTGGIGPTHDDITVDAVAS ALGVEVVIHPKARAILDSYY ASRGGLNEARLRMARVPDGA DLIENRVSGAPGIRVGNVFL MAGVPGITAQMLDGLTGQLE GGLPLLSTTVGCWVAESEIA DLLRETELAFDGCQIGSYPF FREGKTGANFVIRSVSEDQL RACAFALEQGLADMGRFPIP GGI >_0056.002288_ SARO_25NOV03_CONTIG29_REVISED_GENE2435 saro_25nov03_Contig29_revised_gene2435 VTLRLSDLIDDRPSQGVFRV NRAIFTDQSVFDAEMRRLFE GGWVFLGMESQASGPHDFFT TSAGRVPVMVQRDGEGVLRA FVNSCPHKGARLAQVRQGNA RLHVCPYHSWSFDSAGRNKA VKWKAAGCYSDAFDRDDHGL AVLPRFEGYRGFLFGSVSPE VPPLAEHLGEAAKLLDLVAD QSEEGLELVPGQVTFTYQAN WKLQLENCSDAYHFTSAHPS YIRVLERRQKEISEEVVASV WENSDYWKEDTKGVGGGSFS MANGHVLNWGVFGVTPAIPL YERAAQLAERVGEGKRDWMF NMRNLTIFPNLQVAENASSQ LRVIRPISPALTEMRTWCIA PKGESDAARRQRIRQYEDFF NPTGMATPDDTVSYENCQIG FAGTTEPWLQGYARGMEASV EGGNRFSERIGLEPQRSVLA DSQLCDETLYHSYYRAWAAR MAPEFAA >_0037.002508_ NP_953509.1 gi|39997558|ref|NP_953509.1| glycosyl transferase, group 1 family protein [Geobacter sulfurreducens PCA] MYEALEDTRRRDTLAVLRDA AAVAAFHKCVRCRVLDHHPS LAETMAVIPQGVELPGEEFD WGNERFDRGEFVFFLPAGLR PVKNAAFALGPLAELHREEP RVRFLLAGPVLDREYGAATL EAIDCHPFARYLGEVGRDAI GALFRRADAVINSSTFEGGM ANSVLEALAFGKPVLASYID GNRSVVKEGTTGFLFRGERE FLDRARDLLRNPALGRRLGE QGRELVRERFSPGREAEAYL ELYRRITGA >_0032.001724_ YP_010281.1 gi|46579473|ref|YP_010281.1| glycosyl transferase, group 1 family protein [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MRIIILSSSTNRSGGTRQAF YQARELATRGHHVSLCLPED SQMHGMGYDDLIVTLPADRK RWKASIEALFAADGPTVVHA FHNRAVKFVAWHGMFWRHRG VVCCAHRGVMYRPGNPLPYI SPGMDAFIANSQACARVLRL FTPAGKLFVVYNGVDDARVT PRTPADTAREQVGLPPRGAE AAWSSSSPAPLVFGFVGQNS PVKGADILIDAFARADVGEA RLLMVGVSHDMWRPRCEALG IADRVHLVPHTESVSDMLQL MDAFVLPSRTESLPNTMLEA IRMGLPVIGSAVGGVPELVR GNGLLFPAGDIDALAAALGR MASDHATREAWAAASHAEGE RYTIHARVDALEDIYAQLLR RRGLHTA >_0031.001780_ NP_295125.1 gi|15806419|ref|NP_295125.1| transcriptional repressor, TetR family [Deinococcus radiodurans] MTPTMDSSSLRERQKERRRA RIYNVAIDLFKQGGFQGTTA TDIARASNVSRGTFFNYYPY KEAVLLDYGSEVMERLRDLA EQRLAQGVPAMSVLYEIWDT LADENARERDLFPPLAYEVM NPNPERARTAYQALPLSKVV ELVLRPLHQEGQIRTDLSLQ RISNLIADTYLMVALRWSAY GTERPLKDEMRLALGLLLEG ALRRDGPR >_0030.001769_ DHAF_12NOV03_CONTIG1068_REVISED_GENE2015 dhaf_12nov03_Contig1068_revised_gene2015 MREDPIVATEQTAEQMAEQT PDPSDFAAVFQHYYPMVVQT IEKILQERSAAEDLAQDVFW RLYHAPWQDIGNLKAYLIQS GINAAYNHLRTSKRQHSLWE RLTRQTSANEPSAETQWLRA EEIQRVREVLTELPARDRSL LLLRFAGLSYKEISETISME FASVGKSLVRAKERFRKGYL KRGEY >_0021.000829_ NP_421115.1 gi|16126551|ref|NP_421115.1| regulatory protein, putative [Caulobacter crescentus CB15] MTTCYRQPPASMLSATILDS EAMRRAERLFQIIQILRRSR APVTADAIAAELETSKRSVY RDIAALVGQRAPIRGEAGVG YVLDAGFDMPPLMLTPDEIE AAVLGAQWVAGRGDPVLAKA ARDLISKIAAAVPDRLRPYV LEPAAAAAPAWKPQTDKIDV AQVRAWIHAGRKIRLNYSDE AGAISDRVIWPVTVGYRETI RMIIAWCELRGAFRTFRTDR VVGAEFMDERHGKRPAVLRA EWLRFRDAEIAAWEARERAE C >_0019.001207_ NP_349212.1 gi|15895863|ref|NP_349212.1| Transcriptional regulator (TetR/AcrR family) [Clostridium acetobutylicum] MNPLTNRKLQAQNTKNKIYK ASIELFEKKGYENLKIKDIC KEAGVSIGSFYNHFDSKHAI LIEVHKKADEYFKTEVKDNI ISTNGIDKIIEFFDYYSIYN DFVGLDTLKQLYHSGNYFFS QNGRYLQVLLQKIILEGQDK KEIISTMTTEEICNHLFISA RGVCYDWCINDGSYDLVDFM HKHISLIAESFKIN >_0018.007245_ BFUN_06OCT04_CONTIG482_REVISED_GENE7246 bfun_06oct04_Contig482_revised_gene7246 MEAKPPRRTRERILELSLKL FNEIGEPNVTTTTIAEEMEI SPGNLYYHFRNKDDIINSIF SQFEQEIEKRLRFPDDHRAT IDEMWSYLQYMVDFTWRYRF LYRDLNDLLARNRTLETHFK QIISHKVRFASQFCEQLVAD GEMVATPEELQVIATNVGVI GTYWLSYQFVMNPRKYNEQE AIRAELHQVSVQIVSLMAPY LRGRSRQIFDDLVSGKLPKR EFYDYLPPKEGAAPRNEPKD A >_0018.001305_ BFUN_06OCT04_CONTIG480_REVISED_GENE912 bfun_06oct04_Contig480_revised_gene912 MKNALIQSGPDTTGRRPSRK VLIYGLNYAPELTGIGKYSA EMAESLADVGYEVRVICAPP YYPEWRIGAGHSAWRYRTEQ RAAVRIQRAPVWVPSRPSGL KRLLHLASFAVSSLPTVFAQ LFWRPDIVIAVAPSLMNIPA ALMFGKMARARTWLHIQDYE VDAAFELGMLKGKRLRQFAL GVESWLMRRFDVVSTISARM IEHGRNKGVDSPRLFALPNW VDVNVIFPLDRPSLYRTALN IPDEAIVVLYSGNMGAKQGI EVLAQAAASLAHRSDIHFVL CGDGPYKVNLVEQCGHLANC TFLSLQPFDKLNDLLNVADI HVLPQRADAADLVMPSKLTG MLASGRSIIAMARAGTELFD VVSPRGVTVPPEDVQALVAA IENLADDVDQRTRLGAAARA YAETELSRRAVIERLDGRFQ MLCDRVRGRSVFT >_0016.000961_ YP_438752.1 gi|83716361|ref|YP_438752.1| lipopolysaccharide biosynthesis protein, putative [Burkholderia thailandensis E264] MKHTETSIKSLQIGMHWFPE RAGGLDRMYYSLVGALPGAG VAVRGVVAGSERVAADTGGA IRGFGPATSSLPRRMMAARH ALRDVVRAERPDVVSSHFAL YTFPGLDVTRGIPQVSHFQG PWADESHVEGADSLGQKVKH RLEQAVYARSSRLIVLSRAF GQILTSRYNIDPARVRVVPG CVDTAQFDLPMTPADARRKL QLPQDRPIVLAVRRLVRRMG LEDLIDAVKTVRRRHPDVLL LIAGKGRLEGELQKRIDDAE LGNNVKLLGFVPDHHLAALY RAATLSVVPTVALEGFGLIT VESLASGTPVLVTPVGGLPE AVAGLSEALVLPQIGACAIA DGLTAALSGSLVLPDADACR QYARAHFDNTVIARRVAEVY EEAIRAAD >_0004.000710_ 17739093 gi|17739093|gb|AAL41742.1| ring hydroxylating dioxygenase, alpha-subunit [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:726009-727415, Atu0726 MGSEGELRMSQFVANAEANV AVCAAANDRRCLDEDMLPQP RKWCRAGGLIEDLAMELRDT VLRQLKNRREGFSLEQPFYT DPDYFKLDMETIWYRDWLFV GHDCEVPKSGNYMTVQVGAY SVVIVRGRDGQIRALQNSCR HRGSRVCSAQKGQAARLVCP YHQWTYDLDGKLLFARHMGE EFDKAEFGLKPVACETVAGY VFICLADQPADFAPMRAEVE SYMAPHRIWEAKVAHESTII EKGNWKLVWENNRECYHCAA NHPELCRTYPENPSVTGTDG GASDPEIGGHWARCEAAGLP SRFKIDPKGQFRVARMPLIG EAESYTMSGKRAVRRPLSED VSISHIGALLLFHYPTTWNH FLGDHTISFRVLPLNANETM VTTKWLVHKDAVEGVDYDLE DLTHVWNETNDQDRRIVEEN AFGIRSPAYQPGPYSMEDEG GVMQFVNWYSDFMVDRLSGD KARLSAVA >_0003.004395_ ARTH_26JUL04_CONTIG47_REVISED_GENE4401 arth_26jul04_Contig47_revised_gene4401 MRGTTADDRPRRGVGPVLGT FQDVARIAVLRGGGLGDLLF AFPAVSALKAAYPGSTVTVL GTPLHAALVQGTAGPVDAVR ILPYADGVRPGEEDPAALDS FFADMQREQFDLALQLHGGG RYSNPFLLRLGARHTVGTQT PDAAPLERNLPYAYYQHEPL RALEVAGLAGAPPVELEASL RPLPQFAARAGELLDADLPD ADLPDGVPSRRAGSAGRTTR PVVVIHPGATDPRRRWPAGR FAELAAACVADGCRVLVAGD RSERQLAAEIVDRAGSALVR PIAGEADLGTLAALLARCDV MVANDSGPRHLAQALGTPTV GIYWAGNLINAGPLGRTLHR VHLSWLTHCPECGADVTQVG WTAPRCAHDSSLVAGVRAED VYADVRLLMATSPHLRGRSA SP >_0066.001282_ NP_905845.1 gi|34541366|ref|NP_905845.1| hypothetical protein PG1739 [Porphyromonas gingivalis W83] MEKVHYAAIDVGSNAVRLLI KCVNSEGMEEPLSKVLIMRV PIRLGEDSFTKGYIGEEKTD NMVRLMRAYYEMMQIYRVKD YRACATSAMRDASNAEAVIA QIREKTGIHIDIIDGDEEAR LVSDNHIEQIISDGGNYIYL DVGGGSTELTIPAAEIFLEV ADITGAKTIIAPIVGLADGI IEDLYIRHQSQPS >_0061.004048_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR1629 npun_22dec03_Contig1_revised_geneNpR1629 MLILDKVELISFIDKQMHSQ SKQLNILTPSRELPLYGKRI LVTAPRNYAYRLSEQIIKQG GLPVFMPTIETCYLSNYAKL DAALNHIAEFDWIVFTSRNG ITAFFHRMNDLNIPVSVVEK CQLCALGKDAESLLSFCGKV DLIPTESSPAGIVAELAKIP QIHNKKVLIPAPEVVGLPEP DVVPNLITDLQQLGTEVIRV PTYITQGLNTSIYSIELNLI HQGMIDVIAFSSTAEVESFL TMVNSQSDYEGCIVACFGPY TTANARKLGVNVSIVSRDYS SFEGFAEAIAEFFTLTSNSH >_0061.001192_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF2445 npun_22dec03_Contig1_revised_geneNpF2445 MDTRTLIYLHQYFSIPTVSG GTRSWEFSTRLVQDKWQVCM FCGDSEIHGIPAVNILKLFN TKNVSFKLNVIPLKYSNYMS FSRRIFAFLSFAVRSSLQVL REEKADLTFATSTPLTIAIP ALLRKWLHGTPYIFEVRDLW PEMPIAMKAVRSPLAIFLAR QLELIAYQNASHIVALSPGM KEGIVKQGISPEKVEVIPNA CDNARFNISETIGLEFLRQH PELSGGPLIVYTGTFGHING VNYLAYLASHMRNIIPEAKF LVVGSGVCESEVREASCQLG ILEKNFWMWPPIPKAEMPAL LSACTVATSLFKPIPEMEHN SANKFFDALAAGRPVVINYG GWQKEILEQSGAGISISSND PKVAAIQLAKFLNSSECLQS AQAAARKLADTVFDRDILYK KLANVFQKVLNETHSDI >_0061.000696_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF1359 npun_22dec03_Contig1_revised_geneNpF1359 VIINSFPKIVKDILRGLPKN DYPVLNSRLFFECWLSYAMD NSLTSMRDLFNRLNNTGFPV DISTFSKANLHRSQKPFQEI YQKLNELVQKKVQKKLHDKY AICPIDSTIITLTSKLLWVL GHHQVKLFSSLNLATGSPSD NFINFGHDHDYKFGCKMMSS LPNNAVGVMDRGFAGLKFIQ ELVQENKYFVLRVKNNWKLE FEEQTGLIKVGASNDAQAYR VINFCDLETKTEFRLVTNLP TLGEAAVSDYEIRDIYRLRW GVELLWKFLKMHLKLDKLIT KNVNGITIQIYVTLIAYLIL QILSVPQQWGHTLLDKFRYL QSCMCQKISYVHWFEEMMSC >_0061.000401_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF0762 npun_22dec03_Contig1_revised_geneNpF0762 MKAISNDKPQGVCATDFASV LGEENAVCLWENIEIGQQKR IQQAVTSGKPPTCIVYPRTQ QQLAAVIAKAYSHNWRVLPC GSGSKLSWGGLAKGVDVVVG TERINQLIEHAVGDLTVTVE AGMKFSDLQALLAKSRQFLA LDPTAPQSATIGGIVATGDT GSLRQRYGSVRDQLLGVTFV RADGEIAKAGGRVVKNVAGY DLMKLLTGSYGTLGFISQLT FRVYPLSEASGTVVLTGSAE AVSQAADILRGSALTPVQAD LLSTKLVSSLGLGEGLGLIA RFQSISESVKEQSNRVLEVG QKLGLDGAIFADADEASLWQ RLQERIHSTATESVITCKIG VLPTAAVDILTQVGLGLIHI SSGLGLLQLEDKNQVLKVRE QLRSAGSDGTQANSGFLTIL EAPMAVKEQVDVWGYNGNAL PLMRRIKEQFDSKNILSPGR FVGGI >_0056.000757_ SARO_25NOV03_CONTIG26_REVISED_GENE804 saro_25nov03_Contig26_revised_gene804 MSPDVDIKALRCFLALAQEL SFGRAAERMNLTQPSLSAQI RKLEDQVGHRLFERTTRAVH LSPTGKALIEQARTFVSQAD AFAAHLSAWRERPDRRMVLG APIYTFELPEHGALLSAIAR ELPDMSLRVDNGFANSLVEG LIKGSVDMAMVVAAAVPHDR YLADMAGEGAGELEMPDGLQ RITLSDEQIGLAIPEEHPLA SYDIVPPEALSGSVIAMLAP LHGRSLYRPISVWLSAAGAT GFLPPEAHAFALERYCREYR IPAISIAQFRPRETGNVVYR PAGGLNVRTELAVLRSTRKQ RSSIEERLWDLASSLGR >_0037.000304_ NP_952380.1 gi|39996429|ref|NP_952380.1| homocysteine S-methyltransferase domain protein [Geobacter sulfurreducens PCA] MTDYVSFSAFLAESPVILGE GAVIERLRRAGVDLDPWLVN SALVYAPAGRAALATICREY LDIGARHDLPLLLSTPTWRA SRERIEAAGLAGSDVNGDNF RFLDELRRSYGAYGRKVLIC GLMSCRGDAYRPAEALSEDE AREFHSWQADALAAAGVDFL LAATLPALGEAVGLARAMAA TGMPHVVSFVVRPGGTLLDG TPLREAVAALDAAVSPRPVA YLVNCTHASFFRSALLHEAN SSPLVRQRVVGLLANTAALS PEELDNAAELVEESPGTFGR TVAALHRDLGMKVLGGCCGT DGRHIECLAAELSGSPVR >_0033.000541_ YP_048313.1 gi|50119146|ref|YP_048313.1| hypothetical protein ECA0186 [Erwinia carotovora subsp. atroseptica SCRI1043] MTEPTLLHPSLLPLDGGINF RDLGGNRAADGRLIRHGKLF RSGSLDLLSQADCEHLAGVP ISHVVDYRDVDEIAQKPDIL WTGANYHAYPANPLRHEVTA NLDSLGSDVLAAFDSRAFML ELYRRLPFNNSAYKQLVSLL LRPDEGGLVQHCAVGKDRTG IGSALVMFALGADEQTVMED YLLTDTTLTPFRQQLLAHLS ETLNEKALGQFSYVLSVQEE FIVTALQAIYERHGSIDSWL EVEYGLDNRARNYLQDKYLA >_0017.003546_ NP_811601.1 gi|29348098|ref|NP_811601.1| hypothetical protein BT2689 [Bacteroides thetaiotaomicron VPI-5482] MEHGRDASQKAELRERIIMT ATEAFTLKGIKCITMDDIAA ALGISKRTLYEVFADKESLL KECILQKQAERDKYLQEIYE QSNNVLEVILAVFQKSIEIF HQTNKRFFEDIKKYPKVYAM MKDRSESDSEKTMSFFKSGV EQGIFRADVNFEIVNLLVRE QFDVLLNTDICNEYPFIEVY ESIMFTYIRGISTEKGAKVL EEFISEYRKNRVEQQ >_0009.003259_ NP_244073.1 gi|15615769|ref|NP_244073.1| BH3207~unknown conserved protein [Bacillus halodurans] MGVVSSSSKEKVLEAAITLF QVKGFHGTSVRDIALKANVN VALVSYYFGGKQGLLEQLNV QFLEGYIQAMEQATEERLGQ SSHEKLLAVMEQVLLYEQQA PSLARLVLREMTLDSTLVRE IMSTYMRKEKHLLETILRQG MGNQEFRKQPLDLLLLHIRT MMTMPFLQPHYLRELYQLSV NDPSFIKRYMVHVNDWMAAC LLKKPIASSRLVIHLDS >_0113.003588_ YP_001090155.1 gi|126701258|ref|YP_001090155.1| putative isochorismatase [Clostridium difficile 630] MVLLIVDAQKLITNERLYKF NEFVVNVENLIDTARKNNIE VIYVRHDDGVENELTKGKNG FEIYEKFKPCNGEKIFDKKV NSAFKETGLLEYLTNNGEKD IIIAGLQTDYCIDATIKCGF EHGFNIIVPAYSNTTVDNFF MSAEQSYKYYNEFIWNERYA KCISLEETLRKMK >_0077.000269_ NP_371808.1 gi|15924274|ref|NP_371808.1| competence-damage inducible protein [Staphylococcus aureus subsp. aureus Mu50] MSIAIIAVGSELLLGQIANT NGQFLSKVFNEIGQNVLEHK VIGDNKKRLESSVRHALEKY DTVILTGGLGPTKDDLTKHT VAQIVGKDLVIDEPSLKYIE SYFEEQGQEMTPNNKQQALV IEGSTVLANHHGMAPGMMVN FENKQIILLPGPPKEMQPMV KNELLSHFINHNRIIHSELL RFAGIGESKVETILIDLIDK QTNPTIAPLAGSHEVYIRLT ANADSKEQAQSLIQPVKQEI LDRIGEYYYGSDDTLIEQAV IKKIHEPFVIYDGITNGALY HRLKEVDLNDVLKGMINHNE NFVDINKPIEQQLKDAVQFV NKLFNVSSAIILLEYDGVVH IGYDNNFEFKTEQFKMSKSR NLLKNRSQNYVLIRLLNWLR TTN >_0076.002372_ SAMA_14OCT04_CONTIG86_REVISED_GENE2376 sama_14oct04_Contig86_revised_gene2376 MNKPLDKQLAELDIFSLLLF RAIFETGHANIAARQLDVSA PKVSRALASLRLIFADELFY RRQQGFRPTPLAEQIYPAIR ELTDSINLLGMQLKDHQAFH RQECQVFDLAVSSGILTTLA LEFNRREGLCPSVVLHHWQE NTADRIHAGELDLGLALAPE PHTELSYERLCDAPGLCLVG AANHPIWQYTDICLEHICEY PFLCMNYPGFNDKVDPLELF CHREGLIPPTVLRVTDKEEW FGHLLCQHSLAFASPLEWGL LNALPGVEIKHLPAKEISRL HSGSAVPGLYLIEKPAHHRR YSPDDREKLLHIISLTLGLV TEDHPVNFSI >_0056.001946_ SARO_25NOV03_CONTIG29_REVISED_GENE2078 saro_25nov03_Contig29_revised_gene2078 MTNTRPNFVRNCWYVAGWDY EFTAGKPHPRTFLAEPVVFY RKGDGTLVAMADRCVHRLAP LSLGRLEGDDIRCMYHGMRF TADGKCVEIPGQDMIPSSAC IQTYPVVEKGSWAWIWMGDP HLADAALLPDARGLDDPVWV LKSGQLDYAAPHELINDNLL DLSHLAYVHVASFGATPGWI TQQPRTTQIERGVRVERWVE SAPPLPPLPSLAAYESVDMW ASYEFLIPGVFLMYTSLHPP GTAKGSNHAAPAGDVLFSNF TCQAVTPLTAGSSRYFFSWG PGSQFGGEEIAQQMIDVAMA AFLEDKLIIEAQARIIAMSP GEKIMPNAADRTVTIFQRMM ERMKHPASTSS >_0053.004186_ MMAG_12JAN01_CONTIG3880_REVISED_GENE4208 mmag_12jan01_Contig3880_revised_gene4208 VARHGYSRISAPLAVVAPLR ILFVTASRIGDAVLSTGLLA HLAERYPKARFTVACGQAAA GLFQSAPFVDKVIPMVKRRR AGHWIELWRQTAGIFWHQVI DLRGSALGWLVPTMSRKIIK SSWEPKHRLLHLSALLGLDH PLPPVLWSTPDQEAQAARLM GEGPILALGPGANWTPKQWP AERFAQLAGRLTGDGGILAG AKVALFGAESERPSLHALIE AVPIERRLDLIGRVDLATIH ACLKRASLYVGNDSGLMHIA AASHVPTLGLFGPSSEVFYG PFGPLCAAVRGARSFEDICH APDFNHRRPDCLMLDLDVDK VAEAAAGLMAGGRA >_0052.004968_ NP_105601.1 gi|13474033|ref|NP_105601.1| transcriptional regulator (TetR family) [Mesorhizobium loti] MADTTPDVETDKCDDLLESL RRGRPAAGQDPVKRSQIIDG ARRVFIEKGFEAASMNDITR EAGVSKGTIYVYFANKEELF EALIEEERGTIFKNMYDMLD RADDLRQTLVKFGKVLSMKI TSARVIQAQRTVIGASDRIP DMGARFYERGPKRGHDKVVK FLNAAIERGLLKIDDVDLAA YQFTELCLAGLFRQCIFAYR TKAPSQEEIDHIVRSGVDMF LKNYGTEQLAEEESHQMIAL EAKA >_0052.002434_ NP_107308.1 gi|13475741|ref|NP_107308.1| hypothetical protein [Mesorhizobium loti] MPACSIGWVFRTEISSNIRK ALGKELDMPRIVPVLDLSRL EQGASERRTFLADLRSASRD IGFFYLAGHGISWAEISEVL TASRQFFALPEADKLAIEMV KSSQFRGYTRAGGELTKGRE DWREQLDIGVERQAIAQGPG TPAWTRLQGPNQWPAALPDL KPALLAWQSKVTAVAIRLLK AFAQSLDQPEDAFDPIYSSE PNHRMKIVRYPGRDTTGGDQ GVGAHKDGGFLTLLLQDDNK GLQVDYDGSWVDVDPIPGTL VVNIGELLELASNGYLRATV HRVVTPPAGVERISVPFFFS ARLDATIPLLGLSEELAAQA RGPASDPDNPLFRDVGTNVL KSRLRSHPDVARRHYADLLK GESRVG >_0045.001096_ LBUL_20SEP02_SCAFFOLD58_REVISED_GENE1628 lbul_20sep02_Scaffold58_revised_gene1628 LRIFGKEGCRMEMQRVIELD MPVNFRDLGGYQGLDGRQVK WRKIYRSAALNEMSARDRVK LANLRITVDCDLRSSREQRS YPDLLWPGVRFVNIGLYAEG DRFNQAHPFLRLFHHLPEFD DYLPKIYQQVLLNEHSEEGI KRVFEELLKLPEDQALVYHC AAGKDRTGIISILILMALGV DDKTIAEDYLLTDELYDFSI EKQHPTNEKLSQVIAKMNVT RGEGTAVKGITETIRQGWGS FDKFFTRKLGFKQADLEKFR EMYLEEEEED >_0043.003740_ JANN_22DEC04_CONTIG27_REVISED_GENE3741 jann_22dec04_Contig27_revised_gene3741 MQKYFVSPPKDCRITPVLRY DMRMNVSDEDLALAAAGGDG AAYQLLLTRVYDRLFGLCFR LTGSRTEAEDLTQDICLALP GKLSGYRAQAKVTTWLYRVA VNAATDRRRRKASYTRATQG WGAWEVDRVQTAQEATAGAR WLYRTMALLPTHLHDTLALV LDDVTHAEAAEILGVSEGTI SWRVAEAKKKLREIKEREDA >_0038.001354_ NP_444226.1 gi|16554502|ref|NP_444226.1| Putative amidase [Halobacterium sp. NRC-1] MIAGSAGYLLPASPEQRVYL ADEAWPELGDYFDAESLALV PLGSTEQHGPHLPESTDHRI AEAFARTVADRTGVLCTPPV NVGVSPHHRQFPGTMWVEPP VFRDYVASFTRNLAFHGIDR VVFVNAHGGNVSHLREVGRR LRDEGTLYAVEWMWDESIPE LVDEVFDQNGPHGGPKETAL MQYLDGEHVHDDRLEDARDG GVASVADADTIKHGSRTFYD AIDNTDNGVLGDQTDATAET GERLFEAASDQLVQLCEWLD AQAFADLLPEPHVST >_0031.001170_ NP_294099.1 gi|15807627|ref|NP_294099.1| conserved hypothetical protein [Deinococcus radiodurans] MRVCMNIAGNLYTDAMNQGC RLYQSDMKLYVPGNSSYFYP DVMLVCGGEPHDRYSETSPC LLVEVLSGSTADTDRRHKYA AYTGIASLQTYLIVSQSERH VVEYRRAGNGWEMHEHRDVG EVYVACLGRTLTLDEIYRGM L >_0028.000670_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE0671 ddes_06jun05_Contig143_revised_geneDde0671 MNSGLRHIPEPRCASPADTA PPCCAGPLCYVISSYCCCLF PCFFPKGVYPMTLTRRSAFF HLPVFTAVILCMVMLTGCAE EPQEQGPPPVKPVKVLTVGS KDKGVRRVFPGKVVAGEKVE LGFRVAGQLTRFPVKESQHV EQGQTVAELDKSDFITKVRN IESQLGGARASLNEATLNFK RMETLLGQDTISKADYDKAR ASMDNANAKVLSLTQQLKQA TQDLEYTTLRAPFSGVIAKK YVKNYELVQAQQPVLKLENT DRLDVEVEVPEFVIAQLRHQ DRKNMPAPVARFSAFPGRSF ELSLKEYQTSANPQTQTYTV TLTMQSPDDIRLLPGMTADV EGTLPFGGDIDATSLPVAAV VGGEDDRTYIWVLDKESMTV SRRAVVTGAIHEDKVAITQG LEPGMTVVAAGANYLHEGQK VRILNGKIGGGQ >_0028.000252_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE0253 ddes_06jun05_Contig143_revised_geneDde0253 MHKAGRAKAEKHMRQLELKD SMKIGIKELDDQHQQLTEII NELYYAYMQGNHRAVLCGLI TRLNDYAHEHFALERQYMER FVFEMPDYEQHMQEHREFFT DAIGFLLRYIEEGSEITPEM LDYLQGWWKDHVMVRDKELG RFLRSRGVTA >_0024.003018_ CHUT_08NOV04_CONTIG199_REVISED_GENE539 chut_08nov04_Contig199_revised_gene539 MRILEINTEKGWRGGENQTL LALIGFRKLGHEAELLCFEN SALHVQAAAAGFTCHPLTSS TKSIGFLMEYGKNYSILHTQ TSKQLTYCILTKPFHNSKVI LSRRVDFVPKGFFTLLKYNA CDGIICVSGAIEKILKQSGI KTRTVVISDCVTEKTLNKTR AQELLQKLTISGKNIIGTTA ALVPHKDPVTLVNAVNILRT KRSDFVLLHFGSGPLAATIQ QFITDNNLQDYYKLIGFKER VEDYFSIFNYFVMSSQEEGL GSSVLDAFVYKVPVVSTNAG GLNELVTGRGYVTEKKNAQL LAEALHTAMNSPEQNKKNVA AGYTYAVSNLSVEKIHEEHI DFFKTV >_0018.000681_ BFUN_06OCT04_CONTIG480_REVISED_GENE350 bfun_06oct04_Contig480_revised_gene350 MSTLENVHASPIHFMSKPAE IEHTQRPLNEASHAPGYIYG DPDVLRREKERIFMKDWLCI AHVDELPNPGDFVTHTVIDE PVLIVRNRSGVLQAFYNQCR HRGVEVAEGRGNAKIFKCPY HAWSYDLDGKLIGVPFMKEA QGFDQKNCSLKPVRLDTWGG FIHISFDPDAEPLSSYMAEY EQEFGMLRQGELRLGLRWET DFDCNWKFVYENLMDIYHVG VTHADTIGRYQDQSSYRYLQ LPRGRASIHYRAKTMSETGS SLFGRLPWIDDDSFARIGYL PPNLTMLARCDYVRPVTHWP TAVNRTHSVAYFLFPEDKIA DPQFQEKIQTYVKFVEQVLD EDRGMILSLQRAMNTRGFEP GRMSFMENAIHHALGYHLER LFGPQSA >_0010.000556_ YP_033400.1 gi|49475359|ref|YP_033400.1| hypothetical protein BH05660 [Bartonella henselae str. Houston-1] MINFLPYSLDILRAADRDRY ISVLFAPKKKRRALAALYAF NAEIARIRENVHNPLIGEIR LRWWYDSIAKSEMEKSESNP ILSDLFMAMTLFNLPKTAFL RYCDAQILDLYHNPIATLYD LEFYCGETASIILQLSCQIL DPDMAQNFTDAYQHAGIAQG LSGVLRLLSFMQSRYQYYLP ADMLKALGIKREELESNRIS KEQKCHVIEAMVALSRDHYN RFYEYFNMMPKTLKPAFLPL AIIPASLQKAIQLGAAVFQE NAALPLLHRYWFITKAAISS NLPKIL >_0005.004123_ YP_322847.1 gi|75908551|ref|YP_322847.1| Protein of unknown function DUF820 [Anabaena variabilis ATCC 29413] MTQAIPKLVTFEEFVDRLPE NSGIRYELHNGSIVEMAQPV GDHEEVKGFIAKKVTVEFAR LDLPFIIPNQVIVRPLEKDS GYFPDVLVLNQTNLINEPLW KKTSTISLGASIPLVIEIVS TNWRDDYYLKYADYEEMGIP EYWIVDYAALGGRNFIGSPK QPTISVCNLVDGEYQISKFR DNDVIISQTFPELNLTPNQI FQAGLVDS >_0005.001309_ YP_322596.1 gi|75908300|ref|YP_322596.1| Protein of unknown function DUF820 [Anabaena variabilis ATCC 29413] MGVYHRIRVSRQTTTLPNII LTKKMIAAKDKAPQLTPQEY FIWEEQQLEKHEYINGEVYA MTGGSVNHGRIAIRFTAMFD SHLQNTGCITGNSDIKVNIF GSNNYTYPDASVTCDVRDQT TTQYITYPCLIVEVLSKTTE AYDRGGKFRMYRQNPALIDY LLVSSTSVEIDLYHKNDAGD WLIINYKPGDTIELKSINLN FPIEKVYRGLTLEPENGG >_0004.004077_ 17742778 gi|17742778|gb|AAL45109.1| formyltetrahydrofolate deformylase [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008689|:1442543-1443508, Atu4315 MAGGSFMDDLRRGAGDLSVR ASVREQDLNTYIMKVSCPAR SGIVAAVSGYLARSGCNIND SSQFTDQETARFFMRLSFIS EQGSGREALLDGFGSVAADF DMDYDIHDLSQKKKIVIMVS RFGHCLNDLLYRSRIGALPV EIVAVISNHLDYQKQVVNED IPFHHIRVTPETKPEAEGAI LQVVRDTGAELVVLARYMQV LSDQLCQEMSGRIINIHHSF LPSFKGANPYKQAYERGVRL IGATAHYVTADLDEGPIIEQ DTIRVTHAQSGMDYVSLGRD VESQVLARAIHAHIHQRVFL NGNKTVVFPASPGEHVSERM G >_0112.001592_ YP_909492.1 gi|119025647|ref|YP_909492.1| probable 6-phosphogluconolactonase [Bifidobacterium adolescentis ATCC 15703] MNVTERKTVVYPNPQVLAEA VAARTLLTIIDLLAEPNRKR VDIAVTGGTDGNRVFAAMNA SPLNDAVDWSRVHMWWGDER FVAADDDDRNAKQAREAWYG ALVESGRMPAANIHAMPADT RGAEEIAAASPEQTDAVLAE AAAEYQRELIEQLGENPALD IAMFGMGPDAHFASLFPDHG EAEIDDPHVLVAGVRDSPKP PPLRLTLTVPMIARSKHTWV FTSEERKASAVKAAFAQRNN PHAPSSYADGEELLWLIDEG AASKL >_0111.000209_ YP_210857.1 gi|60680713|ref|YP_210857.1| putative transcriptional regulator [Bacteroides fragilis NCTC 9343] MNTNERPEMTEKQSCHWYLA FTASRAEQRVKQELDQRKVR NYLPLRKITYQWQGRSREAL CPQIARCVLIWTSLSDIRQL SGISGLIIPQNIWDYRVPEW QVESYQLLFSQMDTAVEWIP DCLESATMVRVTGGPLTGLV GELDTSDTGFRIRIRFHSMG CFRVAVPEEWIEKF >_0090.002210_ YP_165425.1 gi|56695078|ref|YP_165425.1| transcriptional regulator, TetR family [Silicibacter pomeroyi DSS-3] MTRPPQKRRLETRARLLAEA ARLVEAQGYAGLRVEDVVEA AGVAKGTLFSHFTDKDGLLA VLIGAEVMRLIDGMETAGPP DDLPQLMHRLAPLLDFIASD RVIFDLLLRYSGSTGAERNE VVADGFYRQIDLWAGWIAGM QAAGTVRGDHPPALLAEGIQ AFLNQIIAIRFCQGGQATGT PAGALYPFLAAWLIT >_0076.003229_ SAMA_14OCT04_CONTIG97_REVISED_GENE3233 sama_14oct04_Contig97_revised_gene3233 MDFVAVAVHTEYRCNQGWET RAECHWFVSTIRAIMTATQS ISLVGGEGDELLMQRYASGD IQAFDTLYRRHKGGLYRYFL RQIGDSQLAEDLYQETWGRV IKAAPAYEISAKFTTWLYRI AHNLLIDHVRAVKPLNEADT LDEEGMALTDSMTPVRSHEH DLKVQALKHCVGLLPQVQKE AFLLSSEMGFTALMISEIAS VSLEATKSRIRYAYQSLKTC VAKRLGEDSDER >_0076.001704_ SAMA_14OCT04_CONTIG64_REVISED_GENE1707 sama_14oct04_Contig64_revised_gene1707 MGVPGMNKEAKVLKEAVFKS FDNTDSLETQLADRIARQLQ DAVDARGKASLVVSGGSTPL KLFKALSNKAIDWNEVFVTL ADERWVDNAHKDSNERLVRE NLLQNRAASAKFRGLKNMFA SAEEGAAMTSEHLANVPRPF DVVVLGMGNDGHTCSWFPCS DDLMRALDSDALCEAVTPRT APHERITLTKKAILNSRQIY LHLVGEQKLSVYRQALESDD IKAMPIRVVLGQHKTPVDVY WSA >_0074.002315_ RRUB_10JAN05_CONTIG98_REVISED_GENE3082 rrub_10jan05_Contig98_revised_gene3082 MEKSTSPAVDGRSASRSGAP WTGAAPWRPLIWLDVYSVGD PLLDDDHRRLMEEINHLGAA LINQIAPVSEALIAPLKRLA HQEAEHNLREEAILAQLGYP GLEDHRAEHRQLESGLGALV ESLIPQGPIEPEILADLLKD WFVRHVLGQDMRYKTYVLEG RERAAPPRPPLAQPKT >_0072.000809_ PROC_21JUN05_CONTIG39_REVISED_GENEPMN12A0811 proc_21jun05_Contig39_revised_genePMN12a0811 MSIDLVVPKEVELPTLLDEL RKLSWASADVLMAYARGGEP PYGFPKSLNVEEGGDGPVSA ADMAVNELLISGLKDNLAFK EWDILSEETSKEKTFQQDNY KKDWCWILDPLDGTKDFLQG SENYAVHIALAYKKKPKIGI VLIPEKNELWFGIVGIGAWF EDRDGSKNHFSFSDRLDISK LILVSSKNHQQSKLNNLLST LCFGETKKIGSVGCKVASIL RGEADVYISLSGKTSPKDWD MAAPHALIEAAGGMFSHADG KNLVYQEKNYSQSGCLIASH GKSHQKICQKAMDFFSLEEP KYFV >_0071.001709_ NP_745886.1 gi|26990461|ref|NP_745886.1| transcriptional regulator, TetR family [Pseudomonas putida KT2440] MPPVNAAMRCANFEERRDRA MALFAEKGFGQVSMRELAAH VGLTAGSLYHHFPSKQDLLY DLIEELYEELQATLDQARRA MARGASALSCLIAAHWQLHA ERPLQFRLAERDLCCLSEAQ QAHLASLRKRYEAGLLRLIA PQAKLPGQALDATAHVLATL LNQLPGLLKAMPEEQGLELM ENLLVGGIERTLR >_0067.000935_ NP_142993.1 gi|14590920|ref|NP_142993.1| aspartokinase [Pyrococcus horikoshii] MNLRGKMIVIKFGGSSLRSE FKSAVSLIKALSEEKDVVVV VSALKGITDLLEKYTDTFDS RYAVEVSKTYLEFGKRMGID TSSLSPYLKQLFNPPDLPPQ ALRDYILSIGELLSAAMIAE KANGAVIFPWDLFVAHGSFG NGFIDIKKSKRNVKLVKEAL ESGKIPIIPGFIANLNGYRV TLGRGGSDYSAVALGVLLNS ELVAIMSDVEGIFTADPKMI PYSLLIPYMSYDEILIASKY GMEAIQWKAAKLAKEYEALI LFGRASDWRMGTVVSNSSSH MPLLSYDQGKLLVMNMDSEI PYEVVEEGEFWRVYRVPKRD SIKIIKELHRKIIYQENAQL LGRVKA >_0066.000552_ NP_906205.1 gi|34541726|ref|NP_906205.1| DNA-binding protein, histone-like family [Porphyromonas gingivalis W83] MIKIKAIERKAGFGKTSKTL WYPAIHLHSDVKFEEFIELV SDETTVSSADIKAVFDRAAK VLIRLLQDGKSIDCGDMGTY RPSITAKAGSGVDSADKVTV ELVDKAKVIYTPRMKVKTAL KGVRMERAERVLDVPYASSI KPNENNSGGSSSSDNNGHAG L >_0061.006474_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR6327 npun_22dec03_Contig1_revised_geneNpR6327 MFQQIAAQITFSDSFPYLVT ALGITSTAGIFGWRWWKQKN TYKSLQSFPSPKRHWLLGNI PQVLAAVKEKKFFQLLFDWS QQLGPMYVYWTGFPVLVLSK PKVIEDTIVNGMRDGSLIRS QRASKAWNDIGGPILLGQNG SEWQYRRKAWNPEFSSSGLS KYVEIINQACEQIIEKIQSV ASPEVQVDPLFVELTMRVIS CLVLGIPVDKNIATNEGQPL DVLKVYEAMSIVGYRFLRVA TGEKIWMKYLPTKNSRDYWA ARRYLEEFITPRVDLALQMR EQNQTDLTQVSPLFQESMLV KIAAKEPKYNRETLVAEVIE LLIAGTDTTAHTLSFAIGEL ALNPRVFHQAQAVVDQVWES QGTINGESLKELNYIRAILK ETLRLYSVASGSTSLEAQRD TVIEGTVIPRGTKIYWSMLA AGRDPEVYSHPDEFLPERWL EKGKENSQLPMIDFGSGSHR CLGEHLSMLEGTMMLALLVY YFDWELVNGRSSLEQLQQNL LIYPPDRMPVRFRLRK >_0055.004296_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA3671 rgel_26jun05_Contig562_revised_geneMpeA3671 MLTLRQALLALFHQFETLFH YLEHLCRHRSAARPGRSSFL HSSVEIQVKETLIRNLWYVA GTSDEFAPGQLVGQTIVGLP LVLWRPEAGGEVVAFDGRCC HKRMPLAAGKLLADGTLECA YHGLCFDSAGKCVKIPSQPG RAIPSRAKLRPYPIVEQDGM VWIWMGDDANPALRPPRIPE LASPEWEAIRVQNAEVPTNY LLLIENLLDISHFYPLHDGN IGDIANSEIPVEFSHEGVQG NRSVTSVRKAESYRHPPYLE DFFGYELVDRHHSHSMISPA AIRVDLRCAPPGRLETEDER GYILTHLITPIDERHHRWRV IAACRAGHRWPADPSVSTVA RVAEKFPGVIAQDLWALTEQ QKMLDLADTHYLGEMHLRAD TGILKAREIVRDMVAAEAQP ASQAVAA >_0051.000680_ NP_247756.1 gi|15668952|ref|NP_247756.1| cobalamin biosynthesis precorrin-2 methyltransferase (cbiL) [Methanococcus jannaschii] MNKLVKKVYGVGVGVGDKKL LTLKALEVLKKVDKIFVPVS KKGKKSIAYEIIKDYVDGKN IEELLFPMIKDKERLKKYWE NALEKVLKEDGEVAIITIGD PTLYSTFSYVWKLLKERGVE VEIVNGISSIFASAAALNIP LVEGDEKLCILPQGKDLEKY IDEFDTIIIMKTKNLNEKLS VIKNRDDYIIGLVKRATFED EKVVIGKLDEINFDEFNDYL SLAIIKRFKR >_0049.002241_ NP_786051.1 gi|28379159|ref|NP_786051.1| N-acetylglucosamine/galactosamine PTS, EIIA [Lactobacillus plantarum WCFS1] MKIIVSGHGNYASGLQSTIH LLAGDIANIEYIDFTDNMDD IQLSERFQRAVIGENNVVFM CDLLGGTPFKEAVKLSQVSE KDIAVTSGCNVGALLEVGFE LTTYSAPAKKLAEKLVKISQ VQTKVFHHQAIQVDDGVEGI >_0006.004465_ NP_890881.1 gi|33603321|ref|NP_890881.1| hypothetical protein BB4347 [Bordetella bronchiseptica RB50] MRCPFCGNADTQVVDSRVSE EGDTIRRRRRCLSCDKRFTT YERVELAMPSVVKRDGSRTE YDAGKVRGSLSLALRKRPVS TDEVDSAVARIEETLLASGM REVPSEQIGELVMGELKRLD KVAYVRYASVYKSFEDIGEF VEAIREMQGPLLPGKKLRKD >_0094.000217_ SSUI_28JUL04_CONTIG158_REVISED_GENE219 ssui_28jul04_Contig158_revised_gene219 MKETQMLKGVLDGCVLQIIS QKEIYGYELVQELRNQGFEN MVGGTVYPLLQKLEKNDLIL SQNKPSPDGPDRKYFYLTDQ GKAYLEDFWSQWTELVQKVQ RLKGE >_0088.001719_ NP_719851.1 gi|24375808|ref|NP_719851.1| transcriptional regulator, TetR family [Shewanella oneidensis MR-1] MNERSFISMSIGNRMDSKRD LILRSAEKIIATEGLHNLSM QKLAVDAGVAAGTIYRYFKD KEDLIIKLRKDVLQQIASKL LENIDEGSFDEKFRRLWFNI VELGREQSHANLSFAQYSHL PGVDAPEHQAFEREIFQPLH QLFEQAKGQGVIQPLNNALL FSIAFEPAVAIAKRLRRGHL QCTEHEIQQACELCLQAISV TL >_0085.001409_ SHEW_20DEC04_CONTIG142_REVISED_GENE1410 shew_20dec04_Contig142_revised_gene1410 MLYVIKRPLAVAFYRLPLHR TLGRGIDLLLITLQHAGRGL PQREITGRFPQFIAIGLMIK RKKLIQCRQRRHLAVTRILS IMCAYFDMEKRLMTQVHDPY RDAPALSKLTLGKSTGYQAQ YDASLLQGVPRQLNRDAIAL GDTLPFHGADIWTGYELSWL NAKGKPMVAIAEFILDFNSD NLIESKSFKLYLNSFNQTRF DSIEQVQQTLAKDLSACAGG EVVVKIIEPKQFAHQRVVEL PGTCIDDLDIEIDSYEFSSD YLIDSTDDKSVVAETLTSNL LKSNCLITSQPDWGSVMIRY QGPKIDREKLLRYLISFRQH NEFHEQCVERIFVDLKRLCH CAKLTVYARYTRRGGLDINP YRSDFENPPENHRLARQ >_0077.001098_ NP_371196.1 gi|15923662|ref|NP_371196.1| LysR family transcriptional regulator [Staphylococcus aureus subsp. aureus Mu50] MIIEHARDMLKRERLFFDKM QAHIGEVNGTISIGCSSLIG QTLLPEVLSLYNAQFPNVEI QVQVGSTEQIKANHRDYHVM ITRGNKVMNLANTHLFNDDH YFIFPKNRRDDVTKLPFIEF QADPIYINQIKQWYNDNLEQ DYHATITVDQVATCKEMLIS GVGVTILPEIMMKNISKEQF EFEKVEIDNEPLIRSTFMSY DPSMLQLPQVDSFVNLMASF VEQPKA >_0060.001516_ NMUL_10JAN05_CONTIG15_REVISED_GENE1517 nmul_10jan05_Contig15_revised_gene1517 MKDSALRVGIYQGPEIPQSF KVYAENVWRHLPKQDIATIP FKDRKDLPKSADVLWDIRSG GGNPPPDFLLEHPLPPLVVT VHGFAPLSLNGWEYFRTLKG LIMSGQYAKHKRERWREVRT AVGSIIAVSAFVKDEAIRFT GVPADRIHVCHHGVDGNAFT PGPDTESEPYFFHVSNDEPR KNVGAIVRAFRQLRRHCRVQ LVLKLPEESARNYEKIEGIR VLSGFLTTEELVHLYRHALA FIFPSLYEGFGLPILEAMAC GCPVITSNVSACREIAGEAA RTINPRNESELLEAMEILYR NPEERRARTAMGLRRALGFS WEESAKCHARVLYMTADRNA PVASPD >_0055.004099_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA3474 rgel_26jun05_Contig562_revised_geneMpeA3474 MSSRRLRIGLLTHSVHPRGG VIHTLELADALHEAGHEVTV MAPALPGQALFRTPRCAVEL VPVAPAPADLASMVASRRDA CIDHLAPRLERGVGWDVLHA QDGIGGNALATLQERGLIDG FVRTVHHLDRFDDARVMAWQ ERAFLRARQVLCVSQTWCDT LRREHGVAAALVHNGVDLQR YGRQAGAADARVRRRFGLRV GAAHDAPVYLAVGGIEERKN TVRVLQAFAALRARQPQAQL VIAGGASLLDHDRYAREFTE ALAASGLRVGPGADVVITGT VADDEMPALFRAADVLVMAS LREGFGLVVLEALACGTPVV VSRQAPFTEYLPADERHGEA CWADPLNPLSIADAMARACE PERAQALARAVPEVCRRYSW TASAARHVALYRAMRALVGH GVPLAAAVPTEPAAMDAAPV VS >_0055.001306_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA0675 rgel_26jun05_Contig562_revised_geneMpeA0675 MTTASTTLRDLVGLGHRPAA LGEAALIMIDCQNTYRQGVM QLTGVEEALVEARKLLEIAR ARRMPIFHIQHDAGAGSPYD VRAEIGAISAEVAPIAGEPV ITKHYPNSFVQTDLDERLKA LGIQQIVLAGFMTHMCINST AHGGFNLGYAPTVVASATAT RALQGPNGKVLTAREVHDGA LASTRDLYAAVVDSVADLGN >_0049.000065_ NP_784849.1 gi|28377957|ref|NP_784849.1| glycosyltransferase [Lactobacillus plantarum WCFS1] MIMRILVVGDFLKTSGMTRY IFNVIGRIHASDFQVDALSI SGKDECKGQVKKMGWGFYCV QAANNNLLRHLVQSFIFFKK HAKDYDVIHFNESALWNFLP IIFAHLFGAKRIILNSHNTY FATEGSKKLYVVLEIMHRIG KWLVAHVVAQNIAVSEEAAR WMFTRKTINRHEYKIIVNGI ELEKFAYNKEQRFKIRKSLK INEDTFLFGNVGIFNARKNQ LRLLEIFNELVKRNLDVKLV LVGDGPIKEAIVAKISELRL ADKVVMTGLINNTNEYYQAM DGVIMPSLNEGLSTVLLEAQ TSGLQFILSEGLPLGNHIDL LVHEVPLSESNRVWADVVEH NMNREKREVFGSIMQKKGFD VELSAKELYQTYLEER >_0048.001278_ YP_013453.1 gi|46907064|ref|YP_013453.1| transcriptional activator, putative [Listeria monocytogenes str. 4b F2365] MVAYGELIREVRLSKGLTQK EVYTGVISKSYAIGFEKGKH DITLVLFEEILERVMLSSDE FFFMNRGYSLAEEDNFWYKF ANAANQKSLADLQELYQEVL QQTGDRANLRKAIVHSRMEI NEQFLLNNRFDVSIVSEKDK AVIQTYLWKVQSWTLEEIRI FANSVDYFEEDVQIYFFQLV LKSLEKYKHYDRGKKVFSTL LSNIIEELITRNQLEYAAQL LEILHELSSTHDCAFYRIMH NYYQGLIWMKNDQVEQGLKE SKSAIRILDVLDYKSLAQLY NTLLQQFLEKENIQIV >_0036.001024_ EXIG_01APR05_CONTIG276_REVISED_GENE1036 exig_01apr05_Contig276_revised_gene1036 MRQAVLVIDFQQDLVEGTAE EAGVHAKENMIQVINQVVQE AENEGHVMIFIRDLDVANGQ GEGFAVHSSIQIPDQAVTFD KAATNSFHGTPLLDYLKQHR VEHIVILGCKTEHCIDTAVR SATVNGFDVTLVADGHTTNG SDVLSAEQMIAHHNQVLHGH YNVEHFAIVRPSTETVFEPI HHQYRK >_0033.001590_ YP_048509.1 gi|50119342|ref|YP_048509.1| hypothetical protein ECA0383 [Erwinia carotovora subsp. atroseptica SCRI1043] MITGNVHHLDLVPYLPAKLR EAIEYVKQNITADTPLGKHD IEGNSVFVLISNDSTDLLEK RRAEYHAKYLDIQIVLSGVE GMTFSNLPAGKPDTDWLADK DIAFLPAGEQEKLFVMQEGD FVVFFPGEVHKPLCAVGEPA HVRKAVVKIDASLVV >_0023.001152_ NP_599656.1 gi|19551654|ref|NP_599656.1| exopolyphosphatase [Corynebacterium glutamicum ATCC 13032] MSNWRTPLRLVELLDDSGAI SEKGINKLTSAVGEAADLAK TLGCAELMPFATSAVRSATN SEAVLDHVEKETGVRLSILS GEDEARQTFLAVRRWYGWSA GRITNLDIGGGSLELSSGTD ESPDLAFSLDLGAGRLTHNW FDTDPPARKKINLLRDYIDA ELAEPARQMRTLGPARLAVG TSKTFRTLARLTGAAPSSAG PHVTRTLTAPGLRQLIAFIS RMTAADRAELEGISSDRSHQ IVAGALVAEAAMRALDIDKV EICPWALREGVILTRIDKGL E >_0022.000937_ NP_939051.1 gi|38233284|ref|NP_939051.1| Putative reducing hydrogenase alpha subunit [Corynebacterium diphtheriae NCTC 13129] MSTTLRLDQFVDPFEAKVVY TDSGGYFDLSGLPRLDPMLV GRNVAEVPDIVKRLCGLCPV AHHLAGVRALDALCGVEVPE SAQLVRLLLHHGSILYASRD MDIKRLGKAVMAAAGSPGHF PDVAIPGGVRALPDPQALAE LRLPENYEESFEPVPYDGFD MMLTSHGTLDPLGDHVTARS GEERITFDLQTWADHVAESR PGDPAPRPLVHGHPYRVGPY AHGEAMVPRSLAEIRRIIAD PRLCEGEFRKESSILSGIGV GAVEGPRGLLVHRYVANEDG VLVDCQILTPTAQNEAWLAS MLEASLQQDSDRGVELSVEN SIRTANPCLPCSSAPEGHMG VVIEKGN >_0020.000861_ CAUR_25MAY01_CONTIG1079_REVISED_GENE1276 caur_25may01_Contig1079_revised_gene1276 MNGSALYNIGFVLEQALGHV THAKNLQANISNDPDVRPHW ALIPFETHGLAARIPLYKSN WTVRAGLRARRSLRRLTRHT TLDALFFHTQVPAVLAADWL QRYPGIVSLDATPLQYDQLG PYYQHDPGPPWLERIKWQLN RRCFQLARHLVTWSQWAKDG LIEGYGVPADKITVIPPGVN VHEWQRPTPRTRHEGPVKIL FVGANLERKGGLLLIEAFRA LRPLGVELHLVTKDQVPDEP GVFVYHGMQPNSAPLKALYH QADIFALPTFGDCLPMVLSE AAAAGLPAVTTRVAAIPEIV RDGETGLLIPAGDLNALTEA LRRLITRPDERLRFGERATI HVSRMYDARHNAGRLLDLIK GEVDLARMQERIPA >_0019.000691_ NP_348086.1 gi|15894737|ref|NP_348086.1| PTS system, fructose(mannose)-specific IIB [Clostridium acetobutylicum] MSVSFLRIDDRMIHGQTCTR WALEYPCDGIIAVNDAAANN PVLKAAYKSASGKKTFVWTY EHWKLKCDTVLKSSTRYFVI TKEPIIMSKILVDDKFNPGI KEVIVGPCNDRPGTVKLGNN QSINQKEAEAFERIMQAGYN VEFALLKEEAIGNWKKFRGQ FGFK >_0018.000390_ BFUN_06OCT04_CONTIG480_REVISED_GENE135 bfun_06oct04_Contig480_revised_gene135 VPESIVTPPMRASAPHADGA RDASPFVPVRSPATFAQSPV SLAFGQPLLLGLFLPIQAGG WSASTLPRTTDWTFDYNLEL VQKAEAFGFDLVFALSQWLP NGGYGGVFNGQALDSFMSLA AMTARTERIILVATSHVLYG PWHPLHFAKFTATLDHISKG RWGINVVTGHRAIEHEMFGW HRIDHDRRYQLAAEFLDAVQ QLWAQPDNFSFAPQLSSWKL DKAFVTPKPRFGRPLLVNAT GSDAGIEFAARYSDVVFITS PAGSEIEAALAALPAHTARV KAAAAAHGRPIRTLINPMVI CRETAAEARAYHDAIVAHGD EGSFHRFDSDAHAWRGGAGQ RNSAAARAVGGNIAIVGSPQ QIADYIVRLHRAGIDGVQLS FFDFKPDLEFFGDRVLPLLR DAGLRH >_0017.002650_ NP_810164.1 gi|29346661|ref|NP_810164.1| transcriptional regulator, MarR family [Bacteroides thetaiotaomicron VPI-5482] MIREGAEFRELMLQVFRTRM AFRRSMQRTLRKNNAGITFE MLQVLSCLWHEQGISQQILA ERIAKDKACLTNLMNNLEKK GYVHRKEDPADRRNKQVYLT PEGEEFKEQIRPILDQVYVY AEQVIGIESIELMLSELKGV YDVLENV >_0016.000649_ YP_440318.1 gi|83717228|ref|YP_440318.1| transcriptional regulator, LysR family [Burkholderia thailandensis E264] MMRFRAAMPRVSLSLEEVTV PDALARLRNGALDIAAIHHI PALERDFAQSPLCSTPFVVA MREGHPLAGARRLAELLDAE WIVTVGADQFPHSVMMSMFN AHGLPLPQRLLRAPSSFAVT LGLVARSDVIGCFTKPLAAM VAPLGIRAAHLDDMLPSYDL SILSRRDLLPTPAVAQFVAC LRQAADETLT >_0004.004577_ 17743329 gi|17743329|gb|AAL45609.1| glycosyltransferase [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008689|:c1995280-1994105, Atu4815 MKIIHVIASIDPKHGGLQAV AMRLAAAQAGLGLDVNIVSY GDAAIEAQVKEIGRSIPHFE NIHWHLLPPAGRLETLFCFR GRRMLKALLKDASFMHIHGV WEPFLLYASKLARAAGVPYC ICPAGMLDHWSLGQRSWKKK IALMLCYRRMLNEAAFLHLL NVDEMAAVETLNFQARNLII PNGVFAEEFDPLPARGHFRQ KIALAHGRRYILFLSRLHIK KGIDILASAFAAICETYVDV DLVVAGPPGGAEGHFMHLVK KLNIRHRVFMVGAIYGKAKL EAMVDADCFCLPSRQEGFSM AITEALACGTPVVITDQCHF PEVGSADAGLIVSVDAAEVA KALASMLGNPARARTMGENG RRLVLEKFTWPAIAHATLEG YRLSALEAAAS >_0003.003178_ ARTH_26JUL04_CONTIG45_REVISED_GENE3183 arth_26jul04_Contig45_revised_gene3183 MNPSSRHPQHSDTAASGASE GVPRRYASGRQYELRRGDAL AVVTELAAGLRLYSRGGVQL TETYGDGDIPPGATGITLAP WANRVEDGVWYLNGKKQQLD ITEVSRNNASHGLLRNAAYE LVDEAEFSVTLEAPVFPQHG YPFLVRHRVQYLLAEDLGLV VRQTLLNDSQAPAPFVLGAH PYLRLGDVPASDLVLTVGTG SRLVADERLIPRSSEQVDGA TDLRGGRTVSELDIDVAYTD LTFDGGVARQTLKASDGRSV SLWQDENCSYVHVFVTDQYP GRAKAVAIEPMTGPANAFNS GDGLRWLPAGESFTMTWGIS ASL >_0114.002261_ YP_877628.1 gi|118444554|ref|YP_877628.1| Transcriptional regulator, AcrR family, putative [Clostridium novyi NT] MPKQTFLNLSSERQKEIINI SLKEFSSHNFETASMNKIIT ELGIAKGSFYRYFKNKNDLY LFLLDYAVDKKIDYLERHID TESNNFFEIYRSVVFNYMKF DLTFPIMSKFLRSAVENRDI EQTKILNSLNGKTFIENVVI HGQEQGQIKKTLSVDFIILC IINLSESIINYIKLKLGIDY KDLLNMLDGKAIHYKNQLED IFNQLMSVLETGLKPVEIH >_0078.003009_ NP_827710.1 gi|29833076|ref|NP_827710.1| putative transferase [Streptomyces avermitilis MA-4680] MHISFLIHNAYGIGGTIRTT YNLANTLAEQHDVEIVSVFR HREQPVFAPDPRVRVRHLVD LREGAPDAGHPDLGRPARVF PRAEGRYQQYSALSDRRIAG HLASVEADVVVGTRPGLNVH LARETRRGPLRVGQEHLTLD SHSRALRATLRGAYPRLDAL TTTTEADAHAYRGTMRLPGV QVHAVPNPVPAPGIEPADGT GKWVVAAGRLAPVKRYDLLI KAFARVRAQRPDWRLRIYGG GKQKDTLRALIDELGLYNHV YLMGPANPIEPEWAKGSIAA VTSSLESFGMTIVEAMRCGL PVVSTDCPHGPGEIIDNGVD GRLVEVGNVEAIAEGLLELI NDDALRQRMSVAALKDSERF DPSRIAERYESLFTGLAPRG GATLGGKVRGSLHRTRGALL GGAYAVRDVGRTALKKGRAA >_0078.001157_ NP_824358.1 gi|29829724|ref|NP_824358.1| putative succinate dehydrogenase iron-sulfur protein [Streptomyces avermitilis MA-4680] MTSYEARFKVWRGDVKGGGL EDFEVEVNDGEVVLDIIHRL QATQAPDLAVRWNCKAGKCG SCSAEVNGRPRLMCMTRMSV FTREETITVTPLRAFPVVRD LVTDVGFNYAKAREVPAFVP PAGLGPGEYRMQQEDVERPQ EFRKCIECFLCQDTCHVVRD HEENKPAFAGPRFLMRVAEL DMHPLDAAADTGLDRKRTAQ DDHGLGYCNITKCCTEVCPE GIKITDNALIPLKERAVDRK YDPLVWLGSKIGRRPRPS >_0077.002569_ NP_370995.1 gi|15923461|ref|NP_370995.1| transcription activator of glutamate synthase operon [Staphylococcus aureus subsp. aureus Mu50] MEIKQLRYFIEVAKREHISE TALELNIAQSAISRQITLLE QELNVSLFKKQGRNITLTSE GKLLFNEALRIIEHLDSTIE QFQSHGLTKNKSIYIGYDES DVSHMLLPLIQTFHLQNDTH VIPSLLDHDTIINSVLNGNI DIGFTELTPEIRKHKQLHML PLFEEHYHLYAPSDDPITMA THPPLIQFEHSHIYCLAPFA ETVKKQLRKITKSDVYTISS QPLAQYLLRQKEGYIISSQN IHLPESKSWIDIKLDHTELK RTICAITKEPYTKSDIGILL TLIQQLMTKTSTFH >_0069.001715_ NP_895767.1 gi|33864207|ref|NP_895767.1| hypothetical protein PMT1942 [Prochlorococcus marinus str. MIT 9313] MIDGDSSPLLRKQSPYPHWQ YLHPESGDRLRIVPERGGLV TDWCCNGHELLYLDQERFAD PEKSIRGGIPVLFPICGNLP GDLLPLPSGEFILKQHGFAR DQPWELQLLEDHSGVKLSFV DSEETRAAYPFSFLLEMVVR PIRNALEIDVNVHNRSQSAM PFSFGLHPYFKVTDLNKVRL EGLPNSCLNHLQMAEAETAI QLAALTEGVDFLTRPAGPVT LVDEASGRSLQLQHENPMDL TVVWTDPPRQMLCLEPWTGP REALISGDRKLEIEAGGQQR LRCSFSINLEKTVRAASC >_0056.003335_ SARO_25NOV03_CONTIG30_REVISED_GENE3543 saro_25nov03_Contig30_revised_gene3543 MTGTTATRERRGKPSHRNDL RSELLAAAYGFVVREGHEGL SIRKLAEEIGVSPGAPYHHF PDRRSLLIAVALEGYHQLFA ETGKAVAASPEGLLFANLLH FIRFAAANPNMFTLMYESEL VRPQVAPELAEAQDIGFQML RREVTRATQHLSEHERSLRI ATIWSAIFGFALQTNRAMLR AHPLEPMPDELAPEIVRQAL RLMA >_0055.003796_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA3171 rgel_26jun05_Contig562_revised_geneMpeA3171 MSFDAMNTTLSLNTSAGSQA LTVRDPWALVPSLGDLNAYI AAVNRLPMLTLEEEQSLGQR LRDEHDLEAAGRLVLSHLRL VVSVSRQYLGYGLPHGDLIQ EGNVGLMKAVKRFDPSQGVR LVSYALHWIKAEIHEYVLRN WRMVKLATTKAQRKLFFNLR SMKQGFKGDATDSDLHRSTL TDAEIDIVASELKVKREEVI EMETRLSGGDVALDPQTDDG DESYAPIAYLADDRHEPTRV LDAQRRDALAGDGIGEALDV LDARSRRIVEERWLKVNDNG SGGMTLHELAAEYGVSAERI RQIEVAAMKKMRKALAAYA >_0054.001598_ NP_987907.1 gi|45358350|ref|NP_987907.1| Transcriptional regulator, MarR Family Member [Methanococcus maripaludis S2] MEKIKNTCKIWEYIYTISRD RLYQKIPKEEFLRLSTSDKE YLEAIKNLNNPKISELARDM GYTKTAVTTMVQKLEKNGYL KKIKSKEDKREINVELTDKG RYIFELKEEVYEYTIKRIKE VLTPQELGAFKAILEKIAKE FDQELGKIYEKNENIDKSDG PPII >_0053.003569_ MMAG_12JAN01_CONTIG3873_REVISED_GENE3588 mmag_12jan01_Contig3873_revised_gene3588 MSVKPPRKFFEPLAIGAPAP YRELPVRLERMIHFFPPHNE KMRAKAAEMGRNVDVLLGNL EDAIPADAKEAARAGFVEVA KAWDNPNTGLWTRVNCLNSP WFLDDMNAIVGEAGNKVDVV MLPKVEGPWDIHYLDQLLAQ LEAKHGVKRPIMIHAILETA LGVENVAAICQASPRMHGIS LGPADLAASRGMKTTRVGGG HPFYGVLEDSTEGKPGRTLF QQDLWHYTVAKMVDACQSAG IKAFYGPFGDFSDDAACEAQ FRNAFLLGCAGGWSLHPKQI DIAKKVFSPDVAEVLFAKKI LEAMPDGTGAVMIDGKMQDD ATWKQAKVIVDLAKAVAAKD PAMAQAYGL >_0053.001155_ MMAG_12JAN01_CONTIG3773_REVISED_GENE1166 mmag_12jan01_Contig3773_revised_gene1166 MIILAIETGMRRGEQLSMRW ADIDLDRRVVHLSMTKNGSP RDVPLSTLAVETLRGLYAAS DRHPEVVFNVSPNAVRLAWN RLRIRAGIPDINFHDLRHEG VSRLFEKGFDMMEVATISGH KTLSMLRRYTHLRAEDLAQR LG >_0046.002367_ NP_471832.1 gi|16801564|ref|NP_471832.1| hypothetical protein lin2502 [Listeria innocua Clip11262] MMKHLTLWHTNDVHSHLEHW PRIFNFLKEKRTAADKEYKA ALFFDIGDFLDRVHPLTEGT NGLANTDLLNQLPYDAVTFG NNEGTTLAHEDLDNLYEHAA FPVVCCNFYADKECTEQPDW VKSIVYKEIEQVKIAIIGAT APFREYYEEMGWGVEEPISA IKKQIAGLDADTAVVILLSH LGLPTDERIALELPEIDIIL GGHTHHLLENGKIEGNALLA AAGRWGEHVGKVTIELDENN QIISKKAVTFATEKLPIPPN ETAEIQAFFDKGREELSEKV VAIPGKLAHNWFDDSEIAHI LNEAVCEWTGSETFVMNAGI FMTDFEAGIVTDFDIHQMLP HPLNAIALTMSGEELEILID GIYRKKAELQDIPLRGFGFR GEYFGTVLMDRASFDSENQV ALFDNKPIDKTREYRIATHD TFVFAPFFPIVKQIKRKEVY TPELLRDILKWKLKKMYGQE EDT >_0045.001334_ LBUL_20SEP02_SCAFFOLD7_REVISED_GENE1843 lbul_20sep02_Scaffold7_revised_gene1843 LKAIKYGVTREHIREFKVAL TDGKVYKFGSKSVKSSSGYS LNDLIIGSEGTLGVITEAVL RLYPRPKHALNAIIPFPTLD DAIESVPAILASGVEYMGRK VLNLWEKYYDATFPIKEGDG FIILGIDSFTASDAEAQLAQ ALEAVKPFHALKETVLNAES DEAKTIWDAREKLLLAIQKS TPKMDEVDVSVPINKIPLVL HRIEELEKEEDMRIPNFGHA GDGNLHIYLCSDDMTDEEFA KKGDNVISELYKTAKSVDGN MSGEHGVGYARQNYYEDFYG KDYTDLLRKVKGLFDPKGIL NPDKIFPLD >_0045.000043_ LBUL_20SEP02_SCAFFOLD107_REVISED_GENE200 lbul_20sep02_Scaffold107_revised_gene200 MVGIVLASHGGFADGIAESA QMLFGPQDNFAHVILKPSEG PEDIKGKMNEAIASFADQEE VLFLVDLWGGTPFNQANGLL DGHDKWAIVSGLNLPMVVEA LTQRMINDKATAQDIATAII KPAKDGIKTKPESLMPAEEK AAAPAAAAGAPKEAIPEGTV LGDGHIKIVGARIDSRLLHG QVVTGWIPSLHPDRVIVVSD KIAKDDLRKSMIREAAPAGT VAHTVPLEKMKEIYEDPRFG TTHAFLLFENPEDALQTIKN GVDIKTLNVGSMSYSKGKVN ANNVLSMDQTDVDTFRELEK MGIKFDVRKVPSDNPENMDS ILKKAQSLLDEQSSSEKRS >_0043.002439_ JANN_22DEC04_CONTIG26_REVISED_GENE2440 jann_22dec04_Contig26_revised_gene2440 MAGEIPDLIAEARTGTGKGA ARQARREGNVPGIVYGGGID PLAINIPFNVLLKRLKDGRF LSTLFNMKVEGQDDVRVICR NVQRDVVKDLPTHVDFMRLK RTSKIALFIPVEVIGEEECP GLKKGGVLTMVRPEVELRVT AGDIPEQITIDLTGLEIGDT VTISSVELPAGAKATIDRDF VIANLTAPSGLVSAESEEDE DAPAADEVPATEVSEE >_0043.000310_ JANN_22DEC04_CONTIG16_REVISED_GENE311 jann_22dec04_Contig16_revised_gene311 MLELLTLTRVTKSESTWIMK KQKETSARASYHHGDLRAQL IEATRHLVEEKGPDHLSVSE ACRRAGVSTAAPYKHFRDKD DLLRAVAFEGMERQHAQMLE AIEPLPERDLARFSALGRVY IGFAQTEPAVFRLMFGLSED HADDEALVAKGRSTFGIVQT EAAAFRGSDTVEDIDLRRAF QLWSFVHGLAFLLIDGKLQQ MDLPLDLEDMLDDIGRKVML EE >_0029.001283_ DGEO_15APR04_CONTIG109_REVISED_GENE1288 dgeo_15apr04_Contig109_revised_gene1288 MSSLFQRLLNPRPSAIGVEI GTSTIKVVALRPGTPPVLQH AVMVPTPIGSMRDGLVIEPQ AVANELKNLLAEHRITARHA VTAVPNQSAVTRNIMVPRME RKDLQEAIKWEAERYLPYPI DEVNLDFDLLDDPATIPEDG QMEAVIAAAPSEAVARQVEV LRLAGLEPIIVDLKSFAALR ALRGNLLGEHLNKTTLAGLN YTEAGEVALVLEIGASSSVI SLVRGDRILMARNIAIAADD FTTALQKAFDLDFSAAEEVK LGYATAITPTEDEEALLDFD RAREQYSPARVFEVIRPVLG DLITEIRRSLEFYRVQSGDV VIDRTFIAGGGAKLRGLANA ISDALGFRVEVGSPWLTVQT EQANADTGYLQANAPEFTVP LGLALRGVQGHG >_0018.003838_ BFUN_06OCT04_CONTIG481_REVISED_GENE3839 bfun_06oct04_Contig481_revised_gene3839 MSYWIQISNNVRYSNTFEPV CQPGATREERSTTDPGGVGT VPAIHGASNLNIIPIMRARR ASITWPSEGLTRVPYALFQD EGVYADEQDAIFRGPNWSYL CLEAEVPNQGDFRSTFVGDA PVVVTRDTDGEIYAFENRCA HRGAMVCLEDQGNARDFSCV YHAWTYSLQGDLVGVAFKDG IDGKGGMKPDFCTGDHGLRK LRVATLHGLVFGSFSDDVPP LDEYLGEEIVERIARVLENR KPVVLGRFTQMLPNNWKLYF ENVKDSYHASILHLFFTTFQ LNRLSQRGGIIVDPSGGHHV SYSAVDHAAEAAAQRKATSD YADQKIRSESEHRLEDTSVL AGVDEFGDGVTLQILSVFPG FVLQQIQNAIAVRQILPRGT QQTELNWTYLGFEDDTPELR EMRLRQSNLVGPAGYVSMED GCVGGFVQRGIEGAGDGRSV IEMGGDSAESSASRVTEASI RGFWKAYRNAMGY >_0017.004540_ NP_813500.1 gi|29349997|ref|NP_813500.1| putative 50S ribosomal protein L25 [Bacteroides thetaiotaomicron VPI-5482] MKSIEVKGTARTIAERSSEQ ARALKEIRKNGGVPCVLYGA GEVVHFTVTNEGLRNLVYTP HIYVVDLDIDGKKVNAILKD IQFHPVKDNILHVDFYQIDE AKPIVMEVPVQLEGLAEGVK AGGKLALQMRKIKVKALYNV IPEKLTVNVSHLGLGKTVKV GELSFEGLELISAKEAVVCA VKLTRAARGAAAAAGK >_0016.000560_ YP_438228.1 gi|83717311|ref|YP_438228.1| cytochrome P450-related protein [Burkholderia thailandensis E264] MNGLPPSLPRVDSTESLFAE PLAFLAQARSRHGDVFVMRE HGPIFSRASDCSGVIAVFGE HRLRQILTDIDNFALPMSAA AKMALPKNLVNLNRGLHSMR EPEHGRHKRLLTGTINRELF DAHRFEIRAALNRFCEMLKV DRRISVVSRMRELTVEMASH IFLGAQCQEDDELAFLLSAY FTLRREASSLNARDPLLYRD ELIGVGQQLDRTLRERIRRY RKRPVDARAGLLQRLATAGP PGSPALSEDEIVGHANVMFV SSTEPVAMSLTWLLLVLSQL PDLRRALRAEIADRASMPAS TNGASWLENVVNETLRLLTP NALMVRATTRAVSLQGVALP ARCEIVVCPFLAHREAKPFP DPHAFSPSRWETARPSPYEY FPFGAGGHFCAGRNLALSLI REVLSTLLSRFDFVLDGEQS IDWRIHIMLMPKGDPALIAH PVDERGDTPSPKWRGPITDL FHFAPGLS >_0013.002652_ NP_878940.1 gi|33591296|ref|NP_878940.1| putative thiolase [Bordetella pertussis] MKAMAMTPKPTAAIVGMGDA YASMQDRKDPIQLAVAATHK ALADAGITKDQVDAVFTGRS PWADKRSQWSNIFCSHMHMP VTITSEITMHGAGLNATVAM AAQMIAAGRAQYVLCLQSDA TELFVDAVAMGAEADADPQF EIPYGPTIPALYAQAACRYF HEFGITEEDLADVAIANQSW AMHHPHAAKARFGSIDRAKV LSSPYVATPLRRWMCSTWGG GTGGALVVTSVDNARTAQDP VYVMGYGSATTHEYLGDRMN MRRCRYPSLGAFPNLTHTAT AEAARQAYESSGLTPADIDM AQISVNFAHMGPLIMEDLGF AAKGRGMDLYRAGRTGVDGD LPIDTNGGWLSFGQPGISCN MDSYAEAVRQLRGNALGRAP ARRPRTVLVQGSGGMLAAGS VTILASEL >_0011.003497_ YP_106618.1 gi|53715871|ref|YP_106618.1| sigma-70 factor, putative [Burkholderia mallei ATCC 23344] MQHARAAHDDPAYLAQLRHD LLRFARLQLRDADAAEDAVQ EALAAAWAQAERFDGQSSHK TWVFGILRNKLIDTIRARRR TINASALDAELDGEALLERE LFADNGHWAPHAKPRPWPKP ETILQQRQFWILFETCLEHL PEQIGRVFMMREFLDFAIDD ICTELELKANHCSVLLYRAR TRLRTCLTEKGLTTEDATGE M >_0005.003671_ YP_322020.1 gi|75907724|ref|YP_322020.1| Protein of unknown function DUF820 [Anabaena variabilis ATCC 29413] MQNTETTLQLRLWTVEEYHR MAEAGIFGADERVELLEGKI IWMIAKGTAHRSAVTRTDRL LQNSLKDLALICVQDPVKLN DRSEPEPDIAVVKIDPLDYA DHHPTPSEVYLIIEVADSSL KLDCVTKSQAYSQAGITDYW VLDVINRQLHVFRQPTPQGY ESKVVLAEDATIVPLEFPDL QIAILDMLPPVKKS >_0005.001618_ YP_323119.1 gi|75908823|ref|YP_323119.1| Protein of unknown function DUF820 [Anabaena variabilis ATCC 29413] MLQDDKEILNSPSTPLTVMS QTKDGVRWTIQDVAALPDNE WIRYEIIDGELFVTRSPHHK HQHVIGCIFSVLNSWSIDSG LGEPSIMPGLIFSDSDNVAP DVVWVSYERLAQIQDEAGHF RGAPELVVEVLSLGKANEDR DRLAKLKLYSVQGVQEYWIV DRIAQRIEVYRRDYAQLKLA TTLLVDDVITSPLLPNFTCE VARLFVSRS >_0004.003959_ 17742650 gi|17742650|gb|AAL44991.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008689|:1320836-1321711, Atu4197 MVLMPSPETIFIESGPLSAR FRSDWGGRMTHLVHAEMGDI LAPTQADVFEPYNWPRAGAY PLVPYHNRVYGASFVESGKT HRLLPHPALTPDAMHGPAHR RPWHVASAAEDRVTLQLNYE ADDEWPFSFVAEQHFRLEPD RLIVELSIINSSDASAPVSL GWHPYLAVPLSSHAETDARL EYPLDALNVPTGNEAHPRSS RSIPAATGYTLHFRYWSTAT VELDKGTLLLEADPIFEYLA VHRMEKYLCLEPVSMAAGAL CLPETERESRGLRTILPEGR LSGRISLSIRG >_0000.000457_ ADEH_23AUG04_CONTIG49_REVISED_GENE459 adeh_23aug04_Contig49_revised_gene459 VRIVLEIWRQRDPRADGRFV RYEVTDVSEHMSFLEMLDVL NQSLVRRGEMPVAFDSDCRE GICGTCGFLVNGEAHGPLRN ATICQTHMRHFHDGDVLRLE PWRAAAFPVVRDLVVDRGAL DRIVAAGGYVSVRTGGAPDA NALPVPKTDADRAMEAAACI GCGACVAACPNASAALFTGA KITHLSLLPQGQPERPLRVR SMAEQARAEGFGHCTSIGEC AAVCPAGIQLEVIARMNRDF LRAAFSRAAGEPLTVLPVTS PMRRYESQPPGARETAEPAE EPAKP >_0120.003204_ YP_001304154.1 gi|150009411|ref|YP_001304154.1| glycosyltransferase family 9 [Parabacteroides distasonis ATCC 8503] MANILVIRLSAIGDVAMTVP VIYSAAKANPTDSFTVLTQA FLIPVFINRPKNVNVIGINT KSTEKTLAGLLRFASALVKY DFDMVLDLHDVLRTMIIRTL FRLNGRRVFVVDKARKDRAR LTDAKHKRFKQLRPVIERYA DVFRNAGLHYTESFTSLYET STPDLSALEPLAGTKTGKWI GIAPFAKHRGKIYPTGEMEL VMAELSDREDFTIFLFGGRG YEEALLDQWAFEYPRVKSVA GKYSLDQELALISRLDLLIC MDSANMHFASLVGTRVLSVW GATHPYAGFYGYHQDPEDCI QLDLPCRPCSVFGQKPCLRK DWACMRQLNPDLIINKVLTS LNEMVSDEGGH >_0118.000120_ NP_267250.1 gi|15673076|ref|NP_267250.1| nucleotidase [Lactococcus lactis subsp. lactis Il1403] MIMKLSILHINNTHLTLDKD FSLINKIKEENKKRVIETLV LGDGNSIFNNIEHKNDEQDL LNTVPFDATTLGSHQFDEGS KGIVENLNKINFPILAANVN FEDDKLLNPLVKEGKFIPYL LKNTTSGKKIGIFGLTTMDI VYRSHPSAETIFFNPFDKAD FIVKILKNLGVDVIILLSQL GEEMNQMLAQQFLEIDIIID KPVGTSGKKVEKCGNTLIIQ SETDNYSLIESEINIDDEGN LLFIQENNYV >_0112.000734_ YP_909511.1 gi|119025666|ref|YP_909511.1| possible exopolyphosphatase-like protein [Bifidobacterium adolescentis ATCC 15703] MKSVTVAGIDCGTNSIRLKV SRVSEDGVEDIVPRILRVIR LGQDVDKTHRFADEALARAY EAAREFAGVLAEHPVDGIRF VATSATRDAENREEFEDNIE KILGVRPEVIPGTEEADLSF LGATSIVHREVEAPYLVVDL GGGSTELVLGGDGVTHPSTQ VQAAFSMNIGSVRMTERHLK NDPPTEGQIAEAVADIDAHI DEAFKTVPAGKTHTIIGVSG TVTTMTALAMGLTEYDHTAV DGARCTLEDAYAVDNRFLHM PREERLTYKTIHPGRVDVVG GGALVWNRVLAKVSEAAYED HGQRIDSFMASEHGLLDGIV LDYGKRLLAAR >_0091.003696_ SPUT_CN32_28JUL04_CONTIG70_REVISED_GENE3700 sput_cn32_28jul04_Contig70_revised_gene3700 MDEATANAKATEQAVRAAQA RVVKATESLKYTVVSAPFSG IVTERLVQLGESISPGQPLL SGFSQHQMRAITHVPQRYIN QLKDAPQFRVHLSDGREFTS TDLTIFSFADPVSHSYQVRI NLPKDEANLQPGVWAKATFK NGEREKIQLPISALLTMNEL SSVYLKQGNEFVLTQVRVTE PVDGEVEVLAGLRSGDKVAV DAYQVLLNKSSRPE >_0086.000462_ ROSE_TM1040_30MAR04_CONTIG46_REVISED_GENE463 rose_tm1040_30mar04_Contig46_revised_gene463 MPGQNRATSRPRMLPARTHI NRGQNNIGRVDVSEPASISA GIADRYATAIFEIAAESNAL DNLETSINDLAASLADSEDL RTLITSPLVSREEQAAAISA VADKMGLVDVLRNSLALMAA KRRLFVVPALIDALRARIAE ARGEVTAEVVSAKALTKTQS EKLAKTLAERVGKKVTINAT VDASIIGGLVVKVGSKMIDS SIRSKLNSLQNAMKEVG >_0084.002493_ SFRI_16AUG04_CONTIG85_REVISED_GENE2496 sfri_16aug04_Contig85_revised_gene2496 MDHRLKRHVDSMSSKSIVTT DKKQAILHSALQLFVNKGFN ATSTASIAKAAGVATGTLFH HFPTKKDIMNQLFLSIKQEF ATNMVSSTDFSGDIERDANT LWQKAIDWAIAQPLKQLFFL QYSMSADIDADVRKQAMNSI LGFVVELITQGKQQQIIADI PNALLLENCHGQYLAAIRFF TDNPHLGDDEIHRKASFRLF WQAMKA >_0073.005860_ YP_299649.1 gi|73539282|ref|YP_299649.1| regulatory protein, LysR:LysR, substrate-binding [Ralstonia eutropha JMP134] MDARLKQAVAVGRFGSFSKA AEAVNVTQSAVTKSVAELER RLGFPIFIRTSRGVVPTSEG RAFLERASRLIADTDELLSG ALRTDPFSGSLRIGIFPTTF EWMLAKPLEVLVQRHPRLLL DITSGTKERGVQLLDQGDVD VAIGLESAFGDKARFKLEYI ATISASIFVRRGHPLLELEA PVASDIARYDMVLPGQLWDL SLYPILSQIYGQAEAVRFHR IENFSLQCKVIENTDAMGFV DQAITRTDYFQSRFAVLQGL ETATTAGIVCAVREQWTPKA GVGALVDILQRVHAEGMLNS DIQNRLFGHECAALAAEAVP HRG >_0066.001799_ NP_904416.1 gi|34539937|ref|NP_904416.1| efflux transporter, RND family, MFP subunit [Porphyromonas gingivalis W83] MQVEEVAPASFNYVIKTSGQ ILSAQGDEATIAATANGIVS FAGSAFTEGASVRAGECIVR ISAKNLPDGDPVTKAKIAYE SAQKEHRRAEKLVKDQIISN KEYEQTCRDYQTAKTIYEAQ ASGTGAGGVSVASPMSGYIK NRMVNQGEYVSVGQPIATVS QNRRLQLRAEVSESHFKNVR NVSGANFQTSYDNKLYELSD LNGRLLSFGRASTQSSYIPV TFEFDNVGDVVPGSFVTVYL LSNTQEEVLSVPVSAITEEQ GLHFVYLRLAEEEYQKQEVT LGQSNGRRVRILSGLKAGDN VVTHGVYQVKLAATSSVMPE GHTH >_0063.005128_ NP_249203.1 gi|15595709|ref|NP_249203.1| conserved hypothetical protein [Pseudomonas aeruginosa PA01] MSACISPSDALLARRLIELT QAGLPLVADPWAWIAAQLRL SEAETLALLKRLRDAGVIRR IAAVPNHYRLGYRHNGMTVW DVADERIERLGRLVGGLSFV SHCYRRPRHLPQWRYNLFAM VHGRSEAEIEGYRQQIRLLL GEDCRADEMLVSSRILKKTG LRLAQKEERPC >_0062.000714_ OOEN_16SEP02_SCAFFOLD27_REVISED_GENE1031 ooen_16sep02_Scaffold27_revised_gene1031 MKVLQYFENPGLISRSGIGH AQRLQQEELSYTDIVLDTSP FSKDYDLIDVNTYGPKSLAM VAKARLQNRKVVYHAHSTYE DFRNSFIGSNFLSKPFKKHL IKAYRQADLIITPTPYAKSL LRSYGLKQPIVPISNGVRVS SYRRNKSKIIKFRQFLNLKE NDKRKIIISVGLYFQRKGIM DFVELARRNPEHLFVWFGYT DLRIIPKKIRETVKKDHPDN CVFAGYITGDVLQGAYSGAD LFLYPSYEETEGIVVLEALA SSQKVLVRDIPVYNDWLHDG FDCYKASNLDEFDKRLNEIL AGKVKNVSKAGHEVALKRDV SLIGPQLKKAYEEALSLPSQ ESR >_0057.000284_ NP_841656.1 gi|30249586|ref|NP_841656.1| Sigma factor, ECF subfamily [Nitrosomonas europaea ATCC 19718] MHSRDHQEDVAPAANPAMQQ VHFLYSDHHNWLYGWLRFRL GDAADLAHDTFVRLIARPRC FDSSQEARAYLRTIANGLCI DLWRHREIERAWLETLATQP EAYAPSAEEQTAVLQALHEI DGMLRHLPLKAARAFVLAMA CGMTHQEVARELGVSSRMVG HYITRAMLHCMQLEARNLGQ AGAIAP >_0053.001231_ MMAG_12JAN01_CONTIG3779_REVISED_GENE1242 mmag_12jan01_Contig3779_revised_gene1242 MPDLPKPETILVYVGLDAVG DGLIKLPFLRALRAAFPTAR ITWMSGKGPTVFQDILAPLV TGLIDEILAFTRIGERPLEL VGRRPLPDRSFDLIIDTQRR GLTTLILKRIRHRRFISATA GYLFSDGKPADTTRPASMIR QLMQLVEAAAGQPVDADFPL PRDPAIEAEADRLLPPGPRY LGFAPGAGGRWKCWPLDRYI ALAAAQKDAVPVFLLGPAEA EDWAATIREALPQALLPLQD TARLTPMLTIALGRRLAAAV ANDSGTGHMLGAADIPLVSL FGPTAADKFAPHTRRAAVLR AQDFGGEDMEAIPQDAVEKA LALLLL >_0052.002199_ NP_106891.1 gi|13475327|ref|NP_106891.1| cytochrome P450 [Mesorhizobium loti] MDMLLNPLDRRHRLRDDIPV VPGAFPLVGHLPAIVCDLPR LLRRAERTLGSHFWLDFGPA GHLMTCVDPDAFALLRHKDV SSALIEEIAPELLGGTLVAQ DGGAHRQARDAIKAAFLPKG LTQAGIGNLFAPVIQARVQA WRDRGDVTILRETGDLMLKL IFSLMGIPAQDLPGWHRKYR QLLQLIVAPPVDLPGLPLRR GRAARDWIDAQLRQFVRDAR AHAARTGLINDMVSSFDRGD DALSDDVLVANIRLLLLAGH DTTASTMAWMVIELARQPGL WDALVEEAQRVGAVPTRHAD LAQCPVAEALFRETLRVHPA TTLLPRRALQELQLGQRRIP AGTPLCIPLLHFSTSALLHE APDQFRLARWLQRTEPIRPV DMLQFGTGPHVCIGYHLVWL EMVQFCIALALTMHKAGVRP RLLSAVEKGRRYFPTAHPSM KIRIGFS >_0044.001928_ LCAS_20SEP02_SCAFFOLD63_REVISED_GENE3530 lcas_20sep02_Scaffold63_revised_gene3530 MNYEIVIVGHGRYPDGVLSA LQLLIGTTEGIKAFNLDEQT THEKFEKQLTELLSEHERVL VFADMTGGAPHQIVSRLVLE GNQPHQYVISSAPLNLMLDL YAKSLTGFEDDTIEGELQHT LTLSKQLIEILPDRMNTNSA TAPVPDMTHQDEGDGI >_0043.000336_ JANN_22DEC04_CONTIG16_REVISED_GENE337 jann_22dec04_Contig16_revised_gene337 MVPGADGLLQSAGLHRLCGW AAIDPNAEDPADRPKRPCRS PHDGPANRSDRAPQETAGEL MTRRQDYSGIAITAPFSMAY QRYSIDTAHWWIAKALRGSL DGAGLKPADIDGFSVASFTL FPDSAVGLTQHLGLCPRWLD HIPMGGASASVALRRAARAV QAGDAHIVACVAGDTNHIDS FRNMLSSFSRFAQDASYPYG YGGPNANFALLTDRYMQEYG ATREDFGRICVAQRANALRY PNAVMKKPLSLEQYINARPI ADPIALFDCVMPCAGSECFL VMSEEEATRRNLPYATLGGA IERHNAHAQDDVQLRGGWTM DVDELYGMAGCTPDDIDLLQ TYDDYPVISMMQMEDLGFCA KGEGPDFVRHHDLTIDGDFP HNTSGGQLSAGQAGAAGGYI GMVEAVRQVTGTAGGTQVAD ARTAMVSGFGMINYDRGVCT SASILKTGGRA >_0031.000376_ NP_293860.1 gi|15805173|ref|NP_293860.1| hypothetical protein [Deinococcus radiodurans] MLDECRAEGFEVQPGDLGEN VLTRGVDLLALPRGTRLHLG PQAVVEVTGLRNPCAQIDAF MPGLLRVMVQPRPGGPPALR CGVMGIVLAGGEVRVGDDIR AEWPAAPYHHLERV >_0026.000204_ NP_662323.1 gi|21674258|ref|NP_662323.1| 6-pyruvoyl tetrahydrobiopterin synthase, putative [Chlorobium tepidum TLS] MPSMIDLSGYPDSSVFYGKI YVHFVNISVNTTLRDTMLIS RKIEIDYGHTLPNSFTFCNQ LHGHRGVIVATVEGPVIDRA GDAEEGMVIDFKFLRQIMDE HIHDQLDHGFAVWKEDKEDL EFILKRNTRVLVTDAPPTAE CLARWAFNQISGKLPEGVIL KNLRWYETPNNWADYTGG >_0023.002585_ NP_602052.1 gi|19554050|ref|NP_602052.1| predicted glycosyltransferase [Corynebacterium glutamicum ATCC 13032] MKILLLCWRDTTHPQGGGSE RYLERVGEFLADQGHEVVFR TAGHTDAPRRSFRDGVRYSR SGGKFSVYPKAWVAMMLGRV GIGTFSKVDVVVDTQNGIPF FGKFFSGKPTVLLTHHCHKE QWPVVGRVLAKVGWLIESQI APRAYKTAPYVTVSEPSAEE LIALGVDQQRIHIVRNGVDP VPLHTPKLDRDGQHAVTLSR LVPHKQIEHAMDVVAALDGV VLDVVGSGWWQEELVDYART LGVSDRVVFHGQVAEDHKHA LLERATIHLMPSRKEGWGLA VTEAAQHGVPTIGYRSSGGL RDSVVDGETGLLVDSKAELI SATKTLLIDASLRSKLGASA KQRAENYKWDTAGAQFEELL LGLASKK >_0008.002174_ NP_981289.1 gi|42784042|ref|NP_981289.1| transcriptional regulator, TetR family domain protein [Bacillus cereus ATCC 10987] MTDSFIAEARREQIILACID TLEEVGYNNLSLTKVAKKAK ISTGLISYHFNDKLYLMNRT LQFLVEKQHEFISNRVLLAQ SEINKLEAFIEAHLAYQETH CKNNIALIEIVFNARNEENI PYYRIEDDEEDDALRTMLLN ILKTGQQNGVFSNSFQADTL ASFILGAIEERMLKANASIS IENYSDELIKMVKKLTI >_0005.000994_ YP_322044.1 gi|75907748|ref|YP_322044.1| Glycosyl transferase, group 1 [Anabaena variabilis ATCC 29413] MNKLSIAYITFDIVPAPKGA SIHIEAFSSALADVYKEIHL LTVSPTVGLIESDQIHLNIK QVMLPAVGNDFIQRVLYFRR ELQGWLNNRRFDVIHIRSIY EGLIIACNKKQYCDQMIFEV NGLPSIELKYRYPAVANDDD LLYKLYSQEQICLDAADLIV TPSNVTAKYLQGRGISENKI KVIPNGVDLDIFICKNTRQL NINVGHLPYQMLYFGTLSPW QGVNLAVEALGLLNRDFPAC LTVIGQGRNHQIKTLKQLAY KLGVADKLNILEPMSQKDLV AHIHSSDVILAPLAANDRNL VQGCCPLKILEGMATGTPVI TSDIPVVQELGENGVHFLSV KPGSAKAIKDAVLQLRNDGE LGSQLAANARQRIEEYYTWQ GAGKELIAAYQELLTID >_0002.000078_ NP_148172.1 gi|14601632|ref|NP_148172.1| hypothetical protein [Aeropyrum pernix] MGSIEISVEEPLLKLGVRLV YRVFDGVSVGESPGELVEIL RSVEDETKRLFRGPEALKDD PRVKAYRRFLWRLGIDPTKV RPSSEALARRVLRGSSIPLI NNVVDAGNIASLKTLVPIGL YDLDTVKPPLKLALSRGGEV FEPIGGKHQSLPQGYPILVD SRGLVMHIYPHRDSRVSMIT SSTRRVLAVAAGVEGVGMDD LRRAIEMLSELLERFAGGEP LGPAVEVGG >_0120.001774_ YP_001301821.1 gi|150007078|ref|YP_001301821.1| putative transcriptional regulator UpxY-like protein [Parabacteroides distasonis ATCC 8503] MRANVVKTVVNSVPHEGIDA VGVERTIPENPLRRKSDQKH WYIAIVNNKSEKLCRDKLEK RIASQPEGEKDYEVYIASQK EMCLLPSGKRKQVDRIVFRS IVFIRCTDVLRRKEIVHLPY IKRFMVNIAGERSGGIRPVA FIPDEQMVKLRRMLDDSEEP VIIDPRPLPLGARVRINGGK LHGLEGNVLEVEDGNLNFVI RVDLLGCAKVNITRDLLELL >_0112.001624_ YP_909629.1 gi|119025784|ref|YP_909629.1| LysR-type transcriptional regulator [Bifidobacterium adolescentis ATCC 15703] MLSEIGIIYLSDFNKDVIGK LLREKHLEFHPLFRAPLHVF ISRDNPLAGKKKVTMDDLKP FPFIQYEQGEEGSFFFAEEA VWPEYSPKQINVTDRATILN FIIGLNGYTVCTGIDNGDLN NEKIVTVPLDTDETMLVGWI TNERAKLSKAAETYLEKLKS VVASHGYTLID >_0105.000612_ YP_056871.1 gi|50843644|ref|YP_056871.1| hypothetical protein PPA2209 [Propionibacterium acnes KPA171202] MPEPMTPETFLDACTVDEAV FELRPDYRALLLVVDGLTPP ASGEGNNMVDTLIAQAEAHA RNLLADSPVNELAHIASWRE AFRGFGAKPQRTRNCLEALT RRAEKGLPRVNALTDVYNAI SVLHQIPLGGEDLHRYNGPA RLIRATGQESFDTTANGEAV IEHPEIGEVIWCDDAGVTCR RWNWRQDRRTGLTDTTRTAL FILDALAPVTDEELHATSDA LIEALANLGENVQTTTRLIG AQPSGN >_0084.001754_ SFRI_16AUG04_CONTIG80_REVISED_GENE1756 sfri_16aug04_Contig80_revised_gene1756 MKTLINKRILPTSTAGSLPK PSWLAEPETLWSAWKLQGEE LIEGKQDALRLSLQDQQHAG IDIVSDGEQTRQHFVTTFIE HLNGVDFNKRKTVKIRDRYD ASVPTVVGPVSRQKSVFVED AKFLRQQTTQPIKWALPGPM TMIDTLFDDHYKSREKLAWE FAKILNQEAKELEAAGVDII QFDEPAFNVFFDEVNDWGIA CLERAIEGLKCETAVHICYG YGIKANTDWKKTLGSEWRQY EKAFPKLQQSNIDIISLECH NSHVPIELLELIRGKKVMVG AIDVASNTIETPEEVANTLR KALQYVDADKLYPCTNCGMA PLPHHVARGKLHALSAGADI VRKELSAKGLLNLRD >_0079.001596_ SBAL_17SEP04_CONTIG214_REVISED_GENE1598 sbal_17sep04_Contig214_revised_gene1598 MVIELSPNEITSMEPNIAFI ANLIGDAARSRMLIALMGGE ALTATELALEADITPQTASS HLTKLVEGELLLVRKQGRHK YFQLQSRQVAELLESLLNMS AAIANPNVIHGPADPRLRLA RICYDHLAGELGVALYDSLS RQDLIVHEGGETKITAAGMT FFAKRGVEHSLLGAPDDVFD VPKSRQSRRPLCKSCLDWSE RRSHLAGVLGQWVLKDILAK GWAEKALDTRALQFSSRGLK SFRADYGIESK >_0077.002179_ NP_372835.1 gi|15925301|ref|NP_372835.1| similar to suppressor protein suhB [Staphylococcus aureus subsp. aureus Mu50] MTDKTLQQIDKLICSWLKQI DNVIPQLIMEMTTETKRHRF DLVTNVDKQIQQQFQQFLAT HFPEHQLLAEEKSNEMITNE INHLWIMDPIDGTANLVKQQ EDYCIILAYFYEGKPMLSYV YDYPHKKLYKAIRGEGAFCN GIKMEEPPSLKLEDAIISFN AQVMNLDTVQDLFDASFSYR LVGACGLDSMRVAKGQFGAH INTNPKPWDIAAQFLFAELL NLKMTTLDGKAIDHLKGAPF IISNKACHETVLKILNANGG YQKYR >_0075.000322_ NP_689147.1 gi|22538296|ref|NP_689147.1| hypothetical protein SAG2162 [Streptococcus agalactiae 2603V/R] MTKFIVDSSYWNLFPTSKIG VILIKDYHMDRNLETELKQL LSDSHSLAKKYLQEKEFSQN RVIQTYRKAYQTFKTKKGAR SSIEALLKRVNSGNEITSIN PLVDIYNAASLRFGLPIGAE DSDTFRGDLKLTITNGGDEF YLIGEDFNRPTLSGELAYVD DVGAVCRCFNWRDGKRTMIT DNTQNAFLVIELIDNGREII FKEALDFIATNTNRFLKAKT QTIILDKEHSEITL >_0073.003939_ YP_294978.1 gi|73540458|ref|YP_294978.1| regulatory protein, TetR [Ralstonia eutropha JMP134] MATPGKTTATDHSAPRKDTE APRWSRRKAARPQELVAAAL DLFVERGFAATRLEDVAAAA GVSKGTVYLYFANKEELFKT VVRENLVPALARGTDLVDRF EGTTPELLRELLRGWWGLIG ATSVSGLTKLIMAESGNFPD IARFYQEEVMVPGDELFTRV LARGVERGEFRALPPNPTTT LVCAPLVFLMLWQRSFGLYS HKEIDPDAYLDNLLETLLFG LTAGETRDRPLPPKSGPYIW EKIRDEMTAQRLGQRTDDLD TVAPAATAQQDKT >_0069.000627_ NP_894016.1 gi|33862456|ref|NP_894016.1| inositol monophosphate family protein [Prochlorococcus marinus str. MIT 9313] MSSGPLPESLGSLQLSAVHQ LLDRVADRQRQDFGNIVSDF KPDGSLITSCDRWSDAAIVA GLAQIAEQEGVLSEEGSKCV PDSPAYWVVDPLDGTTNFAA GIPYWAISMARFVGGRPVEA FLDVPSLNQRIVAIRGEGVW RNGKRLTNETRSTGSACVSL CSRSIRVLQRRADQPFPGKI RLLGVASLNLVSVAMGQTVA ALEATPKIWDLAAAWLVLNE LNCPIQWLAADPAQLHPGQD LTAADFPVLATGSHAEMQRL RSWGEALLHG >_0066.000911_ NP_905135.1 gi|34540656|ref|NP_905135.1| peptidase, M24 family [Porphyromonas gingivalis W83] MIKLAPAALKADLQVRQERV RIAMEEEGYDALLVTSNVNL LYLFGSIYGGAAYLPAEGEP IFFVRRPQVIEEGNVCPIRK LEDIPALIQSRGGVLPRRIM LENDESSYSDIKRQHSIFPN AEYGNATALLRRLRMIKTPG EIELFRRTAAIHGEVYACIP SVFRPGMTDREFQVEIEHLM RNYGSEGIFRTFGSAMEIHM GNVIVGDNAESPSPYDFAMG GGGTDALPLGADGSPMKEGM CVMVDMAGNYSAYISDMTRS YAIGKVPDEARRLHDLSREI QAKVMETAEPGMSCADLYKR SVEMAEEAGAADKFMGTKQQ AKFVGHGIGLQINEMPVLMA RSKEILTPGMVIAFEPKFVL PGIGAVGNENSFLVTESGVE KLTVCSEELIDLLAVAKQ >_0065.000965_ NP_578620.1 gi|18977263|ref|NP_578620.1| sulfhydrogenase beta subunit [Pyrococcus furiosus DSM 3638] MRYVKLPKENTYEFLERLKD WGKLYAPVKISDKFYDFREI DDVRKIEFHYNRTIMPPKKF FFKPREKLFEFDISKPEYRE VIEEVEPFIIFGVHACDIYG LKILDTVYLDEFPDKYYKVR REKGIIIGISCMPDEYCFCN LRETDFADDGFDLFFHELPD GWLVRVGTPTGHRLVDKNIK LFEEVTDKDICAFRDFEKRR QQAFKYHEDWGNLRYLLELE MEHPMWDEEADKCLACGICN TTCPTCRCYEVQDIVNLDGV TGYRERRWDSCQFRSHGLVA GGHNFRPTKKDRFRNRYLCK NAYNEKLGLSYCVGCGRCTA FCPANISFVGNLRRILGLEE NKCPPTVSEEIPKRGFAYSS NIRGDGV >_0056.001645_ SARO_25NOV03_CONTIG28_REVISED_GENE1750 saro_25nov03_Contig28_revised_gene1750 MCRILGYLGTPVLLEDLLYA PDSSLLNQTIGAQMLQMLNL AGFGMAAWDPASHDPHLPFR YRTTQVALFDRNLKALAGKL RPGALLAHIRGVPYNSTVQI NEQNCHPFRFEGVPLAMAHN GDLAGFREMRFDLAPHVRPE FARQIQGSTDSEWIYALAVS ALDDPAGVNEPDAILAAIRR ALSILRDARARHGITRSSST NLMFCDGVNLVAVRFTFDFG RFDAGKLQGTTEYLSMWYTF GRDYGLHDGEWKLTGGAASA DSVMVASEPLTRDFATWIEV PEYSALIVRHEGARRRAEIH ALEV >_0056.000395_ SARO_25NOV03_CONTIG24_REVISED_GENE424 saro_25nov03_Contig24_revised_gene424 VKADANSTSGSGRRGRPSAE STRLRMAHLLEVARGIFVRR GYRATTMDEVAAAAGVTKRT LYAWHSDKEALFRACVMLGA ERFPRIEPQAGEDARAALER YVLELHRELTCEDSYGMGAL FLREAAEFPELAGSIQRGHF DFMVEPLAAYLRGQGLEEEA STERTMLFVAMALSPLHNAM LVGMALPGTAGVAAHARRCV SIFLDGSRLR >_0051.000205_ NP_248444.1 gi|15669631|ref|NP_248444.1| conserved hypothetical protein [Methanococcus jannaschii] MEGKAYALASGTIINAIATG KGSAFGLDLKVYAKVKLIDD GKNKIEGKVLDNPNIKPNLI VRCVKNTLDYFGLNYSAYVE TKTEIPIKSGLSSSSATSNA VVLATFDALGEKIDDELILN LGIKSSFDEKLTVTGAYDDA TASYYGGITITDNIERKILK RDKMRDDLNVLILIPNLEKN VDVNRMKLIKDYVEIAFNEA INGNYFKALFLNGILYASAL NFPTNIAIDALDAGAITAGL SGTGPSYIAMVEDENVEKVK EKLNRYGKVILTKPNNDGAS IY >_0037.000327_ NP_952404.1 gi|39996453|ref|NP_952404.1| iron-sulfur cluster-binding protein [Geobacter sulfurreducens PCA] MSEEKNTAIDLKNLKAGGFI KERGKDLFTVRLRVPGGRLP VGRLKKIAEVAEKYGQGMVH LSVRQSVELININFRDFDAV VAELGEGTQKVASCGARVRV PTACGGCEYNPNGLVDTQKS ALEVDEKLFGIPTGHHKFKV GFAGCPFDCPKSATNDVGFQ GAVWPVLSADHCIGCGLCAK SCTEDAIAMGDNGKPLFIPA NCLYCGDCLKVCPTEAWRAG KKGYTVRIGGKWGRRPLVGT LFAEFLPEDQVVDFIAAVLG WYQEKAEGQGRIRLGDVIRA QGPEALLARLRERFPAFVVN ATIAPQAIATQVGKEKQAHD NR >_0032.003345_ YP_010075.1 gi|46579267|ref|YP_010075.1| NirD protein, putative [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MTEAHNACCHPSGTAAGHHG AGKASTDMMDAVDRRLLDII QTGFPIEPRPYAVLGETLGI TECEALARVRALRERKVIRR LGANFDSWKLGFRSTLCAAK VPEDRIDAFVAEVNRHVNVT HNYLRNHEYNIWFTCICPSW EQVCSLLDGITERTGIPILN LPATKLYKIRVDFRMD >_0030.000155_ DHAF_12NOV03_CONTIG1009_REVISED_GENE184 dhaf_12nov03_Contig1009_revised_gene184 MQDKDTTQSTFTQVFQPFFS KDLWKKIDQEVPNLDQRNYK LKTNQLTLLISHAQLQEYKA LRKISSNVQSNDFSEAIGLE SISHSQISRRLRTLPIKVSE MLFKGVLNKVAQKKGDGKIQ QRLGKLYMIDASVISLCLSR FPWAVFRKIKAGVKMHLRLS FDEMAIPDEVIITPAKTADR KKLDELIVVDKDALTIFDRG YIDYLLFDEYCEKEIRFVTR LKNNAVIEFTGVERPVEEEG SIEEDVDIILGTGTRKMKHT LREVTIDDNVNEPFTILTND FDLSAEELGEVYRYRWQIEL FFKWLKQHAQIKHFYGTSEA AVINQIRLDLMTYCTLILLK LEVEHQRDLLTLQRMLIACL YESYDEFLGKLRRRRRKGSK RIKHDTIYQMTDHYIMAEED TEWLNDLIYDPVIL >_0028.000526_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE0527 ddes_06jun05_Contig143_revised_geneDde0527 MAFISSGYNPEKPMEGRITD IGPQHYAQFYPPVIARNKGK WLYHEIIEPGVLVHVAESGE KVYTVRVGAARLMSITHIRE ICEIADKHCGGHLRFTTRNN VEFMVETEEAMKALRDDLAS RKFDGGSKKFPVGGTGAGIS NIVHTQGWVHCHTPATDASG PVKAIMDEVFEDFQSMRLPA PVRISLACCINMCGAVHCSD IGVVGIHRKPPMIDHAWTDQ LCEIPLAVASCPTAAVRPTK VELDGKKVNSIAIKNERCMY CGNCYTMCPALPISDGEGDG CVIMVGGKVSNRISMPKFSK VVVAYIPNEAPRWPTLTNTI KHIIEVYSENAMKYERLGDW AERIGWERFFQLTGLEFTHH LIDDFRDPAYYTWRQSTQFK F >_0024.003231_ CHUT_08NOV04_CONTIG199_REVISED_GENE730 chut_08nov04_Contig199_revised_gene730 VSEKEKVIIESIRSGANNIA LEYLYDISLKKVRQYILKNN GSKDDANDIFQDAVIVLFNQ IRLNKFNEAYSIDAFIYSVA RNLWIDKVRRDKKFTKYDSP DDYAVIASDTNHLDALIQKE KSAAMKTVFNLLDEKCRNIL TYVIYEKRSMKEIKELMGYS SEDVAKTNHYRCKQYLTKLV KSNPSLVDLLRN >_0019.001778_ NP_347198.1 gi|15893849|ref|NP_347198.1| Transcriptional regulator, MarR/EmrR family [Clostridium acetobutylicum] MKNSNYEKQISELILETICK FDERDSKERNFGTDVNIHHS EIHMIKFIKENTDLHISAIA RKLGITRGAVSQTIKRLQSK GLITKEVDEGNNSKIVVRLT GKGQTAYINHENYHKQYEIR IKNILENMGAGSEKVVYDFL LEFYRKI >_0019.000163_ NP_149243.1 gi|15004783|ref|NP_149243.1| ThlR, HTH transcriptional regulator TetR/AcrR family [Clostridium acetobutylicum] MAKTTKGEESKIKLIECAAK LFLQKGYNGTGINDILSGTG LPKGSFYFHFASKKDLAISV SEYFEKKLLGWIKINSKDKK WDEFIISLVGEMIKGAEKGK HYGCPFAVLGLEIAFLDSDI SKCYYKSIKELIDLFASIFE HSGVPKDNIYILANRAFAIY EGYLVYYRIGKDINTLKTMQ EDLIKMYEDFKEVK >_0018.001517_ BFUN_06OCT04_CONTIG481_REVISED_GENE1518 bfun_06oct04_Contig481_revised_gene1518 MTSPNTTQGAADPIQSYLNQ GLRNYWYPVAPSWQVGDAPV GITRLGDQIVLWRDKEGKVR ALEDRCPHRGARLSLGWNLG GSVACWYHGIEVDGGGTVTK VPAVSNCPLEGQKCVKSYPV EEHAGAIFLWFGDDAHKEPA PLVLPEELVGEEYASFLCMS HWKCNYQYAIDNVMDPMHGA YLHATSHSMAEGDKQADMRV RKTETGLMFEKIGQRDVNFD WVELGETGCLWMRLAIPYKK KFGPGGNFGIIGFAVPVDED NCQVYFWRTRKVQGWQRDAW RFMYRNRLEGLHWAVLEQDR YVLESMAPNARDHEFLYQHD VGMTRVRRMLRQRAQQHFAD LDAHRAQAVSATPAAAEAGH SHA >_0017.003162_ NP_810934.1 gi|29347431|ref|NP_810934.1| conserved hypothetical protein, putative non-specific DNA-binding protein [Bacteroides thetaiotaomicron VPI-5482] MPLIYKPYQANIANKAGQKL YYPRLVKFSKMVNTQKMAEL IAEKASLTPGDVHNVIRNLM SVMREQLLNSRTVRLEGLGT FTMIAKAGGKGVVLESKVSS SQIVSLRCQFTPEYTRSADG VTTRALTSGVEFVHVKDVAG GFVDDDDKNHSGGGDNPGGG STPGGDDDEAPDPTV >_0011.003941_ YP_105792.1 gi|53716780|ref|YP_105792.1| 3,4-dihydroxyphenylacetate 2,3-dioxygenase [Burkholderia mallei ATCC 23344] MGKLSLAAKITHVPSMYLSE LPGRHHGCRAEAIRGHQAIG ERCRALGVDTIVVADTHWLV NAGYHVNCNGHFAGVYTSNE LPHFIRDMRYEYPGNPALGH LIAASANERGIGTRAHEIDS LELEYGTLVPMRYMNADQRF KVVSIAGWCMWHALDESRRF GEALRHAIDASDANVAFLAS GSLSHRFNDNGSPEEAIHMI SREFYRQVDLRVVELWKQGD FATFCKMLPEYNAHCHGEGG MHDTAMLLGLLGWDRYDKPV EIVTDYFASSGTGQINAIFP LP >_0011.002500_ YP_104477.1 gi|53724030|ref|YP_104477.1| transcriptional regulator, PadR family [Burkholderia mallei ATCC 23344] MAQSASRYSPLALIVLAMLT EAPMHAYRIQQLVKLRGKDE VVNVKQRNSLYQTIERLQRD ALIAVRETERDGAFPERTVY EVTDAGRDTARMWLREQLAR PAREYPSFPAALSVLPLLSV EDARRQFEARVAALEAELAR LDETQNAALAMQIPRLFLLD GELMRVTLEAELDWVRSVIG HLKVGALTWSEAWLREVAAR FAQADSPDSADSD >_0011.002192_ YP_104028.1 gi|53726151|ref|YP_104028.1| hypothetical protein BMA2482 [Burkholderia mallei ATCC 23344] MTATHSRSTATDAVQTLQTR TARRAAAAHRVSPGPQTSQI ARGAGVTETAIATLSNDMLR LDVAPHLGGGVTRFDWRGDG ALTPIFRRCDAPGARTDPNE LACYSLLPYSNRIGGGRFEC DGRLVRVPRNRSAEPLPIHG DGWLAHWQLDDATDTQLGLS LDRSNGAPYAYRATQVYALD GATLTIALGIENTGATRLPF GLGVHPFIVRDASTELAAAA SGLWLSTPDWLPSRHVGAPP AWRFGIAYPLPDTLVNHAFT GWGGGATIAWPQRRLGLTVT ADADCYVLYTPPGEPFFCFE PVDHPIDAVNLPGGGGAHGM TLLAPGERLMRRFRFTVACT DARAAPVARQSRRRAIA >_0010.001085_ YP_034215.1 gi|49476174|ref|YP_034215.1| hypothetical protein BH15190 [Bartonella henselae str. Houston-1] MTTFTILLNGDLFITDRLRN QIRNSRVIAADGGMRHAEAL NVVPELWLGDFDSSQQALKS KYADIPREIFPPDKDMTDSA LACERALQKGAEKLILCGAF GGERSDHSLSHMTQALVMEE KGISVLLTSGREEGWPLLPK PFSCDLPDDSLFSIIGFSDL KELTISGAKWPLYNKNVLFG SSLTLSNRICGTFSCHLCSG KAIVLASVPIS >_0009.003662_ NP_244600.1 gi|15616295|ref|NP_244600.1| transcriptional regulator (TetR/AcrR family) [Bacillus halodurans] MSTKDGQAKERILKAAEELF QVKGYHQVTVREIARKAGCS HTSIYVYYGEKRKLLELLAK KPLNELREDVRQILTKSSVT PSDRLVALAKRFVHFGLVHR NLYEAFLHAEATRVDIPTTL WELNDIRMQLFDMLKKAVAL NHQPWNEERVVSLSRMLYYA LHGMIMTYKDSDESIRSIER RVLPIVEQTVHVFLKGAIQS >_0005.003421_ YP_321560.1 gi|75907264|ref|YP_321560.1| Glycosyl transferase, group 1 [Anabaena variabilis ATCC 29413] MKILHLSTYDNRGGAAIATY RLHDGLQNIGITSQMLVQIK FSDDKSVIATGNKIVHKYPK LKPHLDSLPKLFFRHIDKSR RTSYSLQWLPDSIATSIIKI DTDIIHLHWISGGLINIETI AKLNKPLVWTLHDMWAFTGG CHYNQECQLYKENCGNCPQM PKRFNIDLSSWVWERKAKAW ADLNCTVVTPSHWLAKCAAS SSLFKDCNIKVIPYGLDTEV YKPYQKNLVRDKFNLPQDKL LILFGAENAASNTRKGFHFL RCALEILKHTYWHDKCELVI FGASKSDSISNLGFNTHYLG RLNNESTVAQVYSAADVFVA PSIQDNLPNTVMESLACGTP CVAFDIGGMPDMINHKQNGY LSQPYNIDDLANGIIWVIED KERHQKLCASSCATVKEKFT LELQAKNYLSLYQNILKINN >_0116.001262_ YP_194278.1 gi|58337693|ref|YP_194278.1| pyrazinamidase-nicotinamidase [Lactobacillus acidophilus NCFM] MTNEALLIIDYTNDFVDDKG ALTCGKPAQELDDTIANLAD KFLKEDKWVIFPTDKHFKDN PYHPETKLFPPHNLPNTWGR ELYGKVGKWYEAHKDNNHVI LMDKTRYSAFAGTSLDLLLR ERKIDTLHLTGVCTDICVLH TAMDAYNHCYNLVVHENGVA SFDQNGHKWALNHFKTCLGA KVVD >_0111.000795_ YP_211664.1 gi|60681520|ref|YP_211664.1| putative TetR-family transcriptional regulator [Bacteroides fragilis NCTC 9343] MYGNKIVFSGLENIKDCCKM KITRDELLIAAFKLFMSVNY EKASFAELGKMLGMSKAGIF KYYKNKQELFIAVVDKFWFS TQNPRNKFTETNGTFAEFID EYVRGVQRTMDMLGDLIGAE REKVAQGKFTYHAQYFHFLF QLLQYDPDAKEKLRNLVEVD YAYWRAAIQRAIATGELRED VDVEDAVVMFRQVYMGLSFE MAFMGGLNTQRLAKHLHAVY SLLKR >_0108.0000801_ YP_462394.1 gi|85860192|ref|YP_462394.1| asnC family transcription regulatory protein [Syntrophus aciditrophicus SB] MEQRMRDRHGEQNLPTENQK ERGERIVLTRDEQQILQALQ GDIPLHSRPFAALGKRLNLS EDTVIRIIQELADRGIIRKI GAILRHRQAGFKQNAMVVWS VPESRMEITGKILASFPEVS HCYERTPSFEGKYNLFTMVH LKYPDIANQVREMAQAADLD DYQILVSETEYKKSSMVYFT >_0106.001695_ SWOL_07JAN05_CONTIG134_REVISED_GENE1696 swol_07jan05_Contig134_revised_gene1696 MNAVPKDEMTPLERMAAFAA GKPYDRIPCSSFSGETACHF IGTTVSEYRHSAKLMARVET VTYKMFGHDGAGVGPGFLAL AEAMGTELKYPEDNIPFVAN PVLKNWDDFDKLEPADPHKD GKLPIYLEALAMIKDAIGDE VPIGSSVGGPFTTAALVRGT ENFLKDLRRNPEMAHRLLQL VTNTTLNYIDAVMDMGCSVS IGDPTASGSLLSVKQFREFA KPYLSKLADRVIERTGKGPM LHICGDSTPIWSDMADTGAP TLSLDNVIDLAEAKKVVGER VCIMGNVKPIDTIMKGSHQQ IEAEVKECLRKAYDSPKGFI LAVGCQVPLGTTADNFMCFM NAARKYGRYPLNPDNFI >_0096.001891_ SYN_PCC7942_21JUN05_CONTIG52_REVISED_GENESYN_PCC79421844 syn_PCC7942_21jun05_Contig52_revised_geneSyn_pcc79421844 MGREGWSRVLSATGENRERS LQKFMTETLTQLGQMVGLPA SPEVAVLETFDNPHPDRQYL VRFVAPEFTSLCPLTGQPDF AHLVLDYVPDQRLVESKSLK LFLGSFRNHGAFHENCTLTI AKRLEEAMNPTWLRLGGYWY PRGGLPIDVFYQSGEPPAGV WVPEQGVAPYRGRG >_0078.007446_ NP_828660.1 gi|29834026|ref|NP_828660.1| putative N5,N10- methylenetetrahydromethanopterin reductase-related protein [Streptomyces avermitilis MA-4680] MKFGVSTFLTDEGIGPAALG PALEERGFDALLLAEHSHIP VKRETPYPGGGELPRVYYRT LDPFVALGAVASVTSELLLG TGIALVVQRDPIHTAKEVAS LDLVSGGRAVFGVGAGWNRE EMRNHRTDPTERGRLMDERI RAIIELWTKDEAEFHGDFVD FAPVYSWPKPVQRPHPPIWV GGGSERTFARVAEYGAAWMP SGVPPKELGAQIERMRKTAG DRPAVVVYAAQHDRESLDTY AELGVERVLLYLPTQPEDET LRTLDELAEAVSAYR >_0075.001360_ NP_688407.1 gi|22537556|ref|NP_688407.1| glycosyl transferase, group 1 family protein [Streptococcus agalactiae 2603V/R] MENKVKTVAVFSGYYLPFLG GIERYTDKMTADLVKRGYRV VIVTTNHGDLPIIDEDKGRK IYRLPTKNIVKQRYPIINKN REYNTLMKYVSDENIDFVIC NTRFQLTTLEGLSFAKNHHL PSIVLDHGSSHFSVNNRFLD FFGAIYEHLLTARVKHYRPD FYAVSKRSVEWLKHFNIEAK GVIYNSVSESLGSDFAGTAY LEKSADDIFITYAGRIIKEK GIELLLEAFSMSQYSENVYL QIAGDGPELAHLKEKYQSKQ INFLGKLNFEQTMSLMAQTD IFVYPSMYPEGLPTSILEAG LLSSAIIATDRGGTVEVIDS PELGIIMEENTQSLHESLDL LVKDKALREKLQQNIAKRIK EHFTWEKTVEKLDYIIQKN >_0061.007261_ NPUN_22DEC03_PLASMIDD_REVISED_GENEPNPDF031 npun_22dec03_plasmidD_revised_genepNPDF031 MRLPPPTRLPNKHYRSREYL TPSEVRSLLDAALDRKARYS HRDYTLMLLMFRHGLRVGEA VGAKCGLRWDAVMWGERQIF ITREKGSDSGVHPLRDDELV LLKELREMLPDSKYIFVSER GEVMSTDAVRKLLGRLAAQA GLDIKVHCHMMRHACGYYLV NQGYNTREIQDFLGHRDIKH TEKYTKLNARRFLNFDWGDL >_0061.004192_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR1866 npun_22dec03_Contig1_revised_geneNpR1866 MTKIEERGKVYLVGAGPGDV AYLTVKAYNLLAAAQVLVYD ALVDAQLLQCVPPNCLKLDV GKRGGKPSTTQTEINKLLVK YCQQGKQVVRLKSGDPLIFG RCTSEIEALKASGCDFELIP GISSALAAPLFAGIPLTDPV LSRCFAVLTAHEPEVLDWEA LSRLETLVILMGGQHLPEIV HQLVQHGRSHLTPIAIIRWA GTPRQKIWTAQLGNILEQTT GLSLSPAVIVVGEVVGLHKY LQPEKIDCQDSTTAQTPMLN NLSASPEPLTGKTILVTRSV GQSSTFSDRLTTLGATVIEM PTLEIGPPSSWEALDDAIAH LSQFDWLILTSSNGIDYFFE RLIAQGKDTRALAGVKIAVV GEKTANSLKQYSLQPDFIPP NFVADSLIENFPEKLDGKKV LFPRVESGGREILVKELTLK GAKVIEVAAYQSCCPSSIPS GAELALQNHNVNVITFASSK TVQFFYQLTEKIFSQNSDVS QLLENVCIASIGPQTSKTCH DLFGRVDVEAQEYTLDGLTQ ALIIWATNY >_0055.000681_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA0050 rgel_26jun05_Contig562_revised_geneMpeA0050 VSPRKPGTQVPAAVTDNTAM APSTADTAATVRNEQLHQHG ALVRRIAKRMRARLPANVEM DELLQAGLIGLNEAITRFEE SFGASFDTYASRRIEGAMLD SLRAADTLSRDARAHQRAIR AAVQALEHRLLRPPRAMEVA SELGWSLARLHQRLVEAGAA GQRAGDAPLDGLADEASARG AEDDLHATADDHADPLAALQ RRQRTTALNAAFDTLEEREQ LMMKMIYERGLDHQDAAESL GVSPSRVSQMHAVVVQKLKR RLKDW >_0054.000139_ NP_988347.1 gi|45358790|ref|NP_988347.1| precorrin-6B methylase [Methanococcus maripaludis S2] MVFLKNFFERLIVFKSYIKY HFNIVSLKTCSKNYKWDTVI YIVGIGPGSRDYVTEKAFDT VETSDFVLGSKRSLEIFEIN GKRMELTLNLKNELYDFLKN YKNSNSKEKVSILSTGDPCF SGLLKTIFSFDFIDQNDFEV IPGISSIQMAAAKAKISWED YNILTIHGKEENLKKLLDFV KNDEKVIFLPNNLKNDIKFL IENGISDEKEITVLENLSYS NERIITNKISNLVKNDYSYL LVCIIN >_0052.006578_ NP_108646.1 gi|13477075|ref|NP_108646.1| similar to actin interacting protein [Mesorhizobium loti] MTIHGISPDVLAALEHVLGQ SGVAADTADMAKYLVDWSGD HHGGALAVLKPASVAEVQAA VRLCGTLGLAMIPQGGNTGL VAGAIDIGTAGGAVVISLER LNRIRSVDADNFILQADAGC ILQHIKDAADERDCLFPLAL GAQGSCQIGGNAASNAGGVN VLRYGMARDLIVGLEAVLPD GELWNGFSGLRKDNRGYDLK QLFIGAEGTLGIITGVEMKL FPKPGRVETAYLGLPSFEAA ISLFRRARRQCCDLISAFEI IGAECMELARLADPNIVTPV TGPVHVLIELSSSAEIDLRA LLMNFLADTMEEEIVTDAVL AESGAQARAFWGIREGLVEG QAKRGYHVRTDLSVRISDIP TLIAQARQFIELQHPGWISQ AYGHAGDGNIHFNVLPPLDL ADPDARARGATITTGLYDIT NALGGSISAEHGIGRTRQRV YWAGMSAVQRRLVSTLKDAL DPGGLMNPGCLFPATETFS >_0050.001968_ MFLA_01DEC03_CONTIG130_REVISED_GENE2186 mfla_01dec03_Contig130_revised_gene2186 MRYWPWLEHVEKPLPDHPLH LPTIVNEHRPKVARRERGET QAANILSVALDLFAKHHVAS VTTKQIAAASKVNSALLYYY FKNKDDLFRQAVDHAIGQVL LRFEHVQKTAHGPAEILSAW LNLHLQQVELIRKFVKVAVD YASSESRTAETDATISRFYD TERRMLCQVLQEGIETGDFA QVDVEQTATFISTFLDGIVI RSIVLEGFNYAQSVQDLRTF LSHELQVSIQESTSFPME >_0043.002916_ JANN_22DEC04_CONTIG27_REVISED_GENE2917 jann_22dec04_Contig27_revised_gene2917 MANPFTPQTMISTTHSCPSG GTFGCNWSAIGCLREDMMFE DRITTFRTRIDEAGIDVALI TDDDAVYYLTGYYDYLHMEF GRPTILVVPSDGPSLLITPT IDLNTAQANARVDRIAPWND GMGDEWRAELPWALKGCASV GIEPDHMPPLVRAYVDDLVP RDRQTSATPILSTMRMIKSD GELQLARHAGQVANAMMAAG RAAIADGVPEFEVALATSQA GTRKAAKLLAAHYHDADMSP NTHFLQIMASGTEITKTHHR ASTRVMRRGEPVFLCFCGMT NFHRFKLGFDRTFWIGEAPD DHVAVYEVALASQAAALAAL RPGVSAESVHAAYAEVIEGA GHAYPFRCGRATGFSFLEAP QLVTGDTTILQPGMVLAVDG SVAVEDFRAQIGDSVIITED GYEPLTDHPKQISDIILA >_0043.002087_ JANN_22DEC04_CONTIG25_REVISED_GENE2088 jann_22dec04_Contig25_revised_gene2088 MCRWAAYLGQAIYLEELLTA PGHSLIDQSRAATECKTAIN ADGFGVAWYGDRPDPGLYRD VFPAWSDPNLASLARTLKSH AFLSHVRASTGAATSRNNCH PFAHGRWSFMHNGQIGGFDR FRRRADMGVSDPLYAHRKGA TDSELLFLYALDFGLEADPI GAVLLAHRRLEQMSRETGTT PHLRSSAAWSDGTRLYALRL SSDHIAPSVYYRWSKSRTGW AVVSEPLEVGQGGWTELPPG HSAMFEGAQVWVEPVAAMAA >_0039.001102_ NP_874109.1 gi|33152756|ref|NP_874109.1| possible uroporphyrinogen III synthase [Haemophilus ducreyi 35000HP] MNVLITYPASRAQKLVDMCQ QHRIFAIYQPIFSVELGADL LDLPSAMSRLNAGDNVIILS KEAFDFAWQTLADTGFAYRA DLNYFVADKYLAAYVTAKTT RPVHYPDTEQCNQLLQLGKL LQLTDKKVLILCAEEQRGLF QNQLAEAKQVQYLACYYHKH IEQLTAQLSLAKRAGIDTIL IENEDVLLTLYEQIAAEDCQ WLVKTRLVVISQHIANIAEK LGWQSDQVIIAGQTDNQTLL NTMLTFINKAN >_0038.000905_ NP_279693.1 gi|15789869|ref|NP_279693.1| farnesyl-diphosphate farnesyltransferase; FdfT [Halobacterium sp. NRC-1] MLCRSALYCWHPGTPSRGAM IAVGDGSTPQPLLYSPTLAR PRVMDGRQSHQPTASDREWC HSAVQDVSRTFALTIDALEE PMATRICVGYLLCRVADTVE DATAMPPTEQAQLLETYDAA LDPDCGTTVAEFQAAVGDHV PSEPNADWRVVANTTRVVSA FESLDDAAKAAIRPTVREMA TGMASFVQRYADAGGLRIQS PTELEEYCWYVAGTVGELVT ELLARDAPSDAAAAMRENAR SFALLLQLVNVAKDVRPDYE EENNVYLPQEWLDDHDLAPE DVADPEHAGRVASVVERVTD RARGYVADAQRWLEVMPTSQ GNTLAAWAVPFLLAVGTMRE LGNRPADVVREGDVKVDREE VHAVIGLVADDFDRSAIDSV RERIATESLS >_0036.001110_ EXIG_01APR05_CONTIG277_REVISED_GENE1111 exig_01apr05_Contig277_revised_gene1111 MNLSISCTEKGRLIYALVSV NKTIAQKFDLCTDGFSQTRM DLLAQLEVDTTISQKELQQR VNVDNAAVTRHLKHLETKGM IERVRSATDNRVILVSLTAE GATRITRLRQQKDEFLEQLL EGFTADEQHQLTQMIQRIET NALTLPKTTVT >_0032.003220_ YP_009868.1 gi|46579060|ref|YP_009868.1| precorrin-2 C20-methyltransferase [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MQPGTLYGIGVGPGAPDLLT LRAVSTLARVDVVFAAASSR NDYSVALDIARPHLRHDVTI ERLDFPMTRDQSVLTAAWKA NADAVEAVLRTGRNAAFLTL GDPLIYSTFGYLLRTLTALA PDLPVEVIPGVTSYQAAAAK TRTILCESGENLLLLAGIQD AARLDTALTRADNAVILKAY RNFSSIRTMLASHEATRDTV FVSRLGMEGEVVARDLHAAP EHPHYLSLLLVTDRKHGNPE HTAESTSDAEADETAD >_0025.001246_ NP_281436.1 gi|15791613|ref|NP_281436.1| putative iron-binding protein [Campylobacter jejuni] MTYNEKIISMNNDLLDHQHK ELFEISKKLSLMNQRHVGTK ELKIVLRELLIMINRHFSDE EAFMREIEYPYINHHTRIHR KIILEIEEIIISEAKFVNIM TEKLNLVVQDFIFKHTAKED SKIVKYYEEKFKK >_0019.000180_ NP_347512.1 gi|15894163|ref|NP_347512.1| Transcriptional regulator, MarR/EmrR family [Clostridium acetobutylicum] MGNDQIDEIVNDIFYVMPAF AKTMFGLLESTFYHCELSNS SIRVLFALNNHKKITITDLG KVLCAHKPNVTSWVDCLVKN DLACRLYDDNDRRIIYISLT ENGKLTIDKCKEVLNKSFAE RLAHLNDEDLELLIQTLNNM SMLLNKINGHS >_0018.007663_ BFUN_06OCT04_CONTIG482_REVISED_GENE7664 bfun_06oct04_Contig482_revised_gene7664 MKNERGQNLLKRVDLVVGGA LLSLLGMFRRKRRPPAECTT AGIFAFAAIGDSILSSIIIS ELRKVRNISKVVVFASRSNA RIYDLFDGLDEVVVVPLTSP LNALREIRKHDVDVLIDTSQ WPRISAILAAFMKAKFTVGF KTRRQFRHYAYDACVEHRDD IHELDNFRRLLRPLGIESTE TPQIGKSIRPSGLPFVMDDR YIVFHPWASGTNFAMREWPA EYWQKFARMLIGSGYAIVIT GGPEDAARAQQLVDLIGSSK VVSVAAKVSLAEVVGIVRRA DCIVCVNTGIMHLSAMLDTP MISLHGPTNPFRWGPVSKER GVLAVSRQDGGMFLNLGFEY PKDAKYVMDKISVASVADVL SEVYSINLGRHDKKPAGEAF AP >_0016.004739_ YP_442946.1 gi|83719522|ref|YP_442946.1| syringomycin biosynthesis enzyme, putative [Burkholderia thailandensis E264] MGRTRRDPARAPFASFVSPE QKDFRMTQLSMPAAAARPTL DDLRVEPGLPTVVSPREGAD IALHEAAPLLREIANDVVER AGGVLFTGFRVASIETFQRF AADFGDPLIGYEFASTPRSQ VEGAVYTSTEYPPHRSIPLH NEQSYTREWPMRIWFHCALA ARTGGATPIADSRAVYRALD PALAARFAERELLYVRNFGQ GLDLPWQQAFGADDPREVER ICAARGIDCEWREGDDGEPL LRTRERCQAVARHPRTGELV WFNQANLFHLSTLDEDMQEA LVDAVGIENVPRNVYYGDGA PLEPDALAEIRAVLDGQRIV FPWRTGDVLMLDNMLSAHAR DPFEGPRKVVVAMARSHRET GGA >_0005.001025_ YP_322096.1 gi|75907800|ref|YP_322096.1| Isochorismatase hydrolase [Anabaena variabilis ATCC 29413] MDQSLRTLGVAPNAWMVNQA IADITRPQKTPQPVILSTET KTLRLDLAKTAIIVIDMQND FCHPDGWLAHIGVDVTPAAK PIVPLNNLLPELRAVGVPVI WLNWGNRPDLLNISANLLHV YNPTGEGVGLGDRLPKNDAR VLMAGSWAAAVVDELQQLPQ DICVDKYRMSGFWDTPLDSI LKNLGITTILFAGVNADQCV LTTLCDANFLGYDCILVKDC TATTSPDYCWLATLYNVKQC FGFVSDSQAIFTALNHPEAT GRDK >_0120.001268_ YP_001304494.1 gi|150009751|ref|YP_001304494.1| glycosyltransferase family 4 [Parabacteroides distasonis ATCC 8503] MIQIILVNEESRASQYGIGT YIKQLCLILSQKQGIHLSVI CCRSTKNDFYIERKGDIDYY HIPDTIQNNDPEIRIKTYYK NIWYLIFPYFSSHSKENIIL HINYYQHIHLLNTFRAFFSK GRCCFTIHYMDWCFKIKGNY SYLKSILYKELDSIKNPLEK NIKEICEKEKNVFEQVDKII CLSNSTQNLLSKIYDLPINK FCILYNGLKDESILLNSDKR LNLKKNFLFSKEEKIILFVG RLDEIKGLGELIFTFKKLLQ IDPYCHLVIVGDGFYSYYLN SCNPTWNKITFTGKLNKEDL YKLYQIADIGVLPSFHEQCS YVAIEMMMYGIPLVASTSTG LSEMIEDGVSGYHIPIIEYE NHTDLNTYELQSKLLILLKD SSMRKEMAKNSRLRYERYYT SEIFSSCMQEFYNRFVL >_0118.000557_ NP_266393.1 gi|15672219|ref|NP_266393.1| transcription regulator [Lactococcus lactis subsp. lactis Il1403] MKKTSQEKLYEAALELITAQ GYENTTVLQIATKAGLTERT FFRKFKNKADIFFAGGEQYS QMLLTKMKESKANNPLAIVL ESYFYAADFFDEHRERTVKR QKIIASHPDLEERELLKQSK IEELLVEYLLTAYGERSARL AVRLARAVYSVAWEEWLEND ADSLHHLLEKAIADFNELKN Y >_0113.002264_ YP_001088596.1 gi|126699699|ref|YP_001088596.1| probable amidohydrolase [Clostridium difficile 630] MRGEIRMMDILIKNAIIVTV NKEREVIFDGALVVKDNKIA DIGNSKEIESKYTDVKKIID AKGKVLFPGFINTHNHLFQT LLKGLGDDMVLKDWLETMTF PAANYLEPKDTYDAAMLGCI EGLRSGITTMVDYMYPHSKP GLCDGIIDAYKELGIRGILG RGCMNTGAQFGVHPGIMQDV ETVEKDVRRLFEKHHNTENG RIKIGVAPAAIWSNSQEMLE MLWRVVKEYDDALFTVHISE TPFDREAAKELHGQYDIDVL EKLGILGPNVLMVHCVYLTE KDMELTKKYDMKVSHNTASN MYLSSGVAPVPEMLKKGITV SLGVDGAASNNSQDMLELMK LTALQHKVNKCDPLAMSAEK VLELATIDGARAIGMEDEIG SLEIGKKADLLIFNPMLSPK AIPMHNPVSTLVYSSSMKNI ESVIVDGNIIMEDSKILTAN EEKALKDAQDTAERLCVRGT IKNRMEGHKWNSLY >_0108.0003161_ YP_461003.1 gi|85858801|ref|YP_461003.1| cell division inhibitor [Syntrophus aciditrophicus SB] MMQKPKVGQDVPEPSAGGMF QRDAMSPSVAHKPVLELKGN LLSLMVLYLFDSDRNLIEQQ FTEKISGAKNFFKNAPIVID LHALQNTSVPVDLSHLSDLF RKHGLVPVGVRGGNVQQQQA ALNLSLGILPDAKPASSRGE PQTEVAVQATRDKVFTQPVR SGQQVAALQGDLVVLAAVNP GAEILAGKNIHVYGPLRGRA IAGVHGYSEARIFCRYFDAE LVAVAGQYMLNEDFEDSVRG KPVHIFLENDRLKIQPFEI >_0104.000883_ NP_637720.1 gi|21231803|ref|NP_637720.1| transcriptional regulator tetR/acrR family [Xanthomonas campestris pv. campestris str. ATCC 33913] MTPRSPRAARRSDCDRRIHA AVHALLAERGMRVSMTAVAE RAGCSKQTLYSHYGCKENLL RDVLQEHVQLATVPLGTATG DLREDLLAFALAHLDRLNRP DVLQTCRLVEAESHRFPGQS QQIFHEGVVGMQERLASRFA QAIDAGQLRHDDPHFMAELL LSMIVGLDFDRQRFQVPHRA GLPARQQWAQFAVDTFMRAF AAVVTPPRTPARSFLTSRP >_0095.004434_ NP_490521.1 gi|17233417|ref|NP_490521.1| resolvase [Salmonella typhimurium LT2] MSQPPLPAVCTQAASALLPV AIDYPAALALRQMAMQHDDY PKYLLAPEVSALLHYVPDLH RRMLLATLWNTGARINEALA LTRGDFSLAPPYPFVQLATL KQRAEKAARTAGRMPSGSQP HRLVPLSDNQYVSELQMMVA TLKIPLERRNRRTGRTEKAR LWEITDRTVRTWIGEAVEAA AADDVTFSVPVTPHTFRHSY AMHMLYAGIPLKVLQALMGH KSVSSTEVYTKVFALDVAAR HRVQFQMPGADAVAMLKGGS >_0093.000304_ NP_341740.1 gi|15897135|ref|NP_341740.1| Uroporphyrinogen III synthase, putative [Sulfolobus solfataricus] MRVLFLRPDNEDNELNEKLM ILRKNGIEVLNIPIFKIKCV SYSLPTYDYEALAFTSRNSV ICFKDRTLIKNVHKIYAIGE ETAELLMKMYNVNPITPERF TSIELAKRILEDKVNSILSI RSRKASEDMRNILNGKIKYD EIYVYDSEIIRDNINEISKI LTECEVDAIAFTSSLMAKLI GPFIRGKCNIIIFSIGPMTT ETLKRVNNKVKIIESKTHSI KGIIETILVEMKRNGRD >_0091.000039_ SPUT_CN32_28JUL04_CONTIG102_REVISED_GENE40 sput_cn32_28jul04_Contig102_revised_gene40 MTTFSVVESGETTFIREQAF GGELFTQSILSFYGMSYEQA EKAKIEGDLPRNYMFEVLSP FQTQLLQQVKRTLQIYCTSS GRDKVDYLVLCGGTSKLEGM ANLLINELGVHTIIADPFQG CLHADESIKHILQPNISKYM VACGLALRSYGQWRT >_0085.002653_ SHEW_20DEC04_CONTIG154_REVISED_GENE2655 shew_20dec04_Contig154_revised_gene2655 MLKKVSRLDYYTLQVFIGLV ELKSGSAVAERLRTTQSKVS RSLTCLREVLEDELFIRQQY GFEPNQVALTIYPMVKTIVE QYDKIVAATIDKGAEPYQLC VATYEQWSLMAMNCVHNTCH CIEGGVSINIQPWTDKVNQR LCQGKVDCSISTEPINHPMV NNFKLGDITHFFIVARRDHP ILTSDDPLAQMFNYHIALVN THLQEQEPHPIELYAKARQI EIRVALKSPSLRMLVDHVSH SEDIALLSSAMSMSYFENRD DVDYLDISRIWLQTRDLETE SYYLHCHKGIKPELANCLRR VLTEKLIEMQAHCDQVAQER TPLAQPSIAPEPSA >_0078.007005_ NP_827747.1 gi|29833113|ref|NP_827747.1| putative polysaccharide deacetylase [Streptomyces avermitilis MA-4680] MREPAVPILMYHSIATAPND ATRALSVAPEAFAEQMALLG DLKITPVSTAQLAESWRSGG PLPERPVLITFDDGYEGVHR HALPVLAEHGFASTLFVSTG WIRGAYDTGGGLDTMLDWDQ VRELADAQVEIGGHSHTHPQ LDEIDDDALRFELLRCKEIV ADELGARPVSFAYPYGYSSR RVREAVREAGFAQSLAVGNA LARRRQGPYALRRVTVRRST GVEEFTRLVEGRAIALDFAR DRALTKGYAMVRRARQVRRK AIRSRV >_0078.004195_ NP_822892.1 gi|29828258|ref|NP_822892.1| putative LysR-family transcriptional regulator [Streptomyces avermitilis MA-4680] MDLTPTLPRGGTADRGPLTP HRPSTTPFCDRSPRARVALE SLSSADITDGLTEFELDAAM TYLDDDTLRHVRRFPLYEER YVLLTPVDGPLAAQPTARWA QAAALPLCLLGPRMRNRRII DECFAADGATATPAIESDSV AGLYAQLPGGRWSSVISHAW LHMFGVPEGMGVVPLEGPAR GPRVGLVVARSEPRSALAEA LLTVAREADVRDALDSLLHT YLGGGHG >_0067.000611_ NP_142471.1 gi|14590405|ref|NP_142471.1| hypothetical protein [Pyrococcus horikoshii] MVVNMFEDIDTFEEAFNKLL REVLEFDLQNPFKDAKKVLC IEPHPDDCVIGMGGTIKKLS DMGVEVIYVCMTDGYMGTTD ESLSGHELAAIRRKEEEESA RLLGVKKIYWLNYRDTELPY SREVRKDLTKILRKEQPDGV FAPDPWLPYESHPDHRRTGF LAIESVAFSQLPNFSNTDLD IGLNPYNSGSFIALYYTHKP NYIVDITDLMELKLKAIRVH RSQFPDDIWEKWEPFLRTIA MFYGEKIGVRYGEGFRIMPG LFYHITPFTDLI >_0063.005062_ NP_253691.1 gi|15600197|ref|NP_253691.1| probable glycosyl transferase [Pseudomonas aeruginosa PA01] MTRSAEPRVLQFCHGYDGPF LDCARQYASLFAGTPYKVTT VFLTGRSDPEVAARCASDEV LFLEYGSRDIRGLKLKAIRD LREIARSRDFRFCIAHRFKP IWIASLATRLPIVGVHHAFG DYQRLSRKVFAGLMSKRLSL LAVSDAVRDDIRRCLPKWPA ERIETLYNRIDVAALQAEQV ERLSARRQLRLPDEAWVVGN VGRLHPDKDQATLLRGFAAA LPRLPQNSLLAILGSGRLEE QLKDLACELGIGERVLFLGQ VEEARRYFKAFDAFALSSDH EPFGMVLLEAMVAGVPLIAT ACGGAREVVEGVGILFPLGD ETALAEGLTHLAALDRRQRE ACARLMLERLETRFSDAAVR REFWRLPTIAGFASESAC >_0061.000909_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF1827 npun_22dec03_Contig1_revised_geneNpF1827 VEASYAQRLRQEKGYAYADN CNTKTSLSIKMSMVLLTPTV SSNFEKDSLIYMAKKIHFIS NSIVKEINNSDNVSILVLDK EEIISLFKANGILLFRGFDV DVDIFKEFTNLLSIDFINYA GGAFSRRVINGDQTVLSVND FKSEIKLHGEMYYQKNIPLM LWFFCANPPLEDGETTVCDG RQFFHEISSSTKELFRQKNL KFTVRMSKEDWQKKYKTDDV NQLKEICRNNNTHLKIFDDR SIMLEYISPAIIPSRCGNYQ VFINSLLPTKQLSPNILKFE DDSDIPEEVVSELNEIAEKI TTEISWRKGDILMIDNTRIL HGRRSFADDQRDIYIRLCSP AFSF >_0057.000483_ NP_842028.1 gi|30249958|ref|NP_842028.1| possible capK protein, putative [Nitrosomonas europaea ATCC 19718] MARHALMRTELLPTFGMPDK KVDGFIARIRERHPKMIFGY PSAMSHIVPCAEQRGVRFDD LGIKVVFCTSERLYNHQRET IVRLFGCPVANGYGGRDAGF IAHECPNGNMHVITEDIVVE IIGEAGRVLPHRQSGKIVVT HLATRDYPFIRYRTGDIASL SEETCSCGRGLPLLTDIQGR NTDFVVAAEGTVLHGLALIY VVHDLNEVRTFKIVRKNREH TRILLAPEAGGDMTGIDTII IDGFRCRLGAKVTVKFVDAV LAEKSGKFCYVASHVVLH >_0053.003928_ MMAG_12JAN01_CONTIG3878_REVISED_GENE3949 mmag_12jan01_Contig3878_revised_gene3949 MKSGWPGLMARAPSRISEAV VNVDRMRQIDFWAGVPLAFL MTLYWRIRCFFSPPAPSSAK NILFIELSEMGSAVIADAAL KRAGALFPEAKVYFLIFAKN RPSLDIMGTIARENMLTIRA DSLFTLAIDTIRMIGKMRSL DLMAAIDLEMFSRFSALLTF LSGAPKRVGFHRFHTEGLYR GELLTHRVPYNPHQHVAKCF MALVHALTEPEGTTPHGKVL VTDEEIRLAPVIPTPDVLEA FKARLFEAYPVLRRVDRWVI FNPNASELMPLRRWPYDRYM EVARRLLAEDESLAVIITGV ASEKAEAQTLVEATGSDRAV NLAGFTRMEDLIPLYALSKA MVTNDSGPAHFAAPVGLPTL VLFGPETPALYGALNDKAEF LTARLACSPCVSAMNHRSTA CTDPACMRAITVEQVHATMR RLLG >_0031.000605_ NP_295485.1 gi|15807653|ref|NP_295485.1| putative transposase [Deinococcus radiodurans] MTVDVRQTLTGSEAPSRLEQ AADQLDDVLLDLRTVQQLDL AAVVADGNYAKEPIVETVTG HGLPFISRLPRNANLNDLYT GEHPRRRGRKKKFDGKVDFS DLQRFDLVSARPTERVWTQV VWSVQWAREVRAVVIQQIGK KGQVTGYAVLFSTAVTMPAH EVMALYRSRFEIELIFRDAK QFLGGQDVQLRSQQGIEAHW NVVLLTLNLCRLEALRAAGG GQDLVFSLEDMKRRAYNALL AQVILSNLDLSARFEE >_0026.000490_ NP_662896.1 gi|21674831|ref|NP_662896.1| diaminopimelate epimerase [Chlorobium tepidum TLS] MSGAGNDFIVIDNRQGLFNL THEQVRAMCTRRTGIGADGL ILLETSETADFRMNYHNADG FPGTMCGNGGRCAVWFAHLI GIRPTGKHYRFEAGPSTYEA EVTGEESVRLHMLPPSDFRD GLQAGAWNCHFVDTGSPHAI AYVNNLDQLDVLTEGGNIRH NKELFPDGTNVNFLEITAPD ALSIRTFERGVEDETLACGT GTVAAALMSFRLGKVTSSLV RVKVKSGETLMVGFNEMMDE IYLEGPARAVYRGTITL >_0023.002922_ NP_600029.1 gi|19552027|ref|NP_600029.1| archaeal fructose-1,6-bisphosphatase [Corynebacterium glutamicum ATCC 13032] MEGMTNPEQTHPAASLEDMI KTITKTFVIAHDQDSDEHLA QALVYNAGRLAWRMRENGVD TDYKTSVSDVVTDADRAAEA FVAGVLEALRPEDGVLGEEG ADRASKSGKTWVIDPVDGTY NFTQGSDYWCSALALVEGDP SAPSRVLFGAVHRPAMGYTW FGGPGIRTTLDGKELDLLVD APLNQISLATYIHPSRIAEP DIQKAWMSVATHPATLRMFG AGSIDLANIADGSIGAWVQH SVADWDWLPGRALIEGVGGA CIKVTAGGVEWSVAGNAEAV SEISETLSALD >_0020.001789_ CAUR_25MAY01_CONTIG1131_REVISED_GENE2644 caur_25may01_Contig1131_revised_gene2644 MALRIAFLTVGDPQRRTGGY LYHREVFRCWQARNQPVEEI VLGPADVAGQIAASNRAGQL VEAGRYDVIVVDALARAVVA PWLAHWQRLCPLVTLVHELP TVAGASDPREYEWEQMLLRA DALVAVSDDGANTLIARGVE AARIHIASGGCDRLLSRMPV GKAREELVIAVAQWIPRKNL AHLVRVWGQVAEHPWRLELI GESDADPVYAGEVWQAIQHC PANVIVRGVLADDELAERYA RASLFALPSRFEGYGLVFAE ALACGLPVIAGAVGPVPALV GAGGMLVHPDDEQALSVALR TVMRDAHLRQHLSTAARQRA STLPRWQDTAQHLLCAIEWA YAQRHRVST >_0018.008870_ BFUN_06OCT04_CONTIG482_REVISED_GENE8871 bfun_06oct04_Contig482_revised_gene8871 LNMTDQTSTRFPPTRDAHAP VPSGDNSFAWSDARLLGFAL MDEVHKEFYAVALNLVTCTD ATAATAIDRFERHAVSHFEQ EDQWMRSTNFPPRDCHIDEH AAVLKSVSEVKEAVEQGRAG AEMVRDLGMHLFEWFPGHAD YLDSALAAWMMKRTMGGKPV VLRRKI >_0018.001855_ BFUN_06OCT04_CONTIG481_REVISED_GENE1856 bfun_06oct04_Contig481_revised_gene1856 MGWKSLLIETPVWIIGRFHR PRFDPHRDTPREILVLRPND FGELLTTTPLFEALRKRFPT TRLIAGVGSWGRPILENNPF VDEIVEIDAPWNNKLVEDRS HGNVLRFLWKSPQVAALRAR GGFDVGIDVLGSYMGALLMM RTGVRYRVGVRGYRGGWSAC QTYIEFAARQCGRAALAQGE LLGATALPEARPQLFLTDEE RATAAQIWKAGDVPGRRTVR LIVGVGAGVPSKAWDARQVG AALAQIAQTLDKAGDAGDFV IVGSAADRTRAAEAIAAAGP GVPVRSVAGEVPMRITFALT EQAGVVLTNSSMLMHVAAAF RRPTVAVIGGSVTKPDRHDA IWGYPPPYRSVSPERCVEGE HARNWPGVERVVQAVLEAVG TQRAPVGASA >_0017.001913_ NP_813301.1 gi|29349798|ref|NP_813301.1| hypothetical protein BT4390 [Bacteroides thetaiotaomicron VPI-5482] MFEIKVSQEIKNACPVFAGA AVYAAVKNTAYCDGLWKEIN TFTEDLTTTTQMADIKLQPV IAATREAYKRCGKDPGRYRP SAEALRRRLMRGIPLYQIDT LVDLINLVSLRTGHSIGGFD ADKIQGNHLELGIGKAEEPF EGIGRGILNIEGLPVYRDSF GGIGTPTSDHERTKMDIGTT HILAIVNGYNGKEGLKEAAE MIQSLLKDYAGSDGGELIYF E >_0013.001003_ NP_881529.1 gi|33593885|ref|NP_881529.1| hypothetical protein BP2949 [Bordetella pertussis] MAVLARPRPGAQNSASATPK PLPRSLAMKQALIVIDVQES FRHRPFWDDSELPAFLAQVQ ALIDGCRQRDIPVLQVFHVN AANDPASPFAPASGHVVALR ELRIAPTAVFRKTVHSSLYA RDEAGHTLHDWLLEHGVGEI IVCGIRTEQCCETTTRHASD AGFKVRFALDATLTFAMRAA SGKTYTPAEIRERTELVLQD RFAQVLPAAAALAA >_0011.002880_ YP_102444.1 gi|53725613|ref|YP_102444.1| cbiX protein [Burkholderia mallei ATCC 23344] MNKHGIVLFGHGARDARWAG PFERLAAKLRAARGAEASVV LAFLELMEPSLAAATAALAA QGCDTITVIPVFFGEGGHVR RDLPGLIDACRAAHPGVDIR CATAVGEDDAVLDAIVAYCM RASADPTS >_0009.003580_ NP_244487.1 gi|15616182|ref|NP_244487.1| flagellar hook-associated protein 3 (HAP3) [Bacillus halodurans] MRVTQGMLAGNSLRHISQGY MRLGELQDQLSTGKRITRAS QDPVVAMKGIRYRTQVAEVE QFKRNLREVYNWMDTADAAL GEATDALARIHELATQAAND TYEETQRANIAKEIRQLYEH LQSIANTKNSGKYIFNGTNT TNPPIVAPERMDLGFAHLLE PGTDLQTVEIVYDGQTYHYV GENADGHLVFQDVRQMNSLP FDETDSDEFNEKAFQLVIDP DLGRITSSQPNVNTGAAETK NVRLNDVVVADREAVSFNTQ RVEIELLKGVKIPVNTNPTN VFNNALFGDILRLEKALEDP SVSGAELTAYIDNFTKHIDQ VTAERAELGARVNRVEMLES RILQQEITAKRIMSDNEDVD LEKVIIDLTIQESVHRAALA VGARIMQPTLMDFLR >_0009.003106_ NP_243894.1 gi|15615590|ref|NP_243894.1| septum site-determining protein [Bacillus halodurans] MTTQKKQAVTIKGTKDGLTF HLDDRCSFDSIVGELAEKLS SKHYQMEDQPRIQVKVDVGY RYLTVEQKRHIQELITDGRN LDVEEFVSQVMTKEEAEEKR KEAQIVSVAKVIRSGQVFSV DGDLLLIGDVNPGGTIRASG NIFVIGSLRGIAHAGYKGNG EAVIATSHMAPAQLRIGEQI FHWSKEEQQAGDRIMECAYI ASDTNEIQLDRVQKLLKIRP NLATFLDEMVEQ >_0006.004351_ NP_890690.1 gi|33603130|ref|NP_890690.1| putative DNA-binding protein [Bordetella bronchiseptica RB50] MAPFSICVSILLLVGDSNRA GVTARYVSEQLGLSLSTAYR WIDMLRQSGLVVRHPSTRKL HLGPLAFDLGVQAVARPYVP PAIEKVLKPLASQHACRLHL VRCSGDYSVLQRTYFDVAEL EIVPSKVAVMRLLGIGPAGV CLLSRWPDYEIDRYLVRNQD GLLAAGYQGADIRERVRQCR KLGCFVTRSVLTPGYTAVAI DFEVDGVVYGVSAASAGSEG GKQFHAIREALDRARASLAT QAVPEFALEG >_0006.003501_ NP_889320.1 gi|33601760|ref|NP_889320.1| putative transcriptional regulator [Bordetella bronchiseptica RB50] MIQESLYYFGMNAEQAVSSL GALAHAQRLSVFRALVVAGP AGLTPSVMADGLGIARNALS FHLKELAHAGLVSVEQQGRN LIYRADFFRMNALLGYLTEH CCQGATCEVSDSRKNTAGCD AGARRA >_0006.003174_ NP_888669.1 gi|33601109|ref|NP_888669.1| hypothetical protein BB2126 [Bordetella bronchiseptica RB50] MRHIEISYSLGPSDPRPALL RNALMDLLSAVREHGSISAA AKALDLSYRHVWGELKKWEQ TLGRTLIVWDKGQPARLNEF GEKLLWAERQAQARLAPQIN ALRADLERAFAVAFDDAAHV VPLYASHDNALQALREHAVA SAKLHLDIRFTGSVDAISAL NEGRCVMAGFHTREAPEPGS LAERTYKPMLQPGLHKIIGF AQRSQGLIVARGNPHGITSL ADLAASGVRYVNRALGTGTR VLFDELLEQAGLKPAAIAGY ERTEPSHAAVAHAIVSGSAD AGLGIEPAAHRERLDFIPLV RENYFLVCLKSTLDQPSTQA LLSILRSPAWQATLAAIPGY APAQTGQVLSMRRVLPWWDF ARKKVRRARPA >_0005.000700_ YP_321517.1 gi|75907221|ref|YP_321517.1| Protein of unknown function DUF820 [Anabaena variabilis ATCC 29413] MSLSTPVFANKTLEEFLKLP ETKPASEYIDGKIYQKLMPQ GEHSTLQSSLVTAINEIAKP QKLAYAFPELRCTFSGNSIV PDIAVFEWSRIPLLPNGRIA NKFEISPDWIIEILSPEQSP NRVIRKIMFSMQNGAKLGWF LDPNDESIMVFQPDILPEIK AGKEILPVMSVLANWQLTVE DIFSCLNFS >_0004.000437_ 17738794 gi|17738794|gb|AAL41469.1| transcription antitermination protein [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:444725-445291, Atu0450 MLSMAAENQPGKHEWFVVET KHKAEKAVEDALRKAGVKVF LPLETIGETVVRGRIIPAVS RPLLPGYVLVNIVYSPAAVC GIARLEGVAGFVGGMVHPHR VSDEEMNRFKAFGDDETAPD VKHCEQFKRGDKVRFVLGPF ASFGGNILKLRKDRAVDGER VATGAVVAVDVFGKVSTIEA PLALLEQL >_0120.000124_ YP_001302391.1 gi|150007648|ref|YP_001302391.1| hypothetical protein BDI_1003 [Parabacteroides distasonis ATCC 8503] MAKYKLIKKVNPQKRQEPGK WYATPKSETPLSGKAMTCAA TANTTTAPIEMEASLELLAK FVPQQLQQGHTVKIPGLGTF RLTFKSDGVEDINSFRANSM IKNVRIVFTPSKELRESVLS GLTFEDGGVLEDDISYASIA DYRLAKGVPEGGGDSESPGE I >_0112.001116_ YP_910030.1 gi|119026185|ref|YP_910030.1| diaminopimelate epimerase [Bifidobacterium adolescentis ATCC 15703] MSIPSIVYKAHATGNDFVVY LDEDGTHEPTADEVRFLCDR HFGIGGDGLIRLAHPQAVSD VNERQIADCAAGDADWFMDY RNADGSLAEMCGNGTRAITL FAQRQGIADQPGGKPFHLGT RAGVKVLTSLGDVPGLGKDV FQVEMGAWKRGDVDGYEVTI PGTSGSARGTFVDMGNPHVV AVLEDAFASLPNVEDLDLVT KPVVAPEIPSDQNVEFVRID EQSEGDDAGEATMRVNERGC GETLSCGTGLCATAITLRAK TGIDHWTITVRGGTLRVDVT DEDVKLTGSATIVGKIELL >_0111.002919_ YP_209875.1 gi|60679731|ref|YP_209875.1| hypothetical protein BF0135 [Bacteroides fragilis NCTC 9343] MYKNLFNLLTILLILPSCTD MSPNISVVCEENNIGNCILK WETTPLIKGQVKVYTSDNPE FIPEDNPVAMANISDARMTI VTNDPSRRSYYMLVFNDKYR VKVAPRNVNMPGIQNFRDLG GYKSATGKHVRWGKLYRSAQ IDSLNCFALRKLQNLGIKTI LDLRSESELHNTPPLQKGFN VVHIPINTGDMEHILHGIQQ EKIKTDTIYHMVEAMNRELV AKYQKEYKEIFDILLDKNSY PVVIHCSSGKGRTGIVSALI LASLDVNADIIMEDYRLSND YFNIPKASKYAYNLPVNSQE AITTLFSAKEDFLNAAKDEI ERKYGDVPTYLRKAIGLQSE DIHRLRTILLE >_0110.001841_ YP_001301271.1 gi|150006527|ref|YP_001301271.1| hypothetical protein BVU_4047 [Bacteroides vulgatus ATCC 8482] MIDYSVFMMGNPMDVDAAQK AYAKAQVSEIMPFSQFVKHI ADHNGVFTRGTVKGVIADMC ECLVEMLLEGKKVQLAELGN FWISIGSEGAEDLKKFTESN ITAVNIVFTPGEDFQNLRSR AAFNPVASRIAQAATLKAEK AGKGTVDLDAAKGKTPASTN ENNPSL >_0106.002325_ SWOL_07JAN05_CONTIG94_REVISED_GENE2326 swol_07jan05_Contig94_revised_gene2326 MVKIKGLNGNLVFIFGPGSF DEYLSFLEEKLTANKQLFSG SRVIFKGEELKKLSHSQIAS LQELCLRHGVIMNNNAVTVN KGSSKDLVIYRNLRSGQKLR SEGSVIIWGDVHESAEILAA GDIIVLGKLDGIAHAGCYGD MNRLIFALSLSPGQIRIGDR LSRSPEDPDKSPHPEIAFWD GENICIKAYSSRSGLRG >_0106.001312_ SWOL_07JAN05_CONTIG131_REVISED_GENE1313 swol_07jan05_Contig131_revised_gene1313 MILLNLGTYPPKQCGIATFS MDLRNSLLINGNEVKVMAVS DNSYQYHYTDEVLFNLKQNH KQSYIRAANYLNKAPLELII IQHEYGIYGGMEGEYIAELV RLLHKPFVLITHTVLPRPSK RQKQVLNYLCSRASAIVCMT RRSEHLLSDLYEAPPELIQV IPHGVPEFKEQTQDSLKEKY GLQGRDLISTFGLIGPGKGL ELGIQAIAQIVKEHPQATYL ILGQTHPMLKKQEGEKYRHM LEDMVVKLGLEHNVVFVNKF LSDEELGEYLYMSDIYLSPY PNKDQAVSGTMAFAIGCGRA IVSTSYAYACEFLSGGRGLL AAEADPEELAGLMKRILTNP DLKQSLQYNALKLGKSWSWP SVGQQYTRLFEKLLAETPLT KEPRISYARL >_0105.000024_ YP_055669.1 gi|50842442|ref|YP_055669.1| RNA polymerase sigma factor [Propionibacterium acnes KPA171202] MRYTDFIDGVEILDPAEEHD LFRRLDAGAIADAILAGHFL AAVPVSEAELRRISDDGVGA RQTLWRQMLAIVLTQARQAA VSYRCSVEDLIQAGCIGLGE AIERYDVRRGARFSTVAWTW IRRRIGEEVVRLSGARSRSV MREAAEVARIEEELTARWQS VPSSDAIAARMGKDVMWVEK RRVECWEVETSFFEQVAVVQ KSCDDPESGLIGLLTGQERR IVELRHGFEGEPMTITAVAH ELGLSASNVRRIEQRALGRL RRHLQQVVAA >_0089.001388_ NP_345962.1 gi|15901358|ref|NP_345962.1| ATP synthase F1, delta subunit [Streptococcus pneumoniae TIGR4] MDKKTVKVIEKYSMPFVQLV LEKGEEDRIFSDLTQIKQVV EKTGLPSFLKQVAVDESDKE KTIAFFQDSVSPLLQNFIQV LAYNHRANLFYDVLVDCLNR LEKETNRFEVTITSAHPLTD EQKTRLLPLIEKKMSLKVRS VKEQIDESLIGGFVIFANHK TIDVSIKQQLKVVKENLK >_0088.004312_ NP_716575.1 gi|24372533|ref|NP_716575.1| hypothetical protein SO0946 [Shewanella oneidensis MR-1] MKTKETPEQKPEDMPIPIVD ITSVEQQTVSLNLPSYGVVT PKYKTQLVTEVQGRMLTISP QFVAGGIVKKGDQLAQIEPS DYEADLMQAEATLAQATAAL NEEIARGEVAKIEFKGYDKG LPPELGLRIPQLKKEQANVK YAQAALARAQRNLERTIIRA PFDGIIKARNVDLGQYVTLG TNLGELYDTSIAEIRLPISN DDLAYLESVDNPDTQVTLSA SLAGKENTWIGNIIRSEGVI DADNRMVYLVAEIKDPYLRE HKTQGSLPLKYGSFVNAVIK GRTVDGIVKLPRHVVRNEHV ALINDNNIVEMRHVNVVRSD LQNVFIKDSLKTGERVAITH FNNMANGQLVKVIGEETKPA QTPAPESSLAATGVK >_0084.001391_ SFRI_16AUG04_CONTIG77_REVISED_GENE1393 sfri_16aug04_Contig77_revised_gene1393 MKTLAQNIQEKLKASGLTQK ELAERANISQVMVHKLISGK AKESSKLVAIAGVLGCTAEE LMYGLEKHVPTSNAEWAGPM ETWDSNTPLNDDEVEIPFYM EVELAAGHGIAEASEYHGPK LRFAKSTLRKSSVDPTNAAC VRVSGNSMEPVLPSGSTVGV DTSQTDVIDGKMYAINHDGM LRVKTLYKLPGGGLRLRSYN IDEWPDERYEGEAIKQIKII GKVFWYSVLM >_0058.001153_ YP_208482.1 gi|59801770|ref|YP_208482.1| putative transcriptional regulator, repressor [Neisseria gonorrhoeae FA 1090] MKGSGRIFMETFKDRLVFLW KSEARQAKIASDIEMTIAGF SRIWNEGGLPKSETLKKIKQ LKGCSIDWLLTGEGNPFPDE APKKSLAYDTLGNEVDTDEF VFVPRYDIRAAAGYGQFVGH EEPVFTMAFRRHWIENYVTR DTKNLSVISVKGDSMEGVLN DGDSILVNHGENTPRDGLYV LRINENLLVKRLQIVPGGII NVISANEAYPAFEINLNDLT DDVEIIGRVEWFGRTV >_0056.003497_ SARO_25NOV03_CONTIG30_REVISED_GENE3721 saro_25nov03_Contig30_revised_gene3721 MNAPVPAADFEFPTEIPGLP LAGRKVLIVVENLPLPFDRR VWQEARTLKAAGAQVSIICP TGKGYEKRFEVIDGIDIHRH PLPIEASGALGFLLEYGAAL FWETVLAWKIFLKRGIDVIQ GCNPPDLIFLVALPFKLLGV KYIFDHHDINPELYEAKFDK RGFFWKLMVLFEKLTFKAAD VSMATNHSYRKIAIERGGMD PDKVFVVRSGPDLSRLKRVP PVESWKNGRKHLVGYVGVMG DQEGIDLLIDAVDHIVRVMG RDDIQFCLVGGGPSLAKLKA LVAEKGLADFIQFTGRAPDQ DLFEVLSTMDVGVNPDRVNA MNDKSTMNKIMEYMSLEKPI VQFDVTEGRFSAQEASLYAR ANDPVDMAEKIVELIGDPER RARMGALGRMRVETELNWGH QIAPLIAAYRKALCLAD >_0055.000954_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA0323 rgel_26jun05_Contig562_revised_geneMpeA0323 MAGLFIIAHAPLASALKAAA GHAFPEALSVVEALDVTPQL TPEEIEAQARILLERSLSVP GRREALILTDVFGATPCNVA QRLADGVRVKIVTGANVPML WRTMNYVDQPLDMLVARALA GATQGVLQVAPARPQNQAQN VGARPGDDFNAHQHQQ >_0053.001607_ MMAG_12JAN01_CONTIG3805_REVISED_GENE1620 mmag_12jan01_Contig3805_revised_gene1620 MAEEYRRAVNGDETMIEHAS QLLHAFRRDSVRDYCADWAD LMEYCRFSAAPVGRFLLDLS GENH-YPFAPSDALCAALQV LNHLQDCGKDYHELKRVYLP QDWMAAQGLSVDVLAGSSSP AALRRVIDQTLDATDGLIAL AQPLPGLVKDWRMRLQSAIT VRVAEQLSAKLRRGDPLAER VKLSKLDYAGVFLVGLWRGL TA >_0046.001003_ NP_469828.1 gi|16799560|ref|NP_469828.1| hypothetical protein lin0485 [Listeria innocua Clip11262] MEVEIVERNAFTAVGKKRTF SVENDAQKEKISQFWQEANA NGDAERINELAEFATIDGIL GVCQMNGDKMDYYIAIESEL TPPEDMEKLTIPASKWAIFK SVGPLPSSIQKVWEYIYGEW FHTSNYNHGNAPELEVYTEG DTTAVDYYSEVWIPVVEKE >_0034.004442_ NP_418470.3 gi|49176454|ref|NP_418470.3| transcriptional repressor of Zn transport system [Escherichia coli K12] MFFLEHGKVRTFLTPTLRCP MEKTTTQELLAQAEKICAQR NVRLTPQRLEVLRLMSLQDG AISAYDLLDLLREAEPQAKP PTVYRALDFLLEQGFVHKVE STNSYVLCHLFDQPTHTSAM FICDRCGAVKEECAEGVEDI MHTLAAKMGFALRHNVIEAH GLCAACVEVEACRHPEQCQH DHSVQVKKKPR >_0032.003530_ YP_009112.1 gi|46562180|ref|YP_009112.1| glycosyl transferase, group 1 family protein [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MNILTITALPAGGATYAAQR LHSALVEYGHACTLACCDPV QPGHVALNYQRPDATPFMSP LFRYWSTLSTPETREQGACE LFSDTSTLLASPALPDHIVE AADVVHLHWASGILYSPALF RLLRNKKLVWTLHDMNPFTG GCHYHVSCERYEHECGNCPL LAHPAPNDVSLQAHRFKRLL YRELDITMVSPSAWLADKAR RSPLFTGKKVAVIPNTHDTT LFAPRDRASLRSQYGVEKDT FIVLTGVESLDNPRKNIGCM VEAMEQFAQRRPHISFELVL FGAGGANLLASFPTRHVGSL DAESLVDWYNMADVFVHPSR LDNLSNTLCEAQCCGTPVIS FDAGGSTEAFQDGETGFISG NQPQHLAEALDRLLSSDMES LRQKARMFAMNRFNGKNIAQ RYTDIYEDDIINQEHHGMHG PADERLASNAMDGFLKVAPF DKPAP >_0030.002655_ DHAF_12NOV03_CONTIG1083_REVISED_GENE3023 dhaf_12nov03_Contig1083_revised_gene3023 MIKRRRSLLILSGNSPKSMQ DAPVFDPDVVLFDLEERVVP QEKDSARHLVKEALSFLDYS KTEVMVRINPLDTEDGQKDM DSISRVKPGALVIPKANHEV IKQVDSILTTIENEEGFPQG GIELIPVLETALGIETINTL IETSPRVTGVWLNAEGLMDE FGSKRTKEGDELRYVRGRVG IACRAAGIHAIDTCFADSND NEGLEKDALTAKRMGFTGKL ALDGRQIDTLNAIF >_0026.002084_ NP_661521.1 gi|21673456|ref|NP_661521.1| transcriptional regulator, NusG/RfaH family [Chlorobium tepidum TLS] MTNALKKDGCWYAVYVRSRY EKKVHQYLLEKGLSSFLPLI ETLRQWSDRKKRVEEPLIRG YVFVNINYHKEHVHVLETDG VVKFIGIGKTPSVISERDID WLKRLAHEPDAIGETVISIP VGKKVRVLAGPFKDMEGVVK KEGREERLLVYFDSIMQGVE ITISPELLAPIEKGASGQAV EGSKTGDHEVESAIRHLAHS >_0021.000024_ NP_419762.1 gi|16125198|ref|NP_419762.1| cytochrome P450 family protein [Caulobacter crescentus CB15] MDLISQTVVDGKAGPGAPPT YPTLKDVDLADIFRFTKGQP WADFARMRQEAPVMWHPEPM GGPGFWALTRYEDVHRVNGD PETFSSQRGGILMSMGAPEK RHALLFRASMDTMINMDAPH HLQLRREHMPYFTPSYLRGL TERVKGEVTRLLDEMEPLLA NGAEIDMVEHFSSVLPLFTL CEILGVPPEDRPKFLTWMHY LERAQDLAVKQANAPMQPTL ELMQFVMDFNNNVEEMFEYG RTMLHKRREDPKEDLMTAIA RAQLDGAVLPDEYLDGSWLL IVFAGNDTTRNTLSGAMRLL TEFPDQKQKLIADPSLLGGA VDEFIRMVSPVVYMRRTATR DVEVNGQLIREGEKAIMYYG AANRDPAMFENPDQLDVTRA NAGKHIAFGYGPHTCIGKRV AQIQLEEAYRQILARFPDLN WTGNIEIAPNNFVHAISKLG VKRG >_0018.007510_ BFUN_06OCT04_CONTIG482_REVISED_GENE7511 bfun_06oct04_Contig482_revised_gene7511 MSKNDLADLLRVGPETTMGK FMRQYWLPAVLSHEVKADGD PVRLMLLGEKLIAFRDSSGR VGVLDHRCPHRCASLFVGRN EENGIRCVYHGWKFDVDGNC VDMPSVPARLDFKEKVHARA YKAVERNGLIWVYMGENQNN PPPLPAIEANMIPGSKIWCL QRQCNWLQALEGDIDTSHVG FLHVGALQPEDLPEGSPLRP TVLNRAPEYRVQNAEWGATY GAYRLAEDGRMSWRAAQFMF PFWTMTPNILFSTRAIARAW VPMDDTHVMLFDITGGVDEG NPFFTTKLKNGEPMYEKIRR KPNGTGWYDRWLPVDGLQND WGIDREAQRNMIHYTGIDNP SMQDQAITESMGPITDHSFE HLAPTDQMIAFVRRRLTQAL RNFRDDSAPAPCAFSPETYF GARSGAFWADPVVEFSTAYR VELEKAQRWPANPDGSAVER EPEVAR >_0018.006164_ BFUN_06OCT04_CONTIG482_REVISED_GENE6165 bfun_06oct04_Contig482_revised_gene6165 MKPVPEAAAAAEHGVAAGQE LPAGKRKLIEAALRLTAGGR SFTSLGLRELAREAGLNPNT FYRHFDTLDDLAREAVESVS RRLRPMLRRERWLAAHDEPH SVPRRACVAFFAFTLENREA FLSALAEYHGTSRALREAVR ANLHEVSAEMADDVVELELM PTLSRETVDEVCTQIVLQLF HLSAEYIDANEARRDALIAY AERFIVRLFAGAVLLAQHEA GRAPLSMRA >_0018.001261_ BFUN_06OCT04_CONTIG480_REVISED_GENE873 bfun_06oct04_Contig480_revised_gene873 MRAVILDAAVELLRQTSPEK VSLREVARLSNVDPGLVRYY FKDKSGLLTEVGSRVLASLA VTADEKANRPMSFKKHLEAR IADLLSLLSHNRYLHQVILN QIIHWQKDAGRGALTQLADN AIRRSTAMIEQGNKAGEVRL VDPRFLHIAIIGLCESFVTM SPLVAEFFGGEPDADVELAY RDFVVDLLMHGLTTEPTAKK KPSRPKNRQAKKELKE >_0017.001127_ NP_812077.1 gi|29348574|ref|NP_812077.1| tRNA/rRNA methyltransferase [Bacteroides thetaiotaomicron VPI-5482] MRKLKITELNRISAEEFKQA EKLPLVVVLDDIRSLHNIGS VFRTSDAFRIECIYLCGITA TPPHPEMHKTALGAEFTVDW KYVNNAVDAVDNLRKEGYIV YSIEQAEGSIMLDRLELDKT KKYAIVMGNEVKGVQQEVID HSDGCIEIPQYGTKHSLNVS VTTGIVIWDLFKKLR >_0016.000754_ YP_438436.1 gi|83718071|ref|YP_438436.1| class III extradiol-type catecholic dioxygenase, putative [Burkholderia thailandensis E264] MLISRPAARPASCCSCRDTP SFARSPTHRTGCLFMGKIIG AGLISHAPVVMMPRAVRLRE NDGRDFTLATGLARLRREVF DAHDYDTVLVLDSHWRTTTE AVVTAHARRTGRFTSDEMPN AIRQLPYDLAGDPELARAIA ELATRRACWIAAVDDPCLPI HYATLNPWTYLGRPDKRWIS MSVCQTATTDDFLRMGEIVA QAISRLDRNVLLVASGGLSH AFWPLAELRRRMAGAASNIV TPAARAADERRIAWLEQGRH DRVIDAMSEFLRFDPEANFG HYLMMAGAIGARACAARARR FSEYENGIGTGHVHLWFGPV DGGWTRAETRAEREAARA >_0011.001470_ YP_102872.1 gi|53723420|ref|YP_102872.1| RNA polymerase sigma-70 factor, ECF subfamily [Burkholderia mallei ATCC 23344] MRAPSRPAARTPGAREHGAL VDVLVANRPMLVKLARGFVG CASRAEDVVHDVFVKLVDFP NQDAIRQPIAYVTRMVRNAS IDACRRQTLENTYHADEDDG LDVPSPELSPEAALVVRDTL RHVYDALAQLPARSRAAFEM VRLREETLQSAARALNVSQT LVHFMVRDAERHCVACVDAS ERGLACPAFCGARARTVKKC VRDSSIE >_0011.000002_ YP_102680.1 gi|53725402|ref|YP_102680.1| transcriptional regulator, TetR family [Burkholderia mallei ATCC 23344] MLDFPAMKPIRLTREQSKDL TRERLLSAAHAIFTKKGYVA ASVEDIASAAGYTRGAFYSN FRSKAELLIELLKRDHEEAE ADLQKIFESGGTREQMEAHA LEYYSQFFRNNPAFLLWGEA KLQATRDAKFRARFNEFVKE KRDRFTHYILTFAERVGTPL LLPADVLALGLMSLCDGVQS YHAADPRHVTGDAAQQVLAG FFARVVLARAPD >_0008.004450_ NP_976706.1 gi|42779459|ref|NP_976706.1| transcriptional regulator, TetR family [Bacillus cereus ATCC 10987] MPKIVDHDEKRKQIAEAAWN IIRKEGVEKASIRRVAAEAG MSSGALRHYFSTQDEMLLFI MNYYLEEGEKRSQNKEWSEN PVQAVEEVLLELVPIDEEKK IETSVWWILALRSLTSDTIK DKKDEMTDGTYELANSMIEI LALKGVLSDSMNAELEKSRL TALIEGLSIHALLRPDVYSP EKVKEVIRYHLETLCNKIN >_0003.003925_ ARTH_26JUL04_CONTIG47_REVISED_GENE3931 arth_26jul04_Contig47_revised_gene3931 VKAAASIEAMGASTGLERLH HLFELFAVLAYAPAEERRYL ADEWFRPQLDGQAAAVVDII LEYVFTNHAGNVKMSEAAAL VGMPEPTFSKYFKRATGQNF SALVRKLRLAHARRLLERSD KAISEICYEVGFSNLSNFNR HFLNDAGETPRNYRQRVQS >_0117.002052_ YP_794280.1 gi|116332753|ref|YP_794280.1| Transcriptional regulator [Lactobacillus brevis ATCC 367] MSNQTKEKIIIATQQIMLET ASINVRLTDIADRVGITHGA LYRHFPDKQSIMVAVAKRWF EQAILQNIHVASTVINPLNF LHDWLWQFANAKKAAYRDSP QMFELNTLYVENDPAILREV LLSSMKMIDDLMNFHDATYL RAEIILAAFSEFTLPSFRKS WYSPDYTVRFERLWQLISMG LTNL >_0113.001123_ YP_001088737.1 gi|126699840|ref|YP_001088737.1| putative conjugative transposon protein [Clostridium difficile 630] MCIFFSFFLKFIEDLTPSHR DRSKQGYLTPCDGKTIEHQF DSFCKTVLRNYARDIYDENK RRNDYLVSLESLSLAELSKL SILDDYDSNYICMVSYDYNI RIEDVLMAQAIGKLTKRKQD IILLSFFLNMTNADIATLMD LAENTIHYHKTNTLKELKEL MQEH >_0111.004314_ YP_212899.1 gi|60682755|ref|YP_212899.1| hypothetical protein BF3287 [Bacteroides fragilis NCTC 9343] MYNLKNEPNMYIENYDFKEW MQKLFDKLDELCKDVRVLRN VDKVLSEDDNLLDNQDLCLL FKVSIRTLQRLRSKNKLPYM LISGKVYYRASDVREFIKER FNAVMLRNFEKQFGEKK >_0109.002601_ RER070207002602 REr070207002602 MRENTGKVVLVGAGPGDTGL LTLKGEKYIKQADCLVYDRL SSPEFLSMAKAGCELIYVGK ENHKHVMKQDAINELLYEKS KYHELVVRLKGGDPYVFGRG GEEALYLVDRNVEVEVVPGV SSSVAALAAAGIPITHRGIA KGFQVITAHSRKDEEADIDY SLLTDETITCVFLMGLAHVK SIAAGLMKAGRRADTPAAVI SNGTLAAQRKCIGTLADIGE KIEEAKLTSPAIIVVGDVVS LNDRLDFFEKRPLFGRKITV PYIKTNELIAKLQQLGADIT PVKTGIIKPVIIPKFVDKVR SADWIVFTSKNGVRSFFYNL DLAGADIRLIANARFAVVGK ATEKELVKHHINADIIPAEQ TGKGLADAMKLCMPYAYGVS DEDTFDLGKNSTLDLSNDNI SDKMCKVCIFSAKEASPDLE AGLKEICELEKIDAYVNEQA YEGIPESIGNMVSEAVFTSA SNVERFFHMLPENAYVETAY SIGEKTTAALEQHNVREIVQ ADDSSYEALVDKISYKDAFK D >_0108.0001331_ YP_460475.1 gi|85858273|ref|YP_460475.1| coenzyme F390 Synthetase / phenylacetate-coenzyme A ligase [Syntrophus aciditrophicus SB] MSRIIHLNMPPRPYGMHAKL HWEQSTCKVKPAGAMYTNWL EKRLYQGVFYKINLFNKGIS EEMLATYYGRDLLTQNQIAE IQNEKLREFLKYCNDNSPYY HKQIQEHSVNLRAEDMFAEL RKIPSISKMEITRNLESIFS REYVGRKGLIAKFTGGSTGT PLEIWGNHDDYRENNVIIAR QRRWVNWVGGMKTMTLFGGS VDLPSSLRRITKRVLINDTI LNIMDRSNTDFSSIVAVIRK KTPHALIAYFSILKELSQVC ECEKRPLSGINVAIACAEPI EERARRHAEQWLNAPIYFQY GTRETGTFAQECRAQNGYHY AQDIIFCEVLDDDDNPCEFG NLVITWFANKVVPLVRYKIG DSASIVTEACACGLPYHRID RIEGRMASMIITPDNRRITS LIFPHLLKDYDWIIEYQAEQ TAKDHIVIRIRTNRKSSISD ALEDIKKKFNELLGRDICLE FKINEDFLKVPTGKHLYFVS HLNSIPGCTE >_0108.0000556_ YP_461888.1 gi|85859686|ref|YP_461888.1| glutamine amidotransferases class-II [Syntrophus aciditrophicus SB] MCRLLGITNFEFSKHQQIVL NFCELARSGMVMKGDPPGHA DGWGVAFYQNGELEVYKSGG NLLEETDQALELLSKTRECP VVILHLRKSAWKNTTSFRHA HPFRYGNVAFAHNGTVYDYK ELIPDITPSVLRKDALDTEV FFHHFMRNPSPELGKAFLHT VSLIKSLCRYSALNCLFSDG LKLFAYRDYSREPAYYSLFK AFSGNSCFISSQTLDKITCW ELMKKEEFLVV >_0107.002108_ AFE_2164 AFE_2164 transcriptional regulator, LysR family {Acidithiobacillus ferrooxidans ATCC 23270} MKSPVHLNALRAFEASARHQ SFSAAAAELNVTAAAVGQLV RSLEDWLGTPLFVRGSSGRV RLIPTEAAERALPDIRAGFD RLTLGILLREHWRLQKPADL AQETLIHDLSMDRHTAFPTW EAWMKKAGVTDVITQRGMRI NNSAAVLQAAIEGHGVALAR SVMARDDLASGRLVRLFPDI DFASALAYYVVYRPECASLP KLTTFRDWLLAEAAPAQAGN GEF >_0097.000804_ NP_394439.1 gi|16082017|ref|NP_394439.1| 5, 10-methylenetetrahydrofolate reductase related protein [Thermoplasma acidophilum] MLNRRKEMLSSSIGSLNFVK SVEIVPSRNFGTDDLIAAAD MLDGRVNALTCPENPRGSPG IDPIMALYIISNERNIIPVP HITPRDKNRVHILSQIETAQ KVGIRNFFTIGGDPINPKYE SREVREIDVMEIIRMIKGRE NAIVGAALNPYRDVEPEIVG AKIKSGADFFISQAVYSAEY LQKDWIKKRNFKLIAGFLPL TKKSQVKFSENLGIVIPDSV KQRLLNSEDVISTSMKIITE VFDEVKEYVDGIHIMPLGHN EIAAQILETI >_0091.001517_ SPUT_CN32_28JUL04_CONTIG148_REVISED_GENE1520 sput_cn32_28jul04_Contig148_revised_gene1520 MTSASELVYMSAKQVAEYLD LNEKKVYAMANDRILPATKI TGKWLFPKVLIDRWVMDSCH SGMLTDRLLITGSDDPLLSM LVARLMSQVGSRELISYSAT GSRLGLELLAKGYADVCTLH WGSMEDRNIRHPALLKGYNN HQQWIMVHGYSRQQGLIMRA DMHHRCQEEDKVVSLPWRWV SRQGGAGSQQHLEHWLLKQG ARLDQLNVLLTAYSERELAG YIARGDADIGFGCQSVALES GLSFVPLVKESFDFVMPQSI YFRRQLQQLFTMLSSCHTRQ MAALLGGYDLTDCGQLLWSA S >_0091.000105_ SPUT_CN32_28JUL04_CONTIG106_REVISED_GENE107 sput_cn32_28jul04_Contig106_revised_gene107 MPKLLGIAYKTVKNGLMNEV LYANVTQLSGVEKDIFGRPG KRQVTVLSKQQWLIACQSIN VDLPWTTRRANLFVDGLVFS SADVGKHLQIGELLLEITGE TDPCKKMEVAHIGLEAALSP DWRGGVTCRVIVGAMIHQGD TVTLVTESLS >_0087.001363_ NP_721940.1 gi|24379985|ref|NP_721940.1| hypothetical protein SMU.1604c [Streptococcus mutans UA159] MQGKDIILGILSKKERSGYE INDILQNQLSYFYDGTYGMI YPTLRKLEKDGKITKEVVIQ DGRPNKNIYAITESGKKELA SYLQSDVNDEIFKSDFLMRL FFGNSLNDDDLEQLIREEIE RKEEKIKRLSENLEIWKKKG ELTPTQEITIKYGLAQYKST KKVLEEELAK >_0082.001066_ NP_764517.1 gi|27467880|ref|NP_764517.1| competence-damage inducible protein cinA [Staphylococcus epidermidis ATCC 12228] MIEDAIVLPNKHGMAPGMLV ELGKQKIILLPGPPKEMQPM AKNELLPYLMDKDEVIFSEL LRFAGIGESKLETLLIDLID EQTNPTIAPLAGTHEVYVRL TANAESKERCQLLINPIRDE ILNRIGTYYFGSDEVSIEES VINSAKQNFAIYDGVTNGAL FTRLKNADNKNLVKGMLPHS NQFIDVTSEFNTVLFNAAQY VRDLYQTDLGIVLLNKDNIV YLGIYDGNNFDIETFKMSQS RNLLRSRSQNYAMIRLLNWF NK >_0081.000995_ SDEN_20JUL04_CONTIG112_REVISED_GENE952 sden_20jul04_Contig112_revised_gene952 MTQVQGSLTCVGVGMMLGGH LSPIAHSHITQADVVFSGVS DGFVEQWLSGLNADVRSLQV HYGEGKSRNISYGEMVQVML AEVRLGKKVVGAFYGHPGIF VKSTHEAIAKAKEEGFAAKM IPGISAESCLYADLGIDPGK VGCQHFETTQFMLYHRQLDP SAHLILWQPGLAGDLTYGIK PTGRAERQLLVELLSKDYPL EHECILYEAATMPLQSGRIE RLPLNHLPLAQIALHTTLVL PPSQRLSPNNEMRTKLQALA LQQDQALHQGSTAQVLPFTT LKRNLL >_0077.002408_ NP_373152.1 gi|15925618|ref|NP_373152.1| similar to transcriptional regulator [Staphylococcus aureus subsp. aureus Mu50] MKGVIIMSTNKNDYEHMLFY FAYKTFITTADEIIEKYGMS RQHHRFLFFINKLPGITIKS LLEILEISKQGSHATLQKLK EQGLIIEKVLETDRRVKKLY STDKGDQLIAELNKAQDELL QNIYQQVGSDWYDVMEALAK GRPGFDFIKHLKDEKES >_0074.003746_ RRUB_10JAN05_CONTIG98_REVISED_GENE971 rrub_10jan05_Contig98_revised_gene971 MTEKLTGKTPTGAPAMAGQA VGDQRGADDRAGRFSERLAT ALQGRSANWLAGRIGGSAST VRDWLSATSEPGLGKLVATA ACLGVPLDWLATGRVAEGAT APPSDLFGERGGGAVTRAPL SVRPAEDETAPHRGLALYDT VAAAARMGGGHPGAAGLDLP ESYLRDHLRLDPQRAIAVAV HGDSMEPALSDGDLALIDTA TRALERDDLYAFLLEGEGYI KRLQKAGAAVIVHSDNPAYS DWTIPREIMASMHLLGRVAG VLHRL >_0073.006342_ YP_298346.1 gi|73537979|ref|YP_298346.1| putative citrate lyase [Ralstonia eutropha JMP134] MTTATHPNDALFAGEKSFPV LAACEHFAGSEKLIGKAMDL QVEYGPVFDVTCDCEDGAAA GQEREHAEMVARMIASDRNV HGRAGARIHDPSHPAWRQDV DIIVNGAGGRLAYITVPKAT NSGQVAEVIRYIGDVAKRAG LDKPVPVHVLIETHGALRDV FQIAELPNIEVLDFGLMDFV SGHHGAIPAAAMRSPGQFEH ALLVRAKADMVAAALANGIV PAHNVCLNLKDAEVIASDAC RARNEFGFLRMWSIYPAQIQ PIVNAMRPDFTEVEDAAGIL VAAQDADWGPIQYKGELHDR ATYRYFWEVLQKAKVTGMAV PAEAERRFFVN >_0072.000795_ PROC_21JUN05_CONTIG39_REVISED_GENEPMN12A0797 proc_21jun05_Contig39_revised_genePMN12a0797 VIKKQLKLILVSTPIGYLGS GKGGGVELTIVSLIKGLISL GHKIILIAPKGSKLPFESEL LEIRLIDGVDQPSWQHQDRK DPVLIPSKSVLPNLWEEVID IANKSDAVINFAYDWLPLWL TKTQSIKIFHLISMGAESIV MKETISEISELFPFRLAFHT KRQSKDYSLKTDPIIVGNGF DTDDYLFNEMENGPLGWAGR IAPEKGLEDAVKVANDLGEK LLVWGLIEDKEYASKIENTF TNEIVEWKGFLPTNKFQEQL GRCRALINTPKWNEAYGNVI VEAMACGVPVIAYDLGGPGE LIEDGFNGFLVKPNDIEGLI KATKSISEIKRKNCRAWFEK KATSKVFAERVENWLHKGLN KKISADLQD >_0071.004766_ NP_746840.1 gi|26991415|ref|NP_746840.1| hypothetical protein PP4732 [Pseudomonas putida KT2440] MCLSHWNEYRTIQTLSYAQN LFSARCEAPPPSLGSDAWMT THIQRSALLPYPAQALYDLV NDVASYPEFLPWCSASTVIE TSDTHMRAKLEVAKGGMSQH FVTRNVLVPGQSIEMNLEEG PFTQLHGVWVFKPLGEKACK ISLDLSFDYAGPIVRATLGP LFNQAANTLVDAFCQRAKQL NA >_0063.001587_ NP_252410.1 gi|15598916|ref|NP_252410.1| probable transcriptional regulator [Pseudomonas aeruginosa PA01] MNDASPRLTERGRQRRRAML DAATQAFLEHGFEGTTLDMV IERAGGSRGTLYSSFGGKEG LFAAVIAHMIGEIFDDSADQ PRPAATLSATLEHFGRRFLT SLLDPRCQSLYRLVVAESPR FPAIGKSFYEQGPQQSYLLL SERLAAVAPHMDEETLYAVA CQFLEMLKADLFLKALSVAD FQPTMALLETRLKLSVDIIA CYLEHLSQSPAQG >_0060.000078_ NMUL_10JAN05_CONTIG12_REVISED_GENE79 nmul_10jan05_Contig12_revised_gene79 MTNANQRNQRKKQPALVRRR LLDNGARLCVEQGVAGLTLQ AVADAADVTKGGLLHHFSTK QALIEEIFSERLQAFDTSIK QAMKLDPEPSGCFTRAYITA TLELIEKDDFIQWAALMMAM LTEPGLKKRWTQWLEQSLEQ HRDTDSAPEMRLARYAVDGL WLAGLLDEEERVATKREALH LHSQLIEMTRKP >_0055.001801_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA1171 rgel_26jun05_Contig562_revised_geneMpeA1171 MARMKFICDAERCIECNGCV TACKAEHEVPWGVNRRRVVT LNDGVPGERSISVACMHCSD APCMAVCPVDCFYRTDEGVV LHDKDICIGCGYCSYACPFG APQFPTNGTFGLRGKMDKCT FCAGGPEANGSEAEFEKYGH NRLAEGKLPACAEMCSTKAL LGGDGDVVADIFRTRVLTRG KGSEVWGWGTAYGKPTGGAA QPAAAEGGKS >_0055.000870_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA0239 rgel_26jun05_Contig562_revised_geneMpeA0239 MLYPELFKQLESVRWNMDKD IPWDQFDASRLSEEQAQTVK MNAITEWAALPATEMFLRDN RDDSDFSAFMSVWFFEEQKH SLVLMEYLRRFRPDLVPTEA ELHEVRFEFDPAPALETLML HFCGEIRLNHWYRRAAEWHD EPVIKAIYETLARDEARHGG AYLRYMKRALQKFGDEAKAA FAKVGVLMASARRTAQALHP TNLHVNERLFPRDTIQSRLP DPEWLEAWLDKQIQFDAVWE NKVVERILHNLSLLMERSFE SVQELNRFRKEMTQKLAATA AAASAAGGPPGTQPA >_0053.003083_ MMAG_12JAN01_CONTIG3864_REVISED_GENE3100 mmag_12jan01_Contig3864_revised_gene3100 MHALTGGWGLDVPGHTPAEL YALAADIESYPRFLPWCQKA RVRSRDGDHLEVDNLFGLGP MQARFISQAHQEPPERLTIT SQDGPFRRFRLIWAFSAQGD GCRVEAEYKMELRSPMLQSM AAMTLPAMEHKVVQNFRNRV REVYGR >_0048.002503_ YP_015257.1 gi|46908868|ref|YP_015257.1| transcriptional regulator, TetR family [Listeria monocytogenes str. 4b F2365] MITNESIMDATLCMMAKHGI KGSTTRQLAEAAGINEATIF KKFKSKDNLIHMTLEVQFES MKAEINQFFDKDFESAKVFL RQASQFISDIYEKYRDFMVI SVREMGSKDMEFIDPSIVEY LYERVNQKVKEMVPSKNSAQ EADAISLILNSVILLIMVEK VRDDIYKRPPTITTTADSLA DVLLKLLK >_0043.001227_ JANN_22DEC04_CONTIG22_REVISED_GENE1228 jann_22dec04_Contig22_revised_gene1228 MIAIFEPWLKASHRYTSAYL ANSQSTGDDLSAFLSKYEID RSVGVISLAQAPLFRDDEDE AATRKADGAPQSVERVSMPT PQRIRVLQTHPYVLCVGTLE ARKNIWKIAQSWQRLANIDG LSLPRLVFAGQPGWMIRDFE RMMDATGYLSGWIETVERAS DAELDALYKNCLFTIKASYY EGWGLPIGESLAYGKTAVVS QTSSMPEVGGNMVEYCDPNS IDSIVAACRKLIDDPGHRKT LEARISAASLRQWGSVADDL LAALDAHVAETPRSDMQTPV LS >_0032.002026_ YP_010772.1 gi|46579964|ref|YP_010772.1| AMP-binding protein [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MSRFAPLDALLAERMKCDSP PDAAALERWQMERLRATLEH ARHSPYHAERLAGIAASSLY SRHDLARLPFMEADILREAP LKLLCVSQDDVARAVTFDTS GSTGPPKRVFCTAEDIENTI TFFSHGLLSFMQRGEALLAL LPAERPASVGRLLGEAAGRV GVRPLSASPEHGWDSIADMA HREGVRAVVGSPLHVREFAL AWHRAGFPSGAISTVLLCWD AAPRALRALLAETLGCEVRH HWGMTETGMGGALSCGEAGM HLRENDLLVEVTDPVSGVPL PDGTWGELVVTTLDRRAMPL IRYRTGDRGRILPGTCPCGS PQRRIDLVPGRLADERLLPD GTPLRLIDLDECLLGLPDVV EYRACLHEDAPAVLAVDVLL RGDIPRLGPTMTAREESVAI LTRLLSTASPIARAMRLGTL ELRVAARDHLPPETLFAKRR IATRCQASS >_0030.002065_ DHAF_12NOV03_CONTIG1074_REVISED_GENE2351 dhaf_12nov03_Contig1074_revised_gene2351 MVDKEQTTQQRILQAAQEEF LRLGFQNSSLRSIAKACGVT TGALYGYYADKDALFEALVR EEAETLYTIYLRAHQEFEAL PPEQQLAEMTQQIEPRIWKC FNYVYEHYDAFKLLICHAEG TSYENYVHRLAEVEVESSTI FLSCMENMGRRVTHIPDDLN HMLASAYLTGFFEVVAHDMP KEEAREYIERLTDFFSAGWK NLLGLV >_0019.003208_ NP_349655.1 gi|15896306|ref|NP_349655.1| Sugar kinase [Clostridium acetobutylicum] MIIRAKAPLRISFGGGGTDV EPYCNEYGGVVLNTTIDKYA YCSIVPNNTDSIVVNSLDFD MTVKYNCNENLVYDGKLDLV KAALKRMNINKGCEVYLQCD APAGSGLGTSSTVVVALLGA MAKWKGVVLDQYALASIAYE VERKDLKIDGGYQDQYAAAF GGFNFMEVDGSDVVVNPLKI NKGITNELQYNLLLCYTGNV HVSANIIKDQVNNYVEKKEE VVNAMHEIKALAYAMKKELL RNNLNNFGSLLHYGWEMKKK MSSRISNPQIDELYEAALKK GALGGKLLGAGGGGYLLVYC PYNKKHVVAESLEKMGGQLT DWNFDLGGMQSWVVDDSRWN YNEIAVATTEGKYKFGINTM EDIG >_0019.002352_ NP_348610.1 gi|15895261|ref|NP_348610.1| Uncharacterized protein, YIIM family [Clostridium acetobutylicum] MMAKVVSVNISKKKGTIKQP IEEGFFKINHGLDGDAHAGD WNRQVSFLAVESIEKVGNKE IGLGPGKFAENITTEGVCLY KLPIGIRIEIGDVVFEVTQV GKECHFGCEIRKKLGDCIMP KEGIFAKVLQEGKIKTGDTI KIIYR >_0018.008978_ BFUN_06OCT04_CONTIG482_REVISED_GENE8979 bfun_06oct04_Contig482_revised_gene8979 VGIVVLTVPLIICARQRALT RHLGGDDRSSHTAGSANLLL IILTGALWHAILGRAGLDSP PVVRCARHATGDLDGDTATR SCRHPSELPPFLTMRDVSSN STRPSATGLMWLDDPAIVAT VVTLSAGAQRVVIAPEVGGA LAAFYETTPDGPLHWLRPAA PAAFAERDPLRMASFPLLPY CNRIRDARFEFDGGTIDLAG NDARFAHALHGNAWRHPWQV GARTESSVELHFEHEPDSRL RGDWPFRYRAQQRIELSGGA LHITLSAQNLAARPMPFGMG HHPYYPRTAQTRVYAEVGAM WHADADVLPTHLGPHPAVAA LREGMSADAFDLDNNFANWS REATIAWPDEHRRLTMIADA PFDHMVVFAPANEAQLCVEP VTNTTDCFNAVGMREQVGGC VLQPGEKIAATVKWTPHKN >_0018.007661_ BFUN_06OCT04_CONTIG482_REVISED_GENE7662 bfun_06oct04_Contig482_revised_gene7662 MHTTWCACSTGWHAQHHDGM QDAPPPTPSSKEIRSMNAPA SPIASAATLRHHFAHVVLPI WRGPGFNSAMHLPYEAVSAK AGHSPMPVERYRAMACARQL FVFSQAGEAAHAHVLFEALL HYFKDRQRGGWFYSIDAHGQ PLDTTKDLYTHAFLVFACAE YAARFGSGDALDVVRSTSAL IETRFAAPHGLFNASLNADF STVTGTPVQNPLMHLTEAWL AARESTQDSAFDSALRKLAN AIERTFVHAPTGCIAELPVG ADDNRLEPGHQFEWFWLARQ ASSVLNGTALDDALSRAFIF AQQHGVDPATGGVCASLDET GQVRDATQRIWAQSEYLRAL ATRDHDAAGAAALSGQIGRF RQRFLHPLGWRECMTATGEV SRADMPSTTPYHLATAYAAL PVEENAARNVAAA >_0018.005459_ BFUN_06OCT04_CONTIG482_REVISED_GENE5460 bfun_06oct04_Contig482_revised_gene5460 MTIVSAFLLPGSPLPQLRPD ITPWGRIREGLSRAGKALAE SKPDCVLVYSTQWFAVLDQL WITRARSTGVHVDENWHEFG EQAFDIHSDCALAQRCVDAC NAAGIKTRGVDYDQFPIDTG TITATTLMGFGSASLPVVIA ANNLYHSAEQTEQLGAIASA ALQDKRAAVLGIGGLSNSAF RENIDLEKDHILSEADDKWN QRVLALMESGDIEAIRAILP QYSTEARPDMGLKHFYWLLG AMNASFKKATVHEYAPLYGS GGAIVQFDVQ >_0018.005454_ BFUN_06OCT04_CONTIG482_REVISED_GENE5455 bfun_06oct04_Contig482_revised_gene5455 MHNATGQSPRLRLAIAPGAP SPQLSALLALQRAEEPDVAL AFFEVSGRDLHAGLHDGRYD AGVSIQVSSDAALKAQPLWA ECMAVAMPLRFPLLDQATLT IADLLDFALYRWQAEICSKL DERLSTPLPTGRQNIRYVTS FGLLATWVAAGYGVGVSAQS RIEHAHRWGITMRPLIDGPY EIVTHLQRPQEPTSSVVERF ERRALQVARASAA >_0017.000180_ NP_810116.1 gi|29346613|ref|NP_810116.1| putative UDP-N-acetylglucosamine 2-epimerase [Bacteroides thetaiotaomicron VPI-5482] MVANRNLNQTGTESENVYYV GNILVDTVRFNRNRLLKPVW FSVLGLQEGNYLLLTLNRRV LLNNKENLRKLMQTMIEKAA GMPIVAPMHTYVRNAIKDLG IEAPNLHIMPPQNYLFFGYL INKAKGIITDSGNVAEEATF LGIPCITLNTYAEHPETWRV GTNELVGEDPAALAQAMDTL MKGEWKKGELPERWDGRTAE RIVQILTSK >_0017.000134_ NP_810043.1 gi|29346540|ref|NP_810043.1| hypothetical protein BT1130 [Bacteroides thetaiotaomicron VPI-5482] MMNTDNRLLTRESSEHIREF FSTVERLSVSMERLFAGRSP AMAGENFYTDRELAEKLKVS RRSLQQYRDSGLLAFTRLGG KILYRSSDIEKLLDGCYREA RTRPEEL >_0014.000967_ YP_109327.1 gi|53720341|ref|YP_109327.1| putative AraC-family transcriptional regulatory protein [Burkholderia pseudomallei K96243] MRPAGPRTRARLYTRTIMHA PSPPPLDARLSVPAADFVGG EVPFGLQSVCRTLAEANAKL ERFAWLGDHLAIAEWTRVTD ESETVYAQPGHHTLSCYLDG GYRTERQKIARYGAPSLLCA LPGDHESRWWVRGEMHFVHL YFLPEHFARRAVRELDREPR ELKLADRTYFEDARVAALCR SLALERWDDADGRLRVNETA HEVLSLLLRGQSMTGAGAPF KGGLAPAVRRRVRDYIDTYL AQPMTLGELAQFASLSEYHF SRMFSVSFGRAPHAWIAEQR LARARTLLRTTSLPLAQVAA QCGYANAVHLSHRFRDTHGA TPGAYRRAMQAA >_0006.001973_ NP_890958.1 gi|33603398|ref|NP_890958.1| MarR-family transcriptional regulator [Bordetella bronchiseptica RB50] MIDTPDLETRAAPEDHHALR LWLRMLTCCNLIESEIRSRL RTEFDTTLPRFDLMAQLQRA PKGMKMGELSRHMMVTNGNI TGITDQLEKEGLVVRTKVES DRRSSVLKLTPQGKRTFARM ARAHESWVTGMLDDLPEASR HAMYKALGELKLQVVAHRAL AQRDGAS >_0005.003589_ YP_321884.1 gi|75907588|ref|YP_321884.1| Glycosyl transferase, group 1 [Anabaena variabilis ATCC 29413] MKILVASHTYIVDLNCQKLR ALSQLEPGIEVTVVVPKTWK PGGVQNKIIETQYRDEGAFK IVPVSNFSQNHQGLLTFGAD LVSLLREFRPQVIQVEQGSR GLAYAQMIALNQLLNLKAKN IFFTWWNLPYELKLPIALLE KYNLNNSHGIISGNQDGAEV LRQRGYQGKIKVMPQLGVDE TLFTPQAQPELAAKLGIKSD EFVIGFVGRFVPEKGLLTLL QALTKLPLDKTWKLLLLGRG PLQEELIKIAAENHIQDRVI LIESVPHDEVANYINLMSTL VLPSETTYNFKTLTSVGWKE QFGHVLIEAMACQVPVIGSD SGEIPYVIGDAGLVFPEGDV QALANCLLQLIEQPNFTKEL GERGYQKAMVKYTNKALEAI YKVNLKVSIKVSLEV >_0002.001421_ NP_147267.1 gi|14600746|ref|NP_147267.1| hypothetical protein [Aeropyrum pernix] MGEGLKETLEFLKVETSKCI YCGFCEPVCPTLPFGRHRGY GPRGRVFSARLVALDGKATE GDLESLYSCLLCGACMEVCP ARIDIVSVVREARALITGMN RL >_0116.001812_ YP_193741.1 gi|58337156|ref|YP_193741.1| diaminopimelate epimerase [Lactobacillus acidophilus NCFM] MVNLLKVHGSQNHFFILDQT ELDNSLTDKELRAFTKKITN PKTGILNGADGVLVINKSVR KNSLAQMRVINEDGSEASMC GNGLRTVARYLSEKYQKDHF LIDTMNASLRVHKEENFAKG VPAYSVEISPVRFNKEALPF TNLGHNRLIDTFVPEIYPGL RFTSIAVPNPHLINFVNQEQ IAGPILGNIGKRLNDNNPYF TDGVNVNFAQILDKNKLFVR TYERGVGFTNACGTGMSATS LALLLTHPDMVDINSKIDVF NPGGMVQTKVHYDDSTYWIE LTGNATITHHIIISKSDLRN NNFANVQISETKEQQAYEQF IDNLPKFNSITTLA >_0115.004016_ YP_001337498.1 gi|152972352|ref|YP_001337498.1| hypothetical protein KPN_03844 [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] MKQVGLRIDVDTFRGTRDGV PRLLELLGRHGIQASFFFSV GPDNMGRHLWRLVKPKFLWK MLRSRAASLYGWDILLAGTA WPGRRIGAGNEAVIRAAAES HEVGLHAWDHYSWQAWSGVW PQERLALEVERGLLELERII GRPVTCSAVAGWRADQRVVK AKESFDFLYNSDCRGTRPFL PQLGSGVSGTVQIPVTLPTW DEAVGTAVDIAGFNRYLLDC IHRDAGVPVYTIHAEVEGIA YADQFNELLTMAAEEEIQFC PLSQLLPADFSELPSGKVVR GELAGREGWLGREQLLTSGI >_0112.000110_ YP_909941.1 gi|119026096|ref|YP_909941.1| hypothetical protein BAD_1078 [Bifidobacterium adolescentis ATCC 15703] MDFHDARTCVIFGAGDYYDE TPAIPDDAFVVAADGGLDHA RAFGIDADFVVGDFDSITGD RPTQNDRTIALPSEKDDPDL LSALKIGWLRGARTFHIYGA LGGRIDHTISNIQLMALLAD RGATGYLHGDGSIVTAICDG ALDFPADDAVAGRMVSVFSH SDISTGVSETGLKYELHHAD MSSTRVNGLSNEFLAGRPSR ITVEHGTLIVTFPIEAPLPH VARWHGFSGDLGALDTDVSS ALVEPSGR >_0111.000969_ YP_211965.1 gi|60681821|ref|YP_211965.1| putative glutamine synthetase I [Bacteroides fragilis NCTC 9343] MMNQELLMSPNRLVTFLQKP AAEFTKADIINYIQQNEIRM VNFMYPAADGRLKTLNFVIN NASYLDAILTCGERVDGSSL FPFIEAGSSDLYVIPRFRTA FVDPFAEIPTLVMLCSFFNK DGEPLESSPEYTLHKACKAF TDVTGMEFQAMGELEYYVIS EDDGLFPATDQRGYHESGPY AKFNDFRTQCMSYIAQTGGQ IKYGHSEVGNFMLDGKVYEQ NEIEFLPVNAENAADQLMIA KWVIRNLAYQYGYDITFAPK ITVGKAGSGLHIHMRMMKDG QNQMLKDGALSDTARKAIAG MMQLAPSITAFGNTNPTSYF RLVPHQEAPTNVCWGDRNRS VLVRVPLGWSAQTDMCALAN PLESDSNYDTTQKQTVEMRS PDGSADLYQLLAGLAVACRH GFEIENALAIAEQTYVNVNI HQKENADKLKALAQLPDSCA ASADCLQKQRTVFEQYNVFS PAMIDGIISRLRSYNDATLR KDIQDKPEEMLALVSKFFHC G >_0110.002590_ YP_001299074.1 gi|150004330|ref|YP_001299074.1| hypothetical protein BVU_1776 [Bacteroides vulgatus ATCC 8482] MAIKFEFYESPNTIGTRKKR YHARVVNWQRINTDYLAREI QYGSSLTVADIKATIISLSE KLAYYLKDGARVHIEGIGYF HISLTCPETRTPSSTRANKV KFKSVTFRADKYLKHQLSDV KTERSKYKPHSMPVTKESID EALTEYFLTNSVLTRRKFES LCGLTRATAGRYIAQLAKDK KLRNISIPRNPIYEPMPGFY GKEKLPEPENETENVSATND TDLTIK >_0095.003740_ NP_462614.1 gi|16766999|ref|NP_462614.1| putative hexose transferase, lipopolysaccharide core biosynthesis [Salmonella typhimurium LT2] MIKKIIFTVTPIFSIPPRGA AAVETWIYQVAKRLSIPNAI ACIKNAGYPEYNKINDNCDI HYIGFSKVYKRLFQKWTRLD PLPYSQRILNIRDKVTTQED SVIVIHNSMKLYRQIRERNP NAKLVMHMHNAFEPELPDND AKIIVPSQFLKAFYEERLPA AAVSIVPNGFCAETYKRNPQ DNLRQQLNIAEDATVLLYAG RISPDKGILLLLQAFKQLRT LRSNIKLVVVGDPYASRKGE KAEYQKKVLDAAKEIGTDCI MAGGQSPDQMHNFYHIADLV IVPSQVEEAFCMVAVEAMAA GKAVLASKKGGISEFVLDGI TGYHLAEPMSSDSIINDINR ALADKERHQIAEKAKSLVFS KYSWENVAQRFEEQMKNWFD K >_0089.000487_ NP_344893.1 gi|15900289|ref|NP_344893.1| UDP-N-acetylglucosamine 2-epimerase [Streptococcus pneumoniae TIGR4] MKIKTDYSDIHFKDNGKLKL LIIVGTRPEIIRLSSVITKC RKYFDVILAHTGQNYDYNLN GIFFDNLGLDTPDVYMDAVG DDLGATVGNIINTSYKLMNQ IKPDALLILGDTNSCLSAIA AKRLHIPIFHMEAGNRCKDE CLPEETNRRIVDVISDVNLA YSEHARKYLHECGLPKERTY VTGSPMAEVLHKNLSAIESS DIHERLGLKKGGYILLSAHR EENIDTDKNFISLFTAINQL AEKYNMPILYSCHPRSKKRL QESGFKLDKRVIQHEPLGFH DYNCLQMNAFVVVSDSGTLP EESSFFTSQGYPFPAVCIRT STERPESLDKAGFILAGIDE NSLLQAVETAVSLAQDEDFG LPVPDYVEENVSTKVVKIIQ SYTGIVDKIVWRKS >_0088.000521_ NP_717484.1 gi|24373441|ref|NP_717484.1| NodD transcription activator-related protein [Shewanella oneidensis MR-1] MSVEIDKLLSDFQSLLAPKI FDPKFAQGTYVIAATDYAQQ VVLPKLISTLRQESPNLKII IRDFEIDKLHDLMESGSVNL AIAFPDYIPDSYPMIKLFEE YHVCVTSNHSTIAKVNTSLA EVAAYPCIIASPSRPNFKDS IDEWFKQFGLERNVVVSAPC FSVVPLYLQATDSIAFLPSR AIQALGLIEVVLDKKPNPFD IIAAWHPRYNDDALQKWIVS KLEAAYI >_0087.000094_ NP_720607.1 gi|24378652|ref|NP_720607.1| putative transcriptional regulator (TetR/AcrR family) [Streptococcus mutans UA159] MSNFEKRNRKMAEKNIRKPK QERSIEKRNKILQVAKDLFS DKTYFNVTTNEIAKKADVSV GTLYAYFASKEDILTALLKR YNDFFLTTIFADINSQDSLD RFKKNPKEWLNVLINQLLAA EDKIFHAQIEMLAYAIPQAK ALLEEHNNNLKNLTYKCLLY YSDQAANPSFKTLSLVVFDF ISALVDELLYHEHTQEEAHQ IKKTGIDSLDLIIKSYL >_0076.001353_ SAMA_14OCT04_CONTIG103_REVISED_GENE986 sama_14oct04_Contig103_revised_gene986 MARRKEHSHEQIREMAISEV ERWLETEPLQDLSLRKVAQH IGYAPSTLVKVFGSYPYLLL AVARRALEALDTTFAGHLGA ARDNSSFNNLPLDALKAKTA LNAMAQSYGQFALSSPGRFS LVFELKLPAEATLPSDHSAL IESLLQRPEACLSAMFPSLG AAQLKYQTRLLWAALHGLVA LSLQDKLFLLEHSLAELLTV QLGSQLHAISVLDAQHTEVR P >_0073.002874_ YP_296283.1 gi|73541763|ref|YP_296283.1| Phosphoribosyltransferase [Ralstonia eutropha JMP134] MNQPTNDDENLWVSWDEYHR LVARLSLNVHESGWKFDKIL CLARGGLRVGDQMSRIFDVP LAILATSSYREAAGTQQGDL DIAQYITMTRGELTGKILLV DDLVDSGVTLERVGRHLRER YPAVTEVRSAVLWYKACSKV QPDYHVTFLPSNPWIHQPFE EYDTLRPHNLSAWLKRGKRS QDGQDSAQA >_0069.000545_ NP_895641.1 gi|33864081|ref|NP_895641.1| Creatininase [Prochlorococcus marinus str. MIT 9313] MSSTLPSAVSSVNAIRLALR SWPEVDDYLNHCKGVILPLG STEQHGPTGAIGTDALTAEA LALEVGRRTGVLVTPTQAFG MAEHHLGFAGTMSLKPATLL ALIHDLVISLANHGFERIFV INGHGGNVATTKAAFAQAYS SAISNDLPVASKLRCKLANW FMAGSVFHAARELYGDQEGQ HATPSEIALTLHIEPSLLSK QRPLPEPAPAGPIHGPEDFR RRHPDGRMGSNPYLAKPEHG ASLLETAATALSEDLTSFLT AA >_0066.000208_ NP_905511.1 gi|34541032|ref|NP_905511.1| glycosyl transferase, group 1 family protein [Porphyromonas gingivalis W83] MNPPKRLLVIHRALAPYRIE LLNTLSAAFDTHIYFEFASP IEQRFDAGELAKRVHFQSSV LPPAPKIPGLKNFRPYAASL VRSLRPDVVLCSEFNLLTLT LTAASRLFSPKTKLYVLCDD NEQMAEAELHYGRGLKHRML SYVEGVILCDSRACDLYASR FATLDRERFVYLPIVQDEKV LRPLYEQVFGIGRDLRHALI PAGARMILYVGRLSEEKNLP ALIDNLSTLPDDVHLVIVGD GPMQSALMNQVQATGHPERI IFAGKKEGAELYAYYTQADC LVLPSMRECFGSVVNEALIA GVPVVCSDIAGASCLVTESN GRTFSPMLPNALSQACNDLL GSIAPFCGDSLRPSLMPFTF ERAIAPVLSLLSR >_0063.003827_ NP_251716.1 gi|15598222|ref|NP_251716.1| conserved hypothetical protein [Pseudomonas aeruginosa PA01] MRRWNGWGEESTVVELPDSA RGFLAELVGQGARLPDASLE AALAKVPASRLEADPRYSID PRERLLHARGQSLPDWLAMR EGDFGVYPDAVAYPETAEQI RELLALADSRDLQLIPYGGG TSVAGHINPQASRRPVLTVS LERMNRLIDLDRESLVATFG PGAAGPQVESQLRALGYTLG HFPQSWELSTLGGWVASRSS GQQSLRYGRIEQLFAGGTLE TFAGPLEIATFPASAAGPDL RELVLGSEGRFGIISSVKVR VSPLAEDERFYSVFLPNWQQ ALQATRELAQARVPLSMLRL SNAVETMTQLALAGHPGQIA WLEKYLALRGAGEGKCMLTF GVTGNRRQNGLSLSQAKALL KRFGGVFTGTLLGRKWAQNR FRFPYLREALWQAGYAVDTL ETATDWSNVDGLLQKIEASL RDGLAAEGEKVHVFTHLSHV YGEGSSIYTTYVFRPAASYA ATLERWKRLKHAASQAIVDN RGTISHQHGVGKDHAPYLPR EKGELGVAALRAMAGHFDPA GRLNRGTLLQD >_0062.001500_ OOEN_16SEP02_SCAFFOLD6_REVISED_GENE1814 ooen_16sep02_Scaffold6_revised_gene1814 MNFFINSNMQKNKSGIEHAE LKRAALFRDHHESFKIVLRD WSQTLHKDIKDSGLSDAEVI NMFDYFQGTEKVTEKNIQAK DIDFGVAVDLYENDPKNNRV LAWQVIKKINGESQRRLLGR INYFSFAKDRISSTEMFDAF GNLYAVNYYDIRGFLSLTQW YTPDNKIGTESWQTLDRRSV LESFNKFDAKGEFKKSGWRL VDKNGSIYTFQTIEELTKHF LDSINFDYSSQERGNVFILD RTHLGDWALRDLKKPAYTVI HLHNSQAGNAEDEMHSILNN HYEYSLWNTNDYDAIISATS KQTADVAKRFKPQARLFTIP VGVIPSKHFQEQRIPMSDRF SHKIVALARIAPEKRLNDLV KAVGIAKKEIPDISLDLYGY RDSSDNFKAYQDIKKAVEDY RLADEVKVHGYTTNVSVIEK KAQIFAVTSIMEGFNLSMME ALSEGDVGLTYDVNYGPNEL VVDGENGYIVPNGDYHALAE KIIFLFKHPDELQKKSERAY ELSKRYSETKVWQDWRELLD DAQDKWPEKISHYKSPLLFG LAENGGKL >_0056.002221_ SARO_25NOV03_CONTIG29_REVISED_GENE2364 saro_25nov03_Contig29_revised_gene2364 MQRTKVKSASRTLEVLELFM DERRPLRLNEIYKALNYPQS SATNLLKSMVLMGYLNYNRA NMTYLPTMRVTALGSWLPSM INREGGLVSLVDEIQRRTDE TVGLVAQNDLYIQYIILKTP GHEFKMAPNEGTMRLMVDSS SGLALMSRMRQREIDKIYRY SCHYGLGGETLPQFEDLMRE VRWTRQVGHAYVPKRPTPQL SSIAMPLDENLYGIPLAIGV GGMVDRISRAKQDILDVMSE AISAFKARQEQEDAREHEAL LNAA >_0056.000408_ SARO_25NOV03_CONTIG24_REVISED_GENE437 saro_25nov03_Contig24_revised_gene437 LRPRVWHDGRMERGGWVYIM ANRYRGGMYVGVTSDLIRRV WQHREGVGSSHVDDFGKTRL VYAERHEEIEPAIAREKLVK KWRREWKFALIEAGNPDWLD LWEQWYPAAMALRGVVER >_0054.000113_ NP_988305.1 gi|45358748|ref|NP_988305.1| Hydrogen uptake protein:Hydrogenase maturation protease HycI [Methanococcus maripaludis S2] MESVQGTIKSFINNSNKIAI LGIGNYLKSDDGFGVYVVES LVKNYSKTHENLSLEKEINS VNNRLILMNCGVVPENFTDV IKRENPDKIIMVDAALMHQE PGTLRVVESDEISETGFSTH SLPLSIIIKYINAHIDTEIL IIGIEPTDLEFGEPLSGLIK EKADEFSKILIEEIDSFLL >_0052.006342_ NP_102541.1 gi|13470972|ref|NP_102541.1| hypothetical protein [Mesorhizobium loti] MADFQKIRLRAAKRKGGEAE LATLLGPVPDNAAVADIPDD RILSTMAERIFAAGFVWRVI EQKWPGFEEAFLRFEPKRLL FQPDDFWHDLTSDQRIVRNP QKIKSVRDNAAFVERVSKEH GGFGEFLANWPAGDQVGLMA YLGKHGSRLGGNTGQYFLRW LGWDAFVISADMAAALCDAG LDIAESPTSKKDLDKIQAQI NQWVADTGLPRRHISRILAM SIGENHSPQSLREYMGDD >_0052.000022_ NP_102874.1 gi|13471305|ref|NP_102874.1| hypothetical protein [Mesorhizobium loti] MSARQRDENPAGIHLPLDPL PGHSSRGRLERVLRRGEFAV TTELNPPDSADPEDVYNRAK IFDGWVDAINAVDASGANCH MSSVGICALLTRMGYAPIMQ IACRDKNRIAIQGDVLGGAA MGVANMLCLTGDGVQAGDQP GAKPVFDLDSMSLLETIRIM RDNGKFLSGRKLTTPPQVFL GAAVNPFAPPFDFRPIHLGK KIAAGAQFVQTQYCFDVPMF RTFMQKARDLGHTEKVFVLC GVGPLASAKTAKWIRSNVPG IHIPDAVIKRLEGAQDQKKE GKQLCIDIINEVKEIPGISG VHVMAYRQEEYVAEIVDESG VLKGRQPWKREIRRDDQMVA DRLDHILHDEITETQVDMVK TAH >_0043.002020_ JANN_22DEC04_CONTIG25_REVISED_GENE2021 jann_22dec04_Contig25_revised_gene2021 MRAARLLQILLILQNRGRQT SVQLGEELEVAPRTILRDVD AMSEAGLPIIVHQGNRGGIE LGFNYRTRLTGLAKEEAQAL GLILGAANPMIATLGLSQPA AQARAKLVESLPDKTRDHVA AMMAQFKLVADDPVDDPRIR ALGLAIQAQTEVRIRFASRS EQLIHPVVLEMRGPQWRVRD GSTDSWIAMTDWGRINISAK RFASAKVD >_0043.000301_ JANN_22DEC04_CONTIG16_REVISED_GENE302 jann_22dec04_Contig16_revised_gene302 MAVNTYCMEPQKSEPNGQAS KSGTDRLGPEAWIDAAYVRF RKGGVSAVRVDPIAKGLGIT RGSFYWHFKDRAALLHAILK RWREEETERTIAENEAAGGD ASTRLLRLLHTCSADDGRLE IGMRDWAAQDEDAHKEVRLI DTRRIQYMATLAREAGIRSN AAEARCRVAYLAWLGSYVDA TVTSREELRAHMNTLWQMVM AK >_0041.000331_ NP_207019.1 gi|15644849|ref|NP_207019.1| nifU-like protein [Helicobacter pylori 26695] MAKHDLVGSVLWDAYSKEVQ RRMDNPTHLGVITEEQAKAK NAKLIVADYGAEACGDAVRL YWLVDESTDRIVDAKFKSFG CGTAIASSDMMVELCLNKRV QDAVKITNLDVERGLRDDPD TPAVPGQKMHCSVMAYDVIK KAAGMYLGKNAEDFEEEIIV CECARVSLGTIKEVIKLNDL KSVEEITNYTKAGAFCKSCV RPGGHEKRDYYLVDILKEVR EEMEAEKLKATANKSQSGEL AFREMTMVQKIKAVDKVIDE NIRPMLMMDGGDLEILDIKE SDDYIDVYIRYMGACDGCMS ATTGTLFAIENALQELLDRS IRVLPI >_0036.001389_ EXIG_01APR05_CONTIG280_REVISED_GENE1390 exig_01apr05_Contig280_revised_gene1390 MKKTGTALLIVDIINKMDFE GAEDLLAETRLILTSLVELK HICKSYDIPVIYVNDNFGLW QENVDQIVEECRGGLGDPFV QALHPQQDDYFIIKPKHSGF FGTQLDILLKHLDVSRLIIT GITTDMCILFTANDAYMREY EIWVPQDCTAAETTEAKNHA LEILHTTLSIDCSDSTTVSF D >_0031.000071_ NP_284934.1 gi|15795183|ref|NP_284934.1| putative transposase [Deinococcus radiodurans] MVGERSVWCGTVLSISWMPY GGNPNSSWRSWRCSSRLFAR QERLKVKLSPTELMAILQYV LSAVPLRKTQRNFLTVLLSV FLAVPGQLNALNLSRYAACS ESTIRRWLHRSDDGAIPWGA LHQATVSTAIESGLISPLCV LAIDASFHRKAGQHTAHLGS FWNGCAARTERGIEQSCCTL IDVQHRQALTVDVRQTLTGS EAPTRLEQTADQLDDVLLDL RTVQQLDLAAVVADGNYAKE PIVETVTGHGLPFISRLPRN ANLNDLYTGEHPRRRGRKKK FDGKVDFSDLQRFDLVSARP TERVWTQVVWSVQWAREVRA VVIQQIGKKGQVTGYAVLFS TAVTMPAHEVMALYRSRFEI ELIFRDAKQFLGGQDVQLRS QQGIEAHWNVVLLTLNLCRL EALRAAGGGQDLVFSLEDMK RRAYNALLAQVILSNLDLSA RFEE >_0030.002911_ DHAF_12NOV03_CONTIG1085_REVISED_GENE3323 dhaf_12nov03_Contig1085_revised_gene3323 MKKSNRQLQKVQTKETLLNK AYQLFSSQGIMNTRMSDIAQ AAGVSHGTVFAHFQTQEMLI TEVIETYGEKIARRTHEAAG TCEHMEELLAAHLAGIGEFE PFYARLVIENRLLPPEARDV WVSIQSAISFHFSQVAQREI GSGQSIDLPLYFLFNTWVGL VHYYLVNGDLFAPEGNVVGR YGDTLVDHYMKMIRRDGGKG GEKE >_0030.002299_ DHAF_12NOV03_CONTIG1077_REVISED_GENE2617 dhaf_12nov03_Contig1077_revised_gene2617 MKGLGWAKCPGTCGEWVQGA KDGTPFLVGCPINRFVEAKA EILFSESVARDNSGHQQELK DWIWQLPEGKEKTRQALERF ARIQNLPPLTGKIHMQSQLL IGKGMASSTADMTAAVSAVA QALAIPWQSEEQARLALAIE PSDPIMFPGVTELAHGDGTY IKSLGPKISAQLLMLDGGGF LDTLAFNARRDLPGHYRKYE PKIKNALALFYEGMAQRDLG KIAQAGTISARCNQDINPKP FFEEFLSWTLGKGGLGVITA HSGTLLAGIFPPDLSTTEKK NLERESRIQFRPVIVEWVET YDGGTEGGVMNAWRKSVGRL GTVWETELY >_0028.000525_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE0526 ddes_06jun05_Contig143_revised_geneDde0526 MAKHATPKLDQLESGPWPSF VSDIKQEAEMRAKNPKGLDY QIPVDCPEDLLGVLELSYNE GETHWKHGGIVGVFGYGGGV IGRYCDQPEMFPGVAHFHTV RVNQPAGKFYTSEYLRKLCD IWDLRGSGLTNMHGSTGDIV LLGTTTPQLEEIFWELTHDL ETDLGGSGSNLRTPAACMGE SRCEYACYDSQQMCYDLTQE YQDELHRPAFPYKFKFKFDA CPNGCVASIARSDFSVIGTW KDSIKIDQDAVAEYVAGKIA PNAGAHSGRDWGKFDIEAEV VNRCPSKALSWDGSKLSVNA KDCVRCMHCINTMPKAVRIG DERGASILVGAKAPILDGAQ MGSLLVPFIEAKDPWDEVKE VIENIWDWWMEEGKNRERVG ETIKRLSLQKLLEVTNTAPQ PQHVVEPRSNPYIFFKEEEV PGGWDRDITEYRKRHQR >_0024.003276_ CHUT_08NOV04_CONTIG199_REVISED_GENE771 chut_08nov04_Contig199_revised_gene771 MKVLMIGPFPPVVNGVTVSN DFLYHSLTSQGDQVNRINTE TGKIASMQGDKISLHKILTF LSIYKNIFKVSGKDVVYMTI GQTFWGVVKYSPFICYCWLA GIPYVFHIHGGYLSKSYLNM SRRKQRIIKVLFSKSSCVIA LSESLAKDIQNIFTKVNIAV VENFYDQELIHPALERTVHS VPHFLFLSNLMLGKGILEFL DALIILQDAHKLHFKVALAG NIERGMQAEIQKRIDRLKED KVYFLGLADLPKKKELLYWS DIFVLPTWYIMEGQPISIIE AYVTGNVVVSTHQGGIEEIS GYDTFFKTEAQNPALLAEVL FNSVQQLPQLKQAIEQTKKT TQIRFNSNRFVEEIKEVLLK K >_0024.000568_ CHUT_08NOV04_CONTIG199_REVISED_GENE151 chut_08nov04_Contig199_revised_gene151 MSEIKAKDKHAKDNRVKKVG TILVSQPKPTDEKSPYYVLA EKYNLKIDFIPFIQIEQVDL KEFKKQKIEILAHTAVIFTS RNAIDNFFRICTEAKIEVPA EMKYFCITEQTANYLQKYIV IRKRKIFVGKKTASDLEEVI KKHKTEKFLYPCSDIRKDDI PMFLSQNGFTFSEAVFYKTV ASDLSHLADVNYDIITFFSP SGISSLFVNFPDFEQKATRI AAFGPTTAKAVHDAGLILDV EAPLPNAPSMTGALEIYIKK ANNIK >_0022.001274_ NP_939648.1 gi|38233881|ref|NP_939648.1| Putative DNA-binding protein [Corynebacterium diphtheriae NCTC 13129] MAPAKKVETRSTDGETRRHI MLIMLKEGPVTATQLGQQLN LSATGIRRHLDILLDEGFAE ISSVRRVACEKSRGRPAKAF KLTASGRSQFGHEYDSLAAE ALATLKESGGEAAVRAFARK RVEKILADVHVPDNIDDNQL AEVATQLVEAFSNNGYAATI NNAANGVQICQHHCPISGVA ADFPELCEAEHEMIAKILGH HVQPLASIADGHDICTTNIP ITPILTPRKNTPTERSGS >_0017.003354_ NP_811247.1 gi|29347744|ref|NP_811247.1| hypothetical protein BT2334 [Bacteroides thetaiotaomicron VPI-5482] MEVITFESEAYKALVGKIEK IAGYVAEAQLPSEEKKETWL DSNQLAEALGISTRTLQRLR DENLISYSMLRGRCMYKLSE VERCLEERTIRCKPQTLEDF RKNYLTRTGNDKKG >_0016.004176_ YP_441783.1 gi|83721074|ref|YP_441783.1| transcriptional regulator, AraC family [Burkholderia thailandensis E264] MTVSGVANVAHALRQYASIN PNRLTNRPMNDRLKGLLARF ELHARVFHFGTLPGASTFDI CADGFHMHLVRTGAVCVTGG TLGLHAVPEPSAVFIGRPGK YRIEARGDAPVEILSAAIEF GLGDENPLLRGLPDLLAIPL ASMSPLDGVQQALFAEASAP ACGHDTVINRLTEVLVVQLL RFVMRNRLVASGSLAGLSDA RLAKALNAVHADPALSWSLE RMAAIAGMSRSRFAAHFADT VGLPPGEYLLQWRVGLAKTL LRRGYAVKEIAPEVGYGSAS ALTRAFAQCTGHAPTDWLAR ASDAPRATGTMPDIDVRAA >_0016.003461_ YP_443264.1 gi|83720031|ref|YP_443264.1| Aldose 1-epimerase family [Burkholderia thailandensis E264] MPNSVRAIAVARSMLAPCWD RSGTINMTGPGHYPYSLLFL TFSVSRMCAMHSAPARAHDE LAPAARPVACRTDARAPRRG DSAILGDLPWKTSTPATMPS FQNQDILELTDGASLARIAP EAGGRLLSWSIGDASIVFWP DAADWSNPAKIRGGNPLLFP FLGRHRVDGRIGFWRDGAGV VRELPMHGFARDLPFDSQAD ADGRGVTLSLHGSERTHAGY PFEFRFAARYRLVDERTLDV ALSTTNLGDAPLPHYPGHHF YFALPHAERASTVLELPPTR RRRQLDDGSISAPEPGSSRY TLDDPAILDRFHCLDGIPAE PVRVLMPGRRHAIEIDLNRP GSVPWYAVTTWTEAPGSNFY CIEPWLGLPDAIHNGLGLRH VAPGATETAALRIRVTPLA >_0016.000522_ YP_438396.1 gi|83716911|ref|YP_438396.1| flagellar hook-associated protein 3, putative [Burkholderia thailandensis E264] MRITSNQYHSIMAQVNRQAT AGLMDTQIRMATEKRILRPS DDPTGAVRLALLVQADATLD QYRATGAMLNNRLQHSERVM SGIVDQLQNEAVKHLGLAMD GSRSPEDLSAYAQVLASVRD TLLQSANARDSSGNYLFSGT AITTAPIRFDQAAPAGSRYT FEGNLEKQTAVVGDGVTEVA NDNLAEMADALNALDEAIAK MSEPDANPNDPAYRDVLGRV TDTIESVLASVNAKIGQMGS AQARLQMLDELHESAQIVGN QAALDVGKISPEEVFTDYQS YMVALQASQKLYAEVTQLTL LNVL >_0015.000075_ NP_388932.1 gi|16078115|ref|NP_388932.1| similar to hypothetical proteins [Bacillus subtilis] MNTDHTKRNLFELYAELIHQ QEKWEGLIKAFLSDELRKLD VEHGSKSQLTMTEIHVLSCV GDNEPINVTSIAEKMNTTKA TVSRISTKLLGAGFLHRTQL SDNKKEVYFRLTPAGKKLHS LHKYYHQKAEQRFLSFFDRY TEEEILFAERLFRDLVTKWY PSSEEIEGGLPSIFK >_0013.000294_ NP_880253.1 gi|33592609|ref|NP_880253.1| conserved hypothetical protein [Bordetella pertussis] MCSSPGHTFSKPVREAITLV AGLGVAGDAHQGATVRHRSR VRADPGQPNLRQVHLIHGEL HDALQQAGFNVAEGTLGENI TTRGIDLLDLPRDSLLYLGG QAIVRITGLRNPCAQLDRYQ RGLMAAVLERDAAGGLVRKA GIMAVVEAGGDVRAGDPIEV VLPPPPHHRLDCVYGPRNRD RRAVPR >_0011.004167_ YP_104873.1 gi|53716018|ref|YP_104873.1| patatin-like phospholipase [Burkholderia mallei ATCC 23344] MPHALEHARTPFHDLPYETI ALVLQGGGALGAYQAGVFEG LHEAGIPLNWIAGISIGALN TALIAGNPPERRVERLREFW NTICQPAFFPALPAMFEAAL FNSHEYVRTFFTASQAASAV MQGQRGFFVPRFPPPLPGST HPPEKVSYYDTSALRATLVK LCDFDRINSGETRVSVGAVN VGTGNFIYFDNTKTTLRPEH FMASGALPPAFPPVEIDGEF YWDGGIVSNTPLMEVLHASP RRDTLAFQVDLWSARGPLPE SMNEVTERTKDVQYSSRTRF VTDTLQREQRFRNVLRRVLD QVPESIRESDPWCQQAETLS CSKRYNVQHLIYQQKAYEQH YKDYQFGASTMRDHWSAGLA DIRKTLAVKNGLALPDNDAG FVTHDIHRMR >_0009.001175_ NP_243513.1 gi|15615210|ref|NP_243513.1| transcriptional regulator (GntR family) [Bacillus halodurans] MKMNFNKRDPVYLQVIRHFK ERIATGALLPGEEIPSRREL ANNLKINPNTAQRAYKEMEE QGLIKTEGNLPSKITNDPDI LASVRSELIRDAVDNFIAAI KPIHVPIDEVITLLKEKYEK DEI >_0009.000196_ NP_242030.1 gi|15613727|ref|NP_242030.1| transcriptional regulator (GntR family) [Bacillus halodurans] MKLPIKIEQNSRSPIYHQIE EQIKALIVSGHLNEGDPLPS IRALAKETACSVITTRRAYQ NLEQEGYIRTTQGKGTFVAK IGDEIKEQTKFSTVYHTLSM AIETGRRHDYSFEQLLDMVH TIIQQQKEKEGRGEE >_0008.005635_ NP_982152.1 gi|44004484|ref|NP_982152.1| transcriptional repressor PagR, putative [Bacillus cereus ATCC 10987] MTTIQASNEMYKIPEADVEL LKIMAHPVRLQIVKELEHRK ICNVTQLTELLDVPQSTVSQ HLSKMRGKILRSERRGLEMY YHIANSKACQIVSVLGL >_0008.004209_ NP_980242.1 gi|42782995|ref|NP_980242.1| hypothetical protein BCE3947 [Bacillus cereus ATCC 10987] MKVIRISELQMKDIINVSDG KRLGNIGDIEIDMNTGKIES IIISKQARMLGIFGKDIEIV IPWKEIVKIGEDVILVRVNP VNSVTESIQTPTIS >_0008.003192_ NP_978747.1 gi|42781500|ref|NP_978747.1| transcriptional regulator, putative, Mar family [Bacillus cereus ATCC 10987] MTQNELLNEYMNLLLNMSGT FKLLSEKSSEFTHLEQHVVE YIAQQKVPVNLKMIASYLNI PKQQLSVTVRNLEQNGYIVR KQDTVDKRAILISLTEKAKK VHYERWKQIYNNFTHNLEKL SEEDQRDLTYGLYKTNTMLS KMLNNN >_0008.000197_ NP_977538.1 gi|42780291|ref|NP_977538.1| transcriptional regulator, AraC family [Bacillus cereus ATCC 10987] MESYETQIQRSIDYIEEDVM EKQTLRNLARIAGFSESHFH RVFQALVGDTVMEYVRKRRL ARAAYQLSHTDEKIIDIAFE HGFQSHETFTRAFKKLFQMT PSEYRKQEIETPMYSSVNVK QRKLNPYLGGIQMEYRIVNK PEFLVAGYELKTTSKEGKNH QDIPAFWQEYLQKDLGTTIP NRKDTSQWVELGLCTDFNLE TGDFTYIIGMEVTDFENVPD GIAKRTFPSATYAVFTTPKV PHEEMVSSIHQTWNAVFSEW FPHSGYEHCGVTEFELYDER CHEDKSEFAQVELWIPVKKK >_0002.000101_ NP_148218.1 gi|14601677|ref|NP_148218.1| hypothetical protein [Aeropyrum pernix] MGIRLTGSAVKLMSLIDGSG LFIVEGTPILYHKPSSTILF SDLHLGYEQAMAEAGVFLPR VQLRRAILTLDRVVMGLKPK KAVIVGDIKHAFDRLLKQEA LETAKLLEWLSSRGVDKVIV IRGNHDNYIPGVVTKSGGEF VEDYLDVDKGIVAVHGHKKA EFDADIIIIGHEHPAIKINV AGSRVKYPAFLIVPREDGGT IVVLPALGVYQTGNPVTLDR SLYLSPYIREEGVVEEAVPI IIDESVGSLKLPPLRELDKI LG >_0001.001219_ NP_070030.1 gi|11498801|ref|NP_070030.1| molybdopterin oxidoreductase, iron-sulfur binding subunit [Archaeoglobus fulgidus DSM 4304] MSELSKLTFVHDRRKCIGCY ACVIACKVEHSSENFDDPGR IRVFHDGPRITGEKVQQHFK VVVCRHCLSAPCVDECPTGA LRKSEDGMTVLDLDLCIGCK ICMEVCPFGAPQLGDDGKVR IYDLCMPRIEEGKKPACVSA CVAQCLQVKSVEDLKKKSKT RKPEG >_0120.000989_ YP_001301704.1 gi|150006961|ref|YP_001301704.1| putative transcriptional regulator UpxY-like protein [Parabacteroides distasonis ATCC 8503] MPTCILRNDRPKGGLTQAGK RHDLKAVRWYVLTLPTAAGG RRDRISPSKGLDVELSRRER RGEALFEYFAPSYVEVRKVG GKLVNTRRPLLFNYVFIRSS VEEIFRMKQALPLYNFLPRV SSGSTTYFPHLLDQEMANLR WVAEAYSNELPVYVPESARL LKGDRVRITSGYFTGMEAEV VIQPGGGHKEVMARILDCMW VPLFEVKAGEYELIELNAKS KHVYTHLDNDRLSEGLHEAL GRYHVSGSVCEEDRRLAGEV LRGYASLRAETDVMRCKLCA LLLPTYKLSGDEEAFVRLHD TMRGLLPVVKAPQSRALLLV TLYGCTDNALYRRMAHELVD PWQVDPSPKKSKLSLIRRLG DYDRWLGHESVDS >_0113.001633_ YP_001087008.1 gi|126698111|ref|YP_001087008.1| hypothetical protein CD0532 [Clostridium difficile 630] MNYEIVEINEKIVIGISKET TNKDGQAVNDIGELWKKFMG KGIYNAIKGKKNDKTIGLYT DYQGDFTSPYSFVACCEVNS NSDKEKNLEELNTNNTVGES IISKVIPAGKYAKFVIVGGQ KEVGDFWFEFWQMDFDRTYI SDFEEYQCNTFDTKKQEIHI YIGIK >_0113.001220_ YP_001086700.1 gi|126697803|ref|YP_001086700.1| flagellar hook-associated protein [Clostridium difficile 630] MRVSTGMMSSSYLNSLQDNL QRLDKVNRQINTTKEINKLS DNPYKAIKILNSKSEIKTME TYIENCKDTADWLETTDTSL DQLGNLLADIKKGLVSSGNG SYSDDEIKTISNSTNEKMKE IANALNATHEGKYIFSGSNT GTPPVECVENADGSVSLKFN TSLNLNKLNDSLSVDVAQGI SVDYNVKLSSLGFDPTKTDP FEKLNNISKKLANPSDANIK ELTTTCLGDMENLIDNTVNV RSIYGTKANTVDAMKEKNDE GLIQLKDVLSQNDEIDYGEK LVQLKAAELTYQASLQTGGK LFNVSILDYI >_0111.000772_ YP_211614.1 gi|60681470|ref|YP_211614.1| putative MarR-family regulatory protein [Bacteroides fragilis NCTC 9343] MICSLFVDKVNKETYLCAMK QEYISRIRSFNRYYTKILGI LNKYYLGSELGLPEVRIIQD VYLHPDRSSKDISNELNMDK GLLSRLLKQLEQKEYIFRKS TEKDNRMGLVNLTEKGCEVY YRLNTAANQSIERIFSHLED RQLQRLVHCMDSIYKIINSV ETGLTVDNNEPIVIRPIEES DNASIASVLRASVEEHGAPK VGTFYDDPHTDRMFQTFNIK GAEYWIVESNGVILGGGGFY PTKGLPHGYAELSKFHFRPE LRGRGIGKRLLQFIEQRAVS AGYVYMYIVSYHQFGNAVSM YEKYGYEHIDNALDQSGLYQ DAPFHMVKAL >_0110.003092_ YP_001300092.1 gi|150005348|ref|YP_001300092.1| putative RNA polymerase ECF-type sigma factor [Bacteroides vulgatus ATCC 8482] MTDFNREIADLYPWLFKIAR RYCSSVCDIEDLVGDTIYKV LSNKEKFKEGRALKPWCEVI MLNTYITAYNRRSLIRFVGC DNIKEIFSHNQASDDLMVHD IQAAIRRCHNRTCCMDCVVY YAHGYSYKEIGKMVGIPVNT VRSRISYGRELLRNELDLTV K >_0109.000362_ RER070207000363 REr070207000363 MYSIGEIISTYRKKKGLLQQ DLADELAKEGITISYKAISN WERNLAEPSVTIFYKVCRIL GITNMYEAYFGVNPADPFSS LTDEGREKAMDYINLLHASG MYEKQTAKIIPFRSIDIFEN AVSAGTGNFLVDGPKETVHI DESILPEDTTFGVRISGDSM EPEFHDGQIAWVLQQESVAN GEIGIFALNGEAYIKKLQND KDGIFLISLNEKYTPIKVGE NDRLDIFGKVLGKSDASAIT GHCR >_0106.002450_ SWOL_07JAN05_CONTIG99_REVISED_GENE2451 swol_07jan05_Contig99_revised_gene2451 MQAVVLENTNNLSQNEWLLM RKQGIGGSDAAAICGLSRWR SPMDVWLEKTGQLEPEKAGE AAYWGQIMEPIIREEFTIRT GMKVTTINSMLKHRRFTFML ANLDGVVQDPNRGQGIFEAK TAGLYASSEWGDSLPDEYAI QVQHYLAVTGLPFACVAVLI GGNRFKWLYLERDESIIDLI IQLEAHFWRLVQTNTPPPID GSRATTELLNRLYPHGKKQQ IELPAEALDLIEAYELAKEQ EEKAALLKDTAVNELKNMLG ENECGVVQNRRITWQEIKTK RFDSKLLRTEEPDVYARYLK YSSYRRFQIK >_0099.002816_ TFUS_04MAR05_CONTIG93_REVISED_GENE765 tfus_04mar05_Contig93_revised_gene765 MNAQRGIPSSHQNAFRLYGP QFQNKPAELYRQMRTDYGPV APVLLDGDIPAWLVIGYREV THVLNHPETFARSSRRWNAW DLVPENWPLYPMVTRTPNIL YSEGEEHRRRATAISDALSG ADQHEVRQYAVQAADRLIDG FCAASRADLRADYASRLPAI VLGRLYGLDQKHAEVLAEAM TTMIDSGPDAVKAQQFLLQT MGTLVAERRKQPGPDVVSRL VHHPAKLRDEELIPDLVVIL GGGHQPTTEWLGNTLRLMLT DDRFAASLTGARSSVREALN EVLWEDTPTQIYLGRYAAHD VELGGQLIRRGDLVLLGLAG ANSDPQINPGPECRMSQGNQ AYLSFSHGEHRCPYPAPELA EIIVTAGIEVLLDRLPDVEL AVPVDELRWRPSPWMRGLVA LPVVFTPVPPIGGQ >_0093.002702_ NP_341635.1 gi|15897030|ref|NP_341635.1| Conserved hypothetical protein [Sulfolobus solfataricus] MEMDMGSELGYDYRVLKLGG SLITCKDVPRCVKLEVLRRV SEEIRKFVNENPDKKIILLH GGGSFGHYEASIFDDNRIVR TSEAMQELNYIVAKHLLKSG IKAISVPGKFYTFDAVLSAL EKDLVPLIYGDVKFDGSIIS ADDMSIDIAKRLNARLLFAI DKAGIIGRGGGVISELRGID EVSILMQTNYYDITGGILSK IKKIFENNLNALIFDGSKTG NIYLALRGYNIGTLVRGNPN A >_0093.000493_ NP_343455.1 gi|15898850|ref|NP_343455.1| Acetyl-CoA C-acetyltransferase (acetoacetyl-CoA thiolase) (acaB-2) [Sulfolobus solfataricus] MGTFLNRIAIIGLGWYGFKP TTHEVSFREMVFEAASRAYK DAGGINPRSDVDSFISCQED FWEGISIADEFAPDQMGGAM RPTMTVAGDSLQGLAHAAMH INSGVANVVSIEAHSKVSDI LTFSDLEKFAMDPIYIRSIN PPNFHFIAGLDAVKFMHRKG ITREDLALVVEKNKKAGLSS PRASFASNISAVDVMKKNYV VYPLSELDIAPFVDGAIVIV VADEEVAKKLKKDDYLVVKG IGFATDSSNLETAELGKANY MKIASDMAYKMAGVETPRRY FDAVFVDDRYSYKELQHLEG LRVSEEPSKDLREGNFSPQG EVPVNPLGGHMAKGVPLEAS GFSLLLDAIEYIKQGKGERA VVASWRGIPTLTGSVVVVEK P >_0090.000131_ YP_166411.1 gi|56696057|ref|YP_166411.1| transcriptional regulator, AraC family [Silicibacter pomeroyi DSS-3] MSCSALSQGLVFGKSEFPDR GSDTRMDRLTALMERFQLRV EAAAPEDANLLALAGADGTP GELLFYPAGTGRAGDPLFAA RVSWSGESNPFLAALPEVVR FDLSADPEARAVVELIRVEV AARRCGAQSVVNRLGEVLMV RLLRDQIAQGATRPGLLAGL ADPRLSRAIVAIHDHPGRVW SNGDLAAEAGLSLSRFAELF AAQVGETPMGYLRRWRLILA AQDLGRGDRVDRVARRYAYG SPEAFARAFRRAHGVAPMSL RAG >_0084.002826_ SFRI_16AUG04_CONTIG86_REVISED_GENE2829 sfri_16aug04_Contig86_revised_gene2829 MTTVRVAGVGMIPFCKPGKH EPYRAMAAKAIKLALTDAGL DAKSIGQAYGAYIYGDSTCA QHAFYDVIQSGIPIINVNNN CSSGSTALFLARQAVLSGQV DCALAFGFEEMQPGALGSHW TDRESPFERIEPVLKQFNAP QGPIALRAFGAAGRHYMDKY HVGADIFAKVAEKSRRHALQ NPYSMFSTPLTYQQVLDDKL IYDNYLTRLMACPPTCGAAA TIVCSEQFARQNGITSKVKI IAQAMATDTEQSWQDPIFAV GKGMSQQAAQRVYDDAGIDP NDIDVIELHDCFTPNEVITY EALGLCPEGGAAELIANGDN TYGGKFVIGASGGLMSKGHP IGATGLAQCTELTWHLRGQA GNRQVDNATLALQHNVGLGG AVVVTLYGV >_0082.000450_ NP_765873.1 gi|27469236|ref|NP_765873.1| poly (glycerol-phosphate) alpha-glucosyltransferase [Staphylococcus epidermidis ATCC 12228] MIYTVTTTLPLSHGGRTQAL LRRIKLLDEEFKIPSKILTT NYHGNYPSIYKKYRQENKVT ENIQFENMYEWLSNFKLFKV PKTLITRNPKYIKTPRKIKG LIDKQGKKSGLIHYYNNECH VRSRKYYGQSNVLEYEDFIS PTSGLKYERHQYNLYGQLHR KEYYYDDSSLKHSDELFDTE GSMYCKRYFKTKPNSKINGV EIYRNKKLYKTFKNDKLLAQ FYFQNRFKNQDIVFNDARFL DKPLLKQTHQTKNILVLHSS HLSGDQIKKSYRFALNQSKN VYKYIVLTHQQKHDIQQHFH ISDDQFQLVPHFIELDTEVE QDSSNNQNRFIYIGRFSTEK QIDHIIRAYHKFLQSGYQTE LHLFGRDEDNQIPLMNTLIS ELKLSDKVKIFKYTNQPLQE FKNSKASLLTSQYEGFGLTL MESIEMGCPVLSYNVRYGPS EIIQNGINGYLIEKNDIDSL SKHMINIIEHPLQKVKNKDT LKYNAAVNNYKQLMQSLDLL K >_0081.002139_ SDEN_20JUL04_CONTIG118_REVISED_GENE2140 sden_20jul04_Contig118_revised_gene2140 MELDACLIIKFGGSIITDRE SPYTLKRDMCNALIQKVIAL KQQSATPILLVLGGGAYGHP PVHEYGLASARGNGTVSNLQ FSRLTTGLYKLMVDFMQISY ENSLEMHPFQSSSLFMCQDG KVKSVFSEAIEKSLQFNDIP LLTGGSAYDTTLGQLVFGSD RIPELLTKMFKVSKCIFVSD VDGVYEHTGGKMFDEITPEL YPSLSDAIFATGRLDVTGSM KGKVDAAMRLAEMEVSSVIC SAATFLATGVSDICSGNISG THFNSSEKVGLNPKMEELC >_0078.006894_ NP_821782.1 gi|29827148|ref|NP_821782.1| putative taurine catabolism dioxygenase [Streptomyces avermitilis MA-4680] MRINERPGTAIGATVEGFDH ATASDADIAALKSTVYTKKI AVLKGQDLSPQEFLELGKRL GRPETYYEPMYHHPEVTEIF VSSNVPENGKQIGVPKTGKF WHADYQFMPDPFGITLIYPQ VIPRQNRGTYFIDMGRAYDR LPEDLKKEIGGTYCRHSVRK YFKIRPHDVYRPISEIIEEV ERKTPAVVQPTTFTHPMTGE TVLYISEGFTIGVEDQDGEP LDDELLKRLFQATGQLDETF EHDNIHLQSFEQGDLLVWDN RSLIHRARHTTTPEPTVSYR VTVHDERKLHDGIKVA >_0078.004648_ NP_823630.1 gi|29828996|ref|NP_823630.1| putative TetR-family transcriptional regulator [Streptomyces avermitilis MA-4680] MVTSRWTAAPAQTASLRRRG AVLERAILDAALEQLSTVGW NGLTMEGVAAGAQTGKAAVY RRWPSKEDLVVDALQAGLPR FDEVPDLGCVREDLLQLCRQ AREAMFSRPGLALRSVIHEC DSSQADRFGGVILEGVVEPT VKLLRAIINRGIERGEVRSD AANGYVFDAIPAMMMYRTKM CASEWSERDIEELVDQLMVP LLRPQGA >_0066.000166_ NP_904456.1 gi|34539977|ref|NP_904456.1| capsular polysacharride biosynthesis gene, putative [Porphyromonas gingivalis W83] MLNNASYRLKYWLRKAVKWM PAYRKTYREAHVVSSREQQE HLFWELLKYVIVHVPYYRDY QKFLGGNVQIEDLPIVKKED VAKDALSFVSDEFNPEKLLR VSTGGTTGTSTNIYTSWLDG VRQTAYIDAAIDLGRVSNPI ICTLREHDLKATEKYRFWGN RLMLSPSNMNKDSLEYYVDL MRRYRVNILHCYPSSLMVLC KLLQNTQVNLDIEAILVSSE IVSSELKHIVREVFPRATFI NVYAQTENVARGISLNAAPF EFLSCKYLVEFLDTGERRDG NIIAEIVGTNLEKKSMPLIR FATGDYAVLNQRGEVIDILG RTSEYLIDKFGNPIPCIVTN RPHTLDNVLLAQYYQEKIGE FEYRVMVNENFNSNDIRAIE EDIKLNFGDGLFARVVVVTQ MEKTSRGKHRKLVQRLDLSL YL >_0061.003181_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF6481 npun_22dec03_Contig1_revised_geneNpF6481 VKFRDCAAAVMGSPKSECQL SRCHTHHLLPALHGEGVPSL LSGFATCPGLFYATPAADGI LSRIRIPGGIISSQQCRAIA DIADQHGGGYVDVTNRANLQ VREIRTGINSEVLQHLQDMG LGSRNSVVDHIRNIMTSPTA GIDAQELIDTRPFVQDWDNY IAAHPALSGLSAKFSVCFDG GGIIRVCDRLNDILFAAVLV DGNVYFRLYLSVGTKGQPPS DTGILLTPEQCLPVLAALTE VYLAHSNTTNNRRLRLLELL NTLGCENYLQEVQQRLPFSL LCSNLTPQPPSLVGKGEKFS PLLQGEGLGERLDAKYQHIG IHPQRQQGLFYIGVVLPLGR LESNQMKGLADLAAKYGSGT LRLTPWQNLLLTDIPQQWVG DVQSKIAFLGLDISATNIKS ALIACSGNKGCASSATDTKS HALALVEYLETRVTLDCPVN IHFSGCEKSCAQHSKSDITL LGVSIEDGNGTVEGYHVYVG DSEHKFGRKIIYQYVTFAEL PALIKRMLDVYKIQRLNSDE SFGEFANRYTEKL >_0056.000220_ SARO_25NOV03_CONTIG23_REVISED_GENE234 saro_25nov03_Contig23_revised_gene234 MVPFSFAGQEFRLGAARALF WAEESALLVADLHLEKASFF ARHGQMLPPYDSRATLERLA GALRETGARRVFCLGDSFHD AAATERMEPHAAGMLDALTR ATDWVWITGNHDEDARAPGG TLVDELSVRGVSLRHIARKG AAGTEISGHFHPKLRVTVRG RSIARPCAVASENRLILPAF GALTGGMHAGDPAILAALQP ARAIDAIVPAGERLARFALW REAA >_0052.005627_ NP_106770.1 gi|13475206|ref|NP_106770.1| integrase/recombinase [Mesorhizobium loti] MIAHLRAFLRYCGDHGEAAG GLHVIDMPRVYRGELPPRAM DWTMVRRLLASIRRRAARDW RDYAILHLMAYYGLRPSEVA TLRLDSIDWKARTLRVEQRK TRSVLVLPLAGRTLRLLHRY LQVGRPDSVLPQLFLRIRSP IKPLKHYGVITVFTYRAQKS GLSLGGVSSYSLRHAFAMRL LRRGVDVKAIGDLLGHRSLE ATCVYLRADVDMLRKVALPV PGIMVVEGGDHA >_0046.001459_ NP_470489.1 gi|16800221|ref|NP_470489.1| hypothetical protein lin1152 [Listeria innocua Clip11262] MEKEKVVNIQPLMNSQVENQ FTQQIQEDFNKKNPDHSALL HSLETVEVLDVLSDYYTAHE KQKQPKKEAATSSKTSGEHK EIKQAIRYIKKNIHRSITLE EVANYVYLSPFYLSKLFKNE LNINFINYVNEQKMLYAKEQ LEKSDWAVHTIAKNLGFSRA SYFCKVFKKEFDMTPKEYRD SLK >_0043.002519_ JANN_22DEC04_CONTIG27_REVISED_GENE2520 jann_22dec04_Contig27_revised_gene2520 MAQIFELSREKPVRRVEAWQ DIVSRTFVTLACDLPDKTDV TGSIASIGNGPISASLVDSV AQTVDRTRRLIRQDDAEVAL ISFQIEGECLLQQRDRQVRL RPGDFTIYDSTSPYDLQFEE DFCQLVLHMPRDLLRRRLGP VRQICAQRFGSDTGANSFAG AYLKQIALQLTSLETATAEH HIKTALDLVGYAVEQNVQPC GHDVRRQSAAIREQVAQVIS KHFRDDQLSTPSLARAVGIS SRRLQEVCAEVGTTPMNAIW DIRLEKASDMLRSPAFAHLS ITDICFACGFGDSSQFSRRY KTMYGVTPREDRKSGYRNGF SGSVQTLSKSAGRFQRR >_0043.001500_ JANN_22DEC04_CONTIG23_REVISED_GENE1501 jann_22dec04_Contig23_revised_gene1501 MPLKYSDSVVLVGGGPLSPD IFDVVKYRAGFFVAADSGAD ALLAREVIPDAVIGDMDSLS ERARAAIPKDRLVAVSEQDS TDFDKAVRGIDTPLIYAVGF TGGRLDHELAALHVLVRYGH RAIVLVGEEDVTVHLPARIT LDLPRGMRVSLFPLDAVTVG MEGLRWSFKALALHPIHKIG TSNEVGEGPVVLTSDAPGAL LIVPRSALDAVVAGLSRADL HSPQPELSAKT >_0038.000217_ NP_280276.1 gi|15790452|ref|NP_280276.1| 2',3'-cyclic-nucleotide 2'-phosphodiesterase; YfkN [Halobacterium sp. NRC-1] MDLYSAVDPDVATFGNHDFD YGPSRTAAVVADSPQTWVSA NVYHDGDRVAGVEPWTLVER DGTTLGFFGVLDADTPALNP MASDLTVTDPIQAATDAEAA LRDAGAEYVIALSHLGRGDD ELAAATTVDAVLGGHIASER VERLDGTLLTRPGAGGDVVF EIDIAADTVTRHQVDNAPRH DGVTAALRDRLADAGLDAVV GHVTPPMERTERTLFEGESR IGNFVADAYRWAADADVGLQ NAGGVRDGPALAGDVTVADL VSVVPFAEPVSVAELTGREL LDVFRAGNGSGGLGFAEPDW WHAHVSGATLEWRDGDGLVS ASVGGDPVDPDATYTLATTD YLFYSDDEFPALDAGHRIDQ LDTQHEVLASYARQEGINPE TEGRIRFHADD >_0033.003556_ YP_051469.1 gi|50122302|ref|YP_051469.1| pyoverdine biosynthesis protein [Erwinia carotovora subsp. atroseptica SCRI1043] MWHKCALFSPYDLLKTSILS SYKGIYTMENRVNCSIELLS PFGALLTPVEAGQGIATLPI DTLRELAREYHLLVLRGFSS GFSDPETLTEYAGHWGEIMM WPFGAVLDVKEHADTKDHIF DNSYVPLHWDGMYKPTIPEF QLFHCVSAPGQDQGGRTTFV DTTRLLAGADAPLVEEWRRV SITYRIKAVVHYGGEVTSPL VIPHPNGVGEIMRYNEPPTK GERFLNQHALEYHHIAPEAQ NTFSQTLRQHLYDPRYYYAH QWLQGDVVIADNFSLLHGRE AFTAHSARHLQRVHIQGTPI CVNTCIDNAA >_0033.002337_ YP_049353.1 gi|50120186|ref|YP_049353.1| hydrogenase-4 component A [Erwinia carotovora subsp. atroseptica SCRI1043] MNRFVIADPELCIGCNTCMA GCTSVHKAQGLQGLPRLTVV KTEDKTAPLMCRQCEDAPCA RVCPVNAITHENAAIVLNES LCIGCKLCGLVCPFGAITPS GSTPVNTPALLQNMIPEALL RDVPGSAPGTNPFIAWNAGI RAIAVKCDLCDFQDTGPECI RVCPTKSISLVDSQSISTNS QSRRLKTLLSSFHEQGFMSV QQEGDE >_0033.001329_ YP_051461.1 gi|50122294|ref|YP_051461.1| hypothetical protein ECA3372 [Erwinia carotovora subsp. atroseptica SCRI1043] MFPSFLRRTAAKNGSYANEI SSSIIINRPAERLFDLWRKP ETLPILMGHFASIEILNHTD SNWRINTPIGSLIEWQARII DEKPGEYIHWRSLEGARVPN EGRLSFQPAASEAGTAVTLT IRYNPPGGLIGKKIGQMFDM FSRDMLTKTLYRFKKLAEDE RV >_0030.000057_ DHAF_12NOV03_CONTIG1004_REVISED_GENE68 dhaf_12nov03_Contig1004_revised_gene68 MGSLLTAQEVAEMLNLSVDT VWRYTRQKKIPVMELGKKQY RYEKEAVLAALTAGKIPAED LPVKEESFGYAKQGEYTYED YVRIPEEPGYRFEVLEGMLI KEPSPTTHHQRVVFALSRQL ADFFEGFDPEGELFIAPLDV TLTSRNVVQPDILFISGSRR SIMRPERIDGPCDLVVEVMS PSNRRKDRLRKLEIYRRAGV PHYWLVDPEENILEAFVLRD ERYVLIVVSGPGDRFAHPDF PGLDLDLAKVFYRPEYE >_0029.000122_ DGEO_15APR04_CONTIG101_REVISED_GENE94 dgeo_15apr04_Contig101_revised_gene94 MSNSQILGERARILADQLLA VPTTVYQQRSLQAALHLFLD TGTKTALHRAPLVSKSAVSR LLNNYDWDTAACWALLQRSQ WEALLLAARRKRRACLRLSV DLTSIEKTGKQLPFVRVYNE VHGIHLVVLFAEYRGLKFPV GYRVYRGKGTATPVSLALEL LGEVPDAIRKRFRIRVLADS GFEAAVFLDGVRTLGFEFVV GVRATRRTTHPGQVTVADCE HGAWLELQNWPHDTLTLARV ERGERTFFSVASELMTGDEV AAEGGKRWNIESFFKEGKHQ FSLQQFALRTARGLDRWVLL VFLAFTLTMLHRSPDLSLEE AAGLALTLALPFLRLNVIFA RLATDEEFLRQHGYSLKIAR CNS >_0028.003247_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE3255 ddes_06jun05_Contig143_revised_geneDde3255 MAKKKILFITGTRADFGKLC SLIDKVSEHTEFEYCIFVTG MHTLSRYGYTVDEVMKKYSS FRLEGGFRNVHVFMNQVHGE SMDMVLGNTIFGLSRYVSEY QPDMIVVHGDRVEALAGSIV GSLRNILVAHVEGGELSGTI DELIRHAVSKMAHLHFVANG DSRRRLMQMGEAEETIYPIG SPDVDLMFSPNLPSKESVLA YYNIPFTEYAITLLHPVTTD LEQTKKMAGAVVDAMLDSDD NYIVIYPNNDHGSDIILDEY DRLSGEGRIAVYPSLRVESF LVLLRAAKYLLGNSSAGIRE TPCYGVPSINIGSRQDGRFC CSSIINVPGSYDAIVKALAD VRKMVPHPPRFEFGRGNSAE QFIKILMDEKTWLTKPQKQF VDHHFALCR >_0028.001498_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE1501 ddes_06jun05_Contig143_revised_geneDde1501 MSMVINHNLMAQNASRNLST AYGNLSTSVRRLSSGLRVGT AADDAAGMAIRELMRANIAS MHQGVRNANDAISLLQTADG ALSVIDEKLIRMKELAEQAA TGTYTSDQRLLIDSEYQAMA SEITRIANSTDFNGIQLLNG NLSSSTHDGSGVVSRGKMKI HFGTANDSAEDYYYIQIGTA TASALGVGNQADATFGGFAI STQSAAQQALVAIDKAIVSK DNIRAALGVLQNRLENTITN LQIQAENLQAAESRISDVDV SNEMTEFVRQQILTQSAVAM LSQANSLPRMAMQLLGG >_0028.001265_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE1266 ddes_06jun05_Contig143_revised_geneDde1266 MSNIRFIANQKARQQTDKVD VSFLSCAEAEVVRGFFKTFE GYEPTPLVGLKGLAAELGVS SFYLKDESKRFGLNAFKVLG GGYAIAKHICRRLGVPIAEM SMERLCSAEVREALGEVTFV SATDGNHGRGVAWAANRMGQ KSVIYMPKGSAVTRLENIRA EGAQASITDLNYDDAVRLAW KDSQEKGWVMVQDTAWEGYE EIPGWIMQGYTTLALEALEE FERQEGEAPTHVFLQAGVGS FAGGVQGLLAQRYGERRPVT AVVEPELADCIFRSAAAADG NPHFVTGDMPTVMAGLACGE PNTVGWSILRDYSDGYISCP DWVAANGMRILASPRPGDAR VISGESGAVTTGIVECLMRM PDMAPVREALELDGDSRVLV ISTEGDTDPVMYRKIVWQGA YPEIEG >_0025.000037_ NP_282281.1 gi|15792458|ref|NP_282281.1| putative lipopolysaccharide heptosyltransferase [Campylobacter jejuni] MKIAIVRLSALGDIIQSAVV LQFIKNFKKDIEIHWFVDEK FEGILKNHPLINKLYALPLK DKKILKSLKILLKARKNNYN AVIDLQGLIKSAIVSRILSR NNFGFDKNSLKESFAHNFYN QKLELDYNENVFVRYLSLTS FMLNTDFNVKNLAFKQDIFS VDENLKQLLNNKLKLDKNEK NILIHVGSSVENKIYPKTKL AILCKLLINEFQQTKIWLAW GNVKEYEFAKEVLNLSGIDE THIELAPKFNLEELMAFTKM MNLIIGNDSGPTHLAFALNK ASITIFGATPSYRNAFQTHI NKIIDAGKKIQNAKHIDKSD FCITRIEEEDIFKLAKGLLN EK >_0023.002952_ NP_599335.1 gi|19551333|ref|NP_599335.1| Mar family transcriptional regulator [Corynebacterium glutamicum ATCC 13032] MTQDEHPRQADSHFNMLLPD GNENAHQLSVALNQVAHLLA YDADSSIHRPDGLSLASYRI LFSLWTDGPMSPLQVTDKTG MKKSAISNLLKPLLAESLIV QVTAENDRRSKVLSLSEKGT TYIQKTATRQNALESEWFGT LTDIEQDLLESLLRKLLDSN RASKVRKNRSN >_0023.000616_ NP_599274.1 gi|19551272|ref|NP_599274.1| 5'-nucleotidase/2',3'-cyclic phosphodiesterase or related esterase [Corynebacterium glutamicum ATCC 13032] MLKCAVDEAAGGRAQAFVSS GDNIGGSPFQSSILGDEPTL EALNQMGLDYSAVGNHEFDK GYADLSSRVADLADFDYLGA NVEGENPDLAPYGISHLDGV KVAFVGTVSQETPMLVNSEG IEGITFTDPLEATNRVADEL VGSGAADVVVALYHEGITGT EAWSENIDVVFAGHTH >_0020.003006_ CAUR_25MAY01_CONTIG969_REVISED_GENE4515 caur_25may01_Contig969_revised_gene4515 MWPTPLAFSLAIEQLAGVTA PLRAQAVRVMLAELERATSH INTLAALATALGLTDAARLS RLTTRLGLIALKLSGNDEAA LIRPGGLLADPSESALAETA QAINDLLPRMIAVAEQSIPR RSLLARTVDIGVLQPTAARQ FGLAGPLARASGVATDVRIQ EPYAAYNILAPALVTEEGGD VHSRAVVLLLEAIESLRLVV RWLQNLPIGNVYEEVPLVAG EATVTVEAPRGSLRYTVRSD GQRLTGVEIGIAPQLDRLLA RSLLQQAAVDDVMLIAISTD PCTACWQSATYLIGRS >_0020.002217_ CAUR_25MAY01_CONTIG813_REVISED_GENE3319 caur_25may01_Contig813_revised_gene3319 MRILYILPRYDSAAMGNRIH TEVIHAWRESGITAEVLSLA ANQAQPTRTVEDDIVVHRLP SRGTFVTQVVNRVVNPLFSY PYLASAIVGLRRFLATTPPF DLCHIETAFPLGVAALFAGR YAPPLAVTLPGADVMAEPEY DYGYARFRAVRALLPLVFRR ALVIRADSPQIRELAIVRGA PASKVTAIPYNITEDSFPPA DMPLSEMRARSRAIIATRHG LDPERPIIVSLNRLHPFKGI AYLVEAVPTIRAAGLAPQVV IVGPNRSTPRFGDYGEFLRR RAQALGVAADVIFTGAIPHH DARTYLAGADVAVVPSVAES FSRVVIEACAVGTPPVVTRT TGASAYVAAAQAGIVVEPRS GPAIGEAIVSLLKDRAVWQA YSARAAAHGTAVFFTADRYR TGRSLSGTPLRSKAFPETVA TGR >_0018.008151_ BFUN_06OCT04_CONTIG482_REVISED_GENE8152 bfun_06oct04_Contig482_revised_gene8152 MTQSDPLHPANGAAAVPPVE SFRSRDFLLSHVQDTLRFYA PNVLDPSGGFFHFFRDDGTV YNRTTRHLVSSCRFVFNYAM AYRQFGDPQHLEYARHGLRF LREAHWDAQHEGYDWELEWH DGRKRTLDATRHCYGLAFVL LAYSHAAMAGIEEAKPMIGA TFELMEHRFWDAAAGLYADE ASPGWQVSSYRGQNANMHTT EALLAAHEATGHLVYLDRAE RVASNITLRQAKLSQGLVWE HFHADWSVDWHYNEEDSSNI FRPWGFQPGHQTEWAKLLLI LERYRPLPWLLPRAIELFDA AMTHAWDEDHGGLYYGFGPD GTVCDHDKYFWVQAETFATA ALLGKRTGNERFWDWYDEIW RYSWAHFVDHKYGAWYRILT CDNRKYSDEKSPAGKTDYHT MGACYEVLAHALPNGAAAAS ESAEQTK >_0018.006734_ BFUN_06OCT04_CONTIG482_REVISED_GENE6735 bfun_06oct04_Contig482_revised_gene6735 MLIVNVSTMTTAPLNGADIP SRREGIVNAASETFLRYGYA RTTMADLAKAAGLSRPLLYL EFQDKADVFRAVAEQVAREL ERRIRAELDALPNLRTQLNL ACEIWVVETFDLISAHPDAK DFFDLGPELLGQSYDAFEVL IADILSRAGAKQARALARLL VSASKGFKESASDGKDLRKL LQLQIDALVGYIERNAHAAA APHAPCVRRTRRQ >_0018.003281_ BFUN_06OCT04_CONTIG481_REVISED_GENE3282 bfun_06oct04_Contig481_revised_gene3282 MSLNVEAAHPFIAARIHGLD LSKPLSDERIVEIEQASGQY PVLIFPRQYIDDDQLLAFAA GFGPLQVAVSYSTRPEDHRL APMINDISNLSKENQTYRPG DRRRMNNLTSRRWHSDASYL PLPARYSFLLSYIVPAVGGQ TQFADMRAAYDKLPDHLRKV VEGLSCHYDIMASRAAAGFY DASDEERKALAPCIHELVRT HPISGRKSLYLSSHATHVVG WPEPEGRDLLRELTEFATQP QFVYSHEWSVRDLVMWDNRA LMHRGRPHIPETDVREMHRA TTLDDRTWTRGSNQPVAASV A >_0018.003277_ BFUN_06OCT04_CONTIG481_REVISED_GENE3278 bfun_06oct04_Contig481_revised_gene3278 MTLRVHARVSKLGSRRIAAR MPRDASDWRTQAPITSAPAD FDQRMRRTEQETDLAHIDAA QVTRIAHIADALLGARRSNS VLPMLAAQDASVSMPEAFAV QSDMTRLLGQPIAGWKVGYV RDVRLTHAPVYRDACVASGA RVRLTSTLAPAVEGEVAFVL SRDLPPRASPYTPDEVADAV SSVCAAIEIGAPRLGNFMEA PLEHKIADNMGNCALIYGSG QRNWRDIALESLRVTLAIDG GVVTETHGGNAAGNPFDALV ALANAPHKREPLRAGHVVIT GTCTGIYPARAGMQARVAFE GLGEVSVTLE >_0016.004141_ YP_440670.1 gi|83718709|ref|YP_440670.1| gp47 [Burkholderia thailandensis E264] MSHVSPVFSHPRPGQVRLSG RIVLRGTMSAVQPDGPDTRR PALKLVKTQELSRDDWLAVR RTGIGGSDAAAAAGLNPYMS TLELWMDKTGRAEGLAGPDP NDTTSPTYWGTLLEPIVAAS YTKQTGNRVRRINAVLRHPT IPFMLANIDREVVGCREVQL LECKTAGEYGARLWRAGVPE YVELQVQHQLAVTGKQAAHV AVLLCGQALEVYRIERDDAL IGRLVELEARFWRYVESDTP PPADGSESADRALRHLYPGN GGTVDFTDDRQLSSVFADLV AVRAQIETHQAIEAQLKQAI QQAMGEATRAVFETGAVSFK RSRDSSTVDLARLLADHPEL EQQYAGSKPGTRRFLISA >_0011.004391_ YP_106522.1 gi|53716073|ref|YP_106522.1| 4'-phosphopantetheinyl transferase family protein [Burkholderia mallei ATCC 23344] MTNRMTNASTTLPTTNASPV PPLGERDAHVWYARTAACDT PALRERYRALLSAEERERLG RFAFDHLKLEYLVTRALCRT VLSAYVDAVAPEQWRFRANA HGRPEIDAGDARPPLRFNLS NARSIVACVITRTADAGIDV EERARSNDLDGIAASHFSAS ERAAFFALPPDARRTRFFEL WTLKEAYIKARGVGLSIDLG EFSFALPAQPVRIAFDRHVD DDASHWQFALLDVGAEHQMA LGIRDPHAAARPFDITMREI VPEPAASARCAAALD >_0009.000822_ NP_242718.1 gi|15614415|ref|NP_242718.1| siderophore (surfactin) biosynthesis regulatory protein [Bacillus halodurans] MCLSSNVNQHNDTTVVVGTI SSLHSRKEELYSYLSSDERQ RAERMKSSVYAERFKLIRGY LRFLLSTVLALPPNQIHFTY GKYGKPIVENNDYFFNVSHA KDYFLIGLHETAVLGVDIEC PRPFPPKVHPFFFHQDEINL LASVDPDQKMRLWLSLWTRK EALGKAVGEGLSSNIGKQSV LSDTIHYNGREYVLLTQHDP SYVKTICLEGKSVQ >_0009.000473_ NP_242362.1 gi|15614059|ref|NP_242362.1| BH1496~unknown conserved protein [Bacillus halodurans] MKGIVYIGHGSRVEEGNEQL RAFVKKAIERKRDIPIQTIG FIELAEPSIEEAIDECVAMG ATDIAIVPVLLLAAGHAKVD IPQEIERAEKKYPEVSFSYG RPFGVETVIVDVLKQRLEAV GLKHVNGLPAYDDREEATIL LVGRGSSDPDANSDLVKISR LLWEHVPVMEVEVCFLAATR PSLDEGLEKVIRLAGKKVYL LPYLLFSGVLMNSLQAKVNE LNETLLNKEFLLCHYLGFDD QLVNILVERVDEVLEHKISG NPDLYHDRLSKQANGVEGL >_0008.000665_ NP_978113.1 gi|42780866|ref|NP_978113.1| hypothetical protein BCE1794 [Bacillus cereus ATCC 10987] MDEIIKELQKLGFSQYECKA YIGLLKHSPVTGYEVSKQTG VPRSMIYEVLGKLMDKGAVH IVPSEPVKYVPVPATELMNR MRKDFEKSFEFLDEKLNGLE QERQIDVISHIRSNDRVLKE ICNIINRAKEELWISVWEDQ VHEIEPYIHQKEEEGVHIFS ILFGAPETKIGATFHHNYMT PHVVEKRMGGHLTVIARDGE EVLIANFSNDSTSWAVTTYD PALVLVATEYVRHDIMVEEI TQEFGADKLDMLWRENIDLV HVVTGKRTLEGMEDDTDE >_0006.001526_ NP_890104.1 gi|33602544|ref|NP_890104.1| putative decarboxylase [Bordetella bronchiseptica RB50] MHPASNSAVTRDSAAVMSRY DLTTRLVGRLDDAAVIGGIG STNHDLWASGQRPQNFYMLG SMGLAVPIALGVALAQPSRK VIALEGDGSLLMQLGTLGTV AACRPANLTIIVFDNGVYNI TGGQRTLTSHTIDVVAMARG AGLRNSHWAADPAHFDALVE QALRGDGPSFIATRTDRQLA RMFSEFNPTKIRQGFSAGLG VA >_0004.004571_ 17743322 gi|17743322|gb|AAL45603.1| glycosyltransferase [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008689|:1987626-1988825, Atu4809 MKLAYFVIPHIGGTYSVFKH LRKGLAAYDIDVRWLGVCKE TYGLPLDLQGETAFGQLLRM PVNLSERDCAARMAAAIENG GFDGVIVNVLGDQLQTNIAR YLPEGILRILVVHNISPGTY AAARSVRDYAHVTIGVSERC RADLVARNGFPKDRTYAIPN AVDSDAFRSQAVRRVNRGPE LKVLFVGRIEDASKGVMWLR EILDGSPEAVRLTIVGDGPD MGKLRRRLASHDDRVSYAGS VQLSDIPVIMASHDVLIMPS RFEGLGMTMIEAMAGGCVPV VSHIRGVTDTIVEPGRNGFL FPIGNYTAAANAIGRLHADR DLLERMSIAGKEMVLNRFSI ERMAARYNEVIAMTLNDRPG LSTVLRMEDWSIPAGLRPGL RTYLPLPLKNWLRVVRERL >_0004.004461_ 17743201 gi|17743201|gb|AAL45493.1| N-ethylammeline chlorohydrolase [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008689|:1860497-1861990, Atu4699 MSDTLNSLQLGERPEGRTLL TASWVVGHKDGSHRLLKNGE VVFENGEILFVGHGFPGEVA RRIDFGNALVSPGLIDLDAL SDLDTTILGIDHHPGWAKGR VWPRSYVEAGPYEMYSAEEL AFQKRFAFGQLLLNGITTAA PIASLFYRAWGETVAEFEAA ADAAGELGLRVYLSPAFRSG GMVLEEPGRMFPVFDEERGF QGLKEAIAFIEKQSGRHGDL VRGMLAPDRVETSTLGLIER TDAAARELGCKFRLHMAQGV MEVDTVRKLHGSTAPVWLAK AGLLSERLIAPHATNATEED LALYAENGVSIVHCPLVSSR GGSTLSSFSSCRKRGINIAM GTDTAPADMFMNLLVGLITC RINDGAPDQIRCADLFDAAT LGGARALGRSDLGHLSSGAR ADIAVFDLDDAVMAPSVDPI TTLVTGGSGKVTRAVFVDGR LSMRERQVAHIDMRRAREQA QAQFDRLIAKYPERSWANPP VSEIFPPSYQVEVAQHG >_0003.004089_ ARTH_26JUL04_CONTIG47_REVISED_GENE4095 arth_26jul04_Contig47_revised_gene4095 MKRIGFLSFGHWGNVEGSRT RTARDALLQGIDLAVAAEEL GIDGAFFRVHHFARQQASPF PLLAAIAARTSRIEMGTGVI DMRYENPLYMAEEASATDLI SGGRLQLGVSRGSPEPARDG AAVFGHVPADGESPADMARR HTAVFRHAITGAGVAERDPR YGGGPGKLPVQPNSPDLARR IWWGAGSRATAVWTAEQGMN LMSSTLLTEDTGVPFDELQA EQISMFRAAWAGAGHQHTPR VSVSRSVLPIVDEEDRRYFG LSALREKRDQVGIIDGLVAR FGKSYVGEPEAIAGELAADA AVQAADTLLLTVPNQLGVDF NAKLLGTVARHIAPAIGWQP KWSPAAALPAAGASGAEASR G >_0003.003346_ ARTH_26JUL04_CONTIG46_REVISED_GENE3351 arth_26jul04_Contig46_revised_gene3351 VERPAKGRRRLAAAVRTAGR PAVTALRVPVATVLPEDAED ALLIGRLWDVDTSGPRVVAV QGGDVFDLQHLAGTVSELLE RADPAADVRAAMSTPRWKTA DVVSASLRQDITRPHLLAPV DLQVIKACGVTFVDSMIERV IEERCGGDAGRATEMRDLVG KSLGGSIGEVRPGSPEAAKA KRVLMAEGLWSQYLEVGIGP DPEVFTKAPVLSSVGLGAGV GIPAFSSWNNPEPELVLIAT SGGKVVGATLGNDVNLRDVE GRSALLLGKAKDNNASSALG PLIRLFDGGFTLETLRQEEI LLRVEGADGYRLEGRNTVAR ISRPFEELVAATFGSHHQYP DGFALFTGTLFAPTQDRHEP GQGFTHKMGDVVTIRSRHLG ALVNTVGAAEELPPWHFGLR QLFAYLAGEERSASAAGAAS VS >_0001.001102_ NP_068848.1 gi|11497628|ref|NP_068848.1| hypothetical protein [Archaeoglobus fulgidus DSM 4304] MNAEQLMEERRQRIEDVVKG KEPDRVPVTGATTVWHGSYA GYTAKEVLFDYEKCKDAWLK VAKDFDFDSFTVVGGLEGMI YTVALLEQPDMSAAARFILG PTHLVLQDVYTRWPGYELEE NAHPQFIGKEIMKVEEYDQL IENPLEFINKVAMPRINRKL ANVGSAEYNAALAKYGAELA RFGAFMADVSMELAKLGYPT IPMSWAYAPLDLISDFLRDI KNMVMDLYRYPDKVKEAVEA VKPLIIKAAEVSAPPKEIRK QVFGTDVVECFYPLHLNEYL SPKLYNEFYWPYLNEVLHKV ADMGQVNFVLFEGRHDAHLE TLLEAPKKKVVGVFEKTDPR KVREVLGDHVILVSGPPNSL LIGGTPQKVEEYMKSLLEDC KEGGMMIWPGVDGGISRDAR PENVKAVIEAVKKYGTY >_0120.001963_ YP_001301494.1 gi|150006751|ref|YP_001301494.1| biotin carboxyl carrier protein [Parabacteroides distasonis ATCC 8503] MRGGMIMSKALATYFATVND IPDTEFKVEILEDGPIKKVS VNGTVYDVDYNLGGDTIHSI IMNHKSHGVQISSVGDSTYE VKNKGDYFQVQVIDELKKLR LSRTSSKTVGRQVIQAQMPG VIQKVYVKVGDEVKAGDPLC VLVAMKMENEIRTPIDGVVK EVYVNETDKVSVGDKMLVVE >_0120.000996_ YP_001304148.1 gi|150009405|ref|YP_001304148.1| putative polysaccharide deacetylase [Parabacteroides distasonis ATCC 8503] MILLSFDIEEFDAPLEHGVE LPFEEQMRTSVEGTRKILAC LARHRVKATFFCTANFALHA KDLILDIQKGGHEIASHGFY HSSFETADLRKSKEALEELT GQPVNGFRMARMMPVEEEEI HKAGYLYNSSLNPTCIPGRY NHLGQPRTYFMKDGVLQLPA SVTPIVRFPLFWLAYHNLPA TLYRKLALWTWKEDGYFLTY FHPWEFTSLSDRKELKLPFI MTNHSGCGMERRLDALIRFF KDKRAPFGTYTQFSQEILSK SHGQE >_0120.000747_ YP_001303543.1 gi|150008800|ref|YP_001303543.1| conserved hypothetical protein, putative phosphoesterase [Parabacteroides distasonis ATCC 8503] MIKVGLLSDTHAYWDDKYAE YFKDCDEIWHAGDIGSDLLA AKFEALKPFKAVYGNIDGQA IRLQYPKVAHFKVEDVNVMM THIGGYPGRYNPEIRKELYD TRPNLFISGHSHILKVVFDR SLKCLHMNPGAAGKSGFHQV RTLLRFVIEGKDIKDLEVIE LGNRAL >_0116.000855_ YP_193708.1 gi|58337123|ref|YP_193708.1| cell-division protein [Lactobacillus acidophilus NCFM] MERRQASKFYPHFNQEERPV IDYFTGLFNQLIFKHEPILT SFLDPGKRNILKTIVGNDAF IQEYGGYANAEKKRVYLSEE WVNLRPDNYQIQPYEIEYPQ KFVKITHSSILGTLANSGIN TDAFGDIITDDNGTWQFFAK KELTDFFEEQVDRIGRTQVK LKPISFKNVIVPEDDSIEKT EIIASMRVDAVLSGISKQSR GQIQKMIDSKLVKLNWHDIT GSNIMVKEDDVLSLRHFGRV LIENVSATRKGKYKVVLKLW QTKKCN >_0115.003687_ YP_001336987.1 gi|152971878|ref|YP_001336987.1| putative pyruvate decarboxylase [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] MSEMITVGDAIARTLEQYHV EAIYGVISIHNLPIADAVGQ REKIRFVPARGEAGSVTMAD AHGRFSGLGVALTSTGAGAG NAVGALVEAMNAGTPLLHLT GQVEKAWLDADTGFIHETRD QLTFLKASSKRAYRISNANQ AVAILHKAIQEAQTPPCGPV SVEIPIDIQSAKIPLSLLTA PLKRAPAVEPEASLVDALWA QLKQAKQPLLWLGGGALESG EAVKTLADAGVTVISSTHGR GILADSHRASLRAFHNSPSV EALISQCDFTLVAGSRLRSN ETRSWTLELPTPRVQIDIDP AAASRNYLMDNTLVADCRAL LAALAARVQGRIWGDARWDS QLKEAVEAAERGLRDQCGDY AKLNDAIAQALPDDGILVRD ITVSGSLWGSRLFRAHGPLM NIHSLAGAIGMGLPMAVGTA IANPQRKVVGLVGDGGLSLN LGELATLAQEKANVTLLIMN DGGYGVMRGIQDKYFGGRQY YNELHSPDFTLLAQAMGLQA WSVDRAEDFQAVMTEALAMP GPSVVEVKMGQIGALRFAGP PQKTLY >_0115.002743_ YP_001335512.1 gi|152970403|ref|YP_001335512.1| putative bacterial regulatory protein, MarR [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] MPDTPLIQQIRTASRLMVRE LGFMSTTLAATHYSPSAVHT LLEVSMRGEMTAAQLVTLLG LEKSSVSRVVSRLLAAGELE ERPCAEDARAKSLALTAKGH DTVAKINAWGTRQVVEALDH LDETQQHTVATGLAAYARAL AQCRDSALADTAPQISLMTG YQPGMIGRIAQMHGEYYARH HDFGAFFEGKVASGVAEFAT RLSSPANQIWLAIREGKIVG SLAIDGEDLGQQEAHLRWFI LDDSCRGTGIGRRLLSEAMA FCDSRQFSAVQLWTFKGLDA ARKLYESFGFTLIREWQGEQ WGKVMTEQQFTRSGNIG >_0115.000567_ YP_001335539.1 gi|152970430|ref|YP_001335539.1| putative transcriptional regulator, TetR family [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] MHYLQRDARREGIMQAAMRL ALRGGFAAMTVRQIAREAQV AAGQLHHHFTSIGELKAQVF IRLIREMLDMPLVAEDASWR ERLFSMIGSEDGRLEPYIRL WREGQVLADSDPDIKAAYLL TMNMWHAETVAIIEQGLASG EFRSAEPAADIAWRFIALVC GLDGIYALDAQALDEAAFSR YVNKMITLELF >_0113.003678_ YP_001086936.1 gi|126698039|ref|YP_001086936.1| TetR-family transcriptional regulator [Clostridium difficile 630] MSELSKRDKEKIQRENEIID KAEKLFCLNGFDNTTMNELA KEVEYTKRTIYKYFSCKEDL FFAVVLRGYKRLWDNVKIES AKGKTGFEKIKLSYFAFHKF YSVEPSLLCLMGMIGIVNSE KSDTDMLFKEKFFSFNKFMF DEIQGMFEIGKNDRSIRHDI EIPTLMYSSIFTLTGFFNLL SVTGKSYLNNFNIDEEKFIE TTLALLIDSIKA >_0112.001293_ YP_910275.1 gi|119026430|ref|YP_910275.1| probable sugar kinase [Bifidobacterium adolescentis ATCC 15703] MTQEQVAVTAEKIRAGKTSL GIEFGSTRIKAVLIDDAYHT IAAGDYEWASHLEDGLWSYT QEEIWKGLQTAYARMAGDVE TAYGERLTHVGHIGFSAMMH GYLAFDKSGELLVPFRTWQN TNTHEAHEKLSELFQYNIPE RWSVAHLYQAVLNHEEHVSK VDYITTLAGYVHWKLTGEKV LGVGDASGMFPIDPATHTYE TEFIKRFDAIEEVAAQPWKL ENLLPRPLVAGTPAGTLTAE GAKLLDPTGTLQPGVVLAPP EGDAGTGMVATNSVRVRTGN VSAGTSIFAMVVLEHKLARL HPEVDLVTTPAGDLAGMSHA NNFTSDLNAWVGLFGQFAAA IGQPVDAGTLYGTLFRAAIA DDVDADCGGLLNYPFRSGEF LAGLPEGRPLFARSPEARMS LGNFMRTQLFSAFSPVKIGM DVMTKDEGVAVDSLVGHGGI FTTPKVAQKILAAAFDTPIK VMATAAEGGAWGMAVLADYL WHADQPLADYLDTRVFADAA STTEAPDANDVAGFEAFFDR FRKGLPIEHAAIESIPLETK >_0110.001163_ YP_001300047.1 gi|150005303|ref|YP_001300047.1| glycosyltransferase family 4 [Bacteroides vulgatus ATCC 8482] MKVLMFGWEFPPKIYGGLAV ASYGITKGLSLQGDMETTFC LPKPCGDEEKFLNIIGMNQV PIVWRDVDYDYLKSRLSTST PEQYYAFRDHIYSDFSYMHV NDLGCMEFAGGYPGNLHDEI NNFSIIAGVVARQQEFDIIH AHDWLTYPAGVHAKLVSGKP LCIHVHATDFDRSRGKVNPT VYAMEKNGMDHADCIMCVSE LTRQTVIHQYHQDPRKCFAM HNAVYPLSQDLLDIPRPDHS KEKVVTFLGRITMQKGPEYF VEAAALVLKRTRNIRFVMAG SGDMLDAMINLAAERGIADR FHFPGFQRGRQVYEAYKNSD VFVMPSVSEPFGIAPLEAMQ CGTPSIISKQSGCGEILDKV IKTDYWDINAMADAIYSICT NPSLFQYLQEEGKKEVDGIT WEKVGLRIRALYEQVLKNYG K >_0110.000965_ YP_001299680.1 gi|150004936|ref|YP_001299680.1| glycosyltransferase family 4 [Bacteroides vulgatus ATCC 8482] MKILQINVFNYRKGGSEVVY FSTMELLRMHGEEVVNFALR WPENYPSEYESYFPESKETR SELLKPVKNIINYFNNREAA RKLEQLIEKERPDLAHMHLI WGQITGSILPVLKRHNVPII FSIHDYRIVCPAYTFRNGKG EICEQCRGRNFYHCVINKCT KDSYLLSVMMAIEQCYRNRF FNPAEYIDGLIYVSQFAKQM HEKYMPELKEKRNIVLYNLA DKILDAPAQKTDRYFLFFGR LSYEKGVKTLISAFKDMPNC NLKIAGTGPLEDELKDYTKS NNVTNVEFLGYKSGKELTDL VENAYFIIVPSEWYENNPMT IIEGYAAGVPVIGTNIGGIP EIIEEGVTGYLFTPANSVDL TRVVKAADSLPVEQYSNYQQ NALSFARKHFDREKYYPQLI GFYNQLVK >_0109.003050_ RER070207003051 REr070207003051 MKLMKTTEAVGQVLCHDITQ IIPGVKKDAVFRKGHIITKE DIPVLLSVGKDTIYIWENDE TMMHENEAAEVLYRMSACGT NSNETDGERHCEAAESGAFG GTASKMHPSPVKEGKIEVIA DCDGLLKVDSEKLKKVNSFG EMMIATRHGNTTVKKGDKLA GTRIIPLVIKKDKLEAASHI CDDGPILDIKPFVVRKAAII TTGNEVFHGRIQDAFTPVIE KKIAEFGAQMMFHEVFDDDD QKITDGCLRAIEAGAEIVFC TGGMSVDPDDKTPLAIKNTG ARIVSYGSPVLPGAMFLLSY YDAGDRLVPICGLPGCAMYN KRTIFDIVLPRLMACDMILA DELAGLGEGGLCLNCDVCTF PNCGFGKGF >_0109.001880_ RER070207001881 REr070207001881 MTRVYFVRHAQPEHDWEEDR TRPLTEEGKKDSAIVLEFLK DKKIDAFYCSPYKRSMDTIA EAADFFTKEIITDERLRERE KGLDGNNHGMFRKRWADHNY HEEGGESIAMVQKRNIEALN EILSDNTDKDIVIGTHGTAL STILNFYDNNFGCEDFLRII DWMPYIIELDFESSCNELEF ECPSYSFKLIGKQEHCYIEK EFKGK >_0109.001671_ RER070207001672 REr070207001672 VNTFLSPLSLERETECLKKV AQNDKEAKDELILHNMRLVA HVTKKYAVSEDEMEELISIG TIGLIKAVSSFKADYGNRFA TFAIRCIENEILMHFRSRKK SRGDVSIFEPIGTDKEGNQI HLVDVIENGQSDVVSDIEVS ESLKILRQNMRSVLSDREYY IITKRFGLDGEKELTQRQIA KTLSISRSYVSRIEKAGLKK LRKLLE >_0108.0001669_ YP_460136.1 gi|85857934|ref|YP_460136.1| 2Fe-2S iron-sulfur cluster domain with dehydrogenase [Syntrophus aciditrophicus SB] MVNLVIDGKKIEAETGTTIL SAARDNGIVIPTLCHHESIE ASGACRLCVVEIAKGGRKRI VTSCIYLVEDGIVVDTKSER VLGVRRLVLELLMARCPESE VVRNLAEELGVKTQKRFKPD KDKGKCILCRLCVKTCEDIV GVSAIGLSFRGKDKVVGPPF TEDSKACIGCGACAYVCPTG HIEMTTGNGGAERTIWGRTF KMAQCSVCGQYFAPEDQLQY ISNKTGVPMENLTVCMNCRL >_0108.0001016_ YP_462835.1 gi|85860633|ref|YP_462835.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB] MTETIDARGLACPQPVILTK KALEHCDSLTVRVDNIAALE NVKRMAKSQNCRVTVKQAQD GTWTLELTRESGAKEVEEGS ELQDIVCDVAGKEESEPSLS GPSVIVISSDCMGQGDDELG RLLMRGFFHTLPQLDRRPDM IIFYNTGVKLTVKGSEVLED IGQLEQAGVEILVCGTCLNF FNLTDQLSAGKISNMYDIAD ALTTAGRLARP >_0105.002057_ YP_055077.1 gi|50841850|ref|YP_055077.1| putative PTS system [Propionibacterium acnes KPA171202] MPTPRLIVADVTATSQNDLY AQVNDRLTADNMVKSTFLNA VSAREKKFPTGLDFGYVSIA IPHIDPEHVISPGLLVCRNS ASTTFHAMDDPERNLDVQLS IWPLVTDPGNQIDMLEAVIT LIQDESSYRILLHGSHAEVA NTLPDVLATVED >_0098.002292_ YP_316233.1 gi|74318493|ref|YP_316233.1| iron-sulfur cluster protein [Thiobacillus denitrificans ATCC 25259] MSELKPMSDNTPESAERRRV IGGLAAAAAGLAIAPGIKLI EVAQARPEEQGASSAVRWGM LVDATKCATGCDDCVSACST ENGWTPVAESAKPRQAPQWI RKLELQDPLTQMETNLPMMC QHCAEPPCVDVCPTGASFKR ADGIVLVNRHTCIGCRYCMM ACPYKARSLVHEPLNDTQKA DVPRGVGCVESCSLCVQKVD RGDGTTACAEACSAAGHNAI VFGDLNDPDSEISKRLRELP TKQVRADLKLNTGVRYHGI >_0096.001934_ SYN_PCC7942_21JUN05_CONTIG52_REVISED_GENESYN_PCC79421887 syn_PCC7942_21jun05_Contig52_revised_geneSyn_pcc79421887 MTNFPPLLLAYGLDLEGPIH RGIATYAKAMIRLLHQSGYP LHLLTGAGGLAQGRLASLSL HRHLDQPSNWSRIQLAQSYF AERLGRSRQTPIKVDSNLII GDRIQYLQQIDQLINHPFFY RKLRLQAKISDRPFFLKTKN YPVVLTSSPLNLRVSSSSRL IQVIHDLIPLNYLRHPDEAV TFLRRLQATADYSDRIICLS ETTRQQFLELFPQAESRTLT LYQPVSFSAEALALTDRPDF AAAVLRRYGLERDRYFFFVG AIEPRKNLETLIAAQQVAVL RTQLPLVIGGTADVHTQDYA QQLQQRSRSTDQILWTGYLT EAEKICLLRHCRALTFLSWQ EGFGIPMIEAALCGRSSLLA NIPVLREVMGAAATYANPQH PLEVADQLIQLGCYPAQLSE LRSRATQQIQAFADGQVQQQ IQQVLQSLIS >_0094.000253_ SSUI_28JUL04_CONTIG162_REVISED_GENE255 ssui_28jul04_Contig162_revised_gene255 MLIIRSRGENLDSNKRQKIL NLLGLAQRAGRLVSGEDLVV EAIQKGQAKLVFLAEDAAGN LSKKVTDKSHTYQVEVVTVF STLELSAAVGKARKVLAVTD AGFTKKMRSIME >_0093.001404_ NP_342484.1 gi|15897879|ref|NP_342484.1| Conserved hypothetical protein [Sulfolobus solfataricus] MFMMKLDELEKELGSKLIMD NNVIDHYSKSPYLVSPVLSK MGKRVLGVVIAEDRYDIEFV VKFCDIHRIPLLARGAGTST IGQVLPIHPSIVLDIQKLNK TMEYDEKYLKISPGVKVLDA LNYLRKRGKELQVYPSSFYI STLGGYIAGGDVGIGSYQYG YHFHNGGIRRVKVVGSTGTF ELTGENTLAVAQAAGTTGII VEADIATVDYEDWKDQLVRF DELGKLVKFLKDVENERDKI RRITIEDQEALSLVAKNRAI PGKWNVILASTKSFGEEVEM KFLDELAFAAIYVTMSRLTN FSEYFYEVRLLSLDSFLKVV SQVKNALGSNVLIHGDVMTL RGETVIYTVFISDRQNFNII DSIMTKEGIPFEIHSLVVND RVDEEYRLELMKKYKRIVDP HNILNPGKLRI >_0090.003768_ YP_166107.1 gi|56695756|ref|YP_166107.1| glycosyl transferase, group 1 family protein [Silicibacter pomeroyi DSS-3] MRITFLSPRSNLSGGLRVIA IYARMLRARGHEVTLVTPAR AQPGRRQKLRALLRGQASNS PEPPGHLDTLDLPVIETARA DFHVDPDEIPDADVIVATWW ETAFAAAAMPPEKGRKFYLI QHHEVHDFLPWQISRATYYL PLTPIVISNWLDGILRRTYG RDDARLVPNGIDLTQFHAPE RGKAARPTVGFLYSPHPIKG SDTALAAIALLRQRFPDLHV VAFGAEPVTPDLPLPPGADF HLRPAQERIRDIYAACDVFL CASTAEGFFLPLLEAMACRT PLVSTRVGAAEDLIEPGVTG YLADVGDASALAEGAAHILS LPDGEWRAMSARNHKIAQGQ SWDHACDLLESVLSGEGRS >_0088.002579_ NP_717372.1 gi|24373329|ref|NP_717372.1| transcriptional regulator, AraC/XylS family [Shewanella oneidensis MR-1] MNTEHAAYQIANELGGLELL NAHYHKQNFSRHTHEGYTVG VIETGAQRFYRTGGNHVAPK HSIILVNADEVHNGCSATED GWSYRAMYPVPAQFANINKE LASNRGAPYFPNPVVYDPQL AELLRLTFTTLDTSDNRLLR ESLVYSSLTQLMARHGRNYP CEHLGANAKPALALVKSFID DHPAADISLEDLATLAGLSP YYLLKQFQHYYGLPPHAYQI QARVRLAKAKIKQGTRLLDV ALDCGFHDQSHLNRHFKKTV GVTPGQFAKEIGRNFIQA >_0086.002022_ ROSE_TM1040_30MAR04_CONTIG53_REVISED_GENE2024 rose_tm1040_30mar04_Contig53_revised_gene2024 MPPKPVLHRDNPDAEPLIGN IKVTREDWLKMGLDVLIRDG EERVKVLALAEAMGVSRSSF YWYFKSRQDLLDALLTCWEE TNTAGLIRQANAPAERITGA VLNVFRCIANPNLFDTALDF AIRDWARRSEPVRARMRAGD DARVAALTMMFARYGCAEME AITRAKVLYYMQLGYDMAEP EESYAYRLSMTPEYLKVFTG QTPTEAELREFEDYSRQFWR DVSP >_0084.003685_ SFRI_16AUG04_CONTIG90_REVISED_GENE3692 sfri_16aug04_Contig90_revised_gene3692 MKNHKYRLTMNSYFNKNACQ DADAREHSLFTKLPDFLNFA KHECLHYQQTLNNIDVDKID SRALLAKLPITRKSDLINLQ KLNPPLAGLNAASAKFHRIF QSPGPIYDPEHCTDDWWRLG QAFHAAGFKQGDIVQNCLSY HLTPGGFILDSGARACGCVV IPAGPGQTEQQLDIIEDLKP NGYCGTPSFLKILLDKATAD NRDVSSLQKALVTGEALPTA LRREFDAAGISTLQAYASAD VGLIGYETIADDGFIISEDV IVEIVRPGSLLPVADGAIGE VVITSFNRDYPLIRFATGDL SAIKAGESACGRTNMRIKGW LGRADQTTKVKGMFVHPEQV DKVCQSDACIIKARLVVSQV NHQDKMHLVCEVIESALRDG NEQVLKDKIAQTLKTVTKLT GSVELVPLNSLPDDGKVIDE QRDFD >_0082.000966_ NP_764421.1 gi|27467784|ref|NP_764421.1| cell-division protein [Staphylococcus epidermidis ATCC 12228] MSLGIQRDQLGDIIVGEDIQ FVLTKQLESYIISELTRIKG ASVKLNSIPTEDMIQSEENW KVHSTTVSALRLDVVLKEMI HKSRNIAQQLIIKKRVKVNH TIIDSTDFQLEQNDLLSIQG YGRAQIVEIGGKTKKNKLHI HYQTLFK >_0078.003159_ NP_827957.1 gi|29833323|ref|NP_827957.1| putative TetR-family transcriptional regulator [Streptomyces avermitilis MA-4680] MSHLVYACLMAVDREHVLRS AADLLTRKSTATMDEVAKAA GISRATLHRQFAGRDALVRA LEELGIRECEAALDAARLEA GTANEAVRRLVSEIEPAARL LAFLYTENQLFEGDAQHEGW ARLDARIAALFRRGQESGEF RIDLSPAWLTEALYGLMASG AWAVLDGRVAAKDFSYMTVE LLLGGALRREES >_0076.001369_ SAMA_14OCT04_CONTIG18_REVISED_GENE1370 sama_14oct04_Contig18_revised_gene1370 VFDHKAFLKTVSSASGVYRM YDAAAEVIYVGKAKDLKKRL SSYFRTNLPNIKTQALVSNI ANIDVTLTHSETEALILEND YIKQYMPKYNVLLRDDKSYP YILLSGHKHPRLAYHRGAQR EKGEYFGPYPNGGAVRESLN LMQKLFPIRQCEDAYYRARS RPCLQYQIGRCSAPCVGKIS DEDYDEHVRLATLFLRGKDQ QVIGSLVRQMEAATLAMAYE AAARYP >_0076.000495_ SAMA_14OCT04_CONTIG101_REVISED_GENE496 sama_14oct04_Contig101_revised_gene496 MKIQFIATVVGPLGPDVLQS LAKVSREQGAEWLSSKLIMQ DGQFAAMMKVSIDDDKEAAL RDNLAREFPTLGFVYAPVAA VEGPVQQIQVELDCNDRPGL TRDVNNVLANLGVSVSHFES HRVQVTSLGRTLFNASLSVA LAPEMDIATLTSALEAVEPN ARVHYREMAFAAHRV >_0074.003680_ RRUB_10JAN05_CONTIG98_REVISED_GENE911 rrub_10jan05_Contig98_revised_gene911 VRRADPGVRGSPAMTPKIEA LFNDALIAAAARTPRGPTPT HADFHVHRDNPFCGDEVSID LVLGEDGRIATATVRARGCL LVEAAATVLAEAAPGLHPCD IAEAARHLRAMLKKNTPPPG APWATLEMFSAISSTPSRHS CALLPFEAVEKALGGPLVQS SLCKK >_0073.003015_ YP_296479.1 gi|73541959|ref|YP_296479.1| Rieske (2Fe-2S) region [Ralstonia eutropha JMP134] MPESGIEGMAAARFLCPADA LVDGGSGVRFTVELNTRQVG AFAVRFDGAVHGYLNQCAHV PMELDWLEGQFFESSGLYLI CATHGAMYEPDSGLCVGGPC RGASLAKLRIEERDGNVFWV PEAPYHPADTADNA >_0071.000019_ NP_743060.1 gi|26987635|ref|NP_743060.1| ferredoxin reductase, putative [Pseudomonas putida KT2440] MPEICVGERRWLVPIGSNLL DALNEAGLNVPYSCRAGSCH ACLVRCLEGQPADALPEALA LEKHAQGWRLACQCRVVEDL RVALYDPQQGGVPAQVCALD WFGDVLRLRLRPDRVVRYQA GQHVVLWLGAVARPYSLASL PGEDDFLEFHIDCQRPGAFC DKARGLQVGDEMRLGEFRGG ALHYDPDWQERPLWLLAAGT GLAPLWGILREALRRGHRGE IRVVHVARDPAGHYLAEKLL KLPGVSVELVLVEHVDEALA GMRLLSRQTVALLCGAPGSV ERFARRMFMAGVPRGQVFAD VFVEHA >_0069.001701_ NP_895746.1 gi|33864186|ref|NP_895746.1| glycosyl transferase, group 1 [Prochlorococcus marinus str. MIT 9313] MTAALPAGSDLVIVLPHLGP GGAQKVALMAAEHFLVQGRQ VTLVTLLPDKPLSHAVPEGL RWVDLGPAVAETTSNRAPIA RIWRFSCTWGRRSLAWISLV IGWKVLKRLVPGQAPLFVQW LVSSASGVQATLLRDLLYAG KPARVLSLLTRTNLLCCQAM WALPGHLVVSERNDPRLQKQ SFPWLRLRSWLWQRADVITA NTIGVLEGLQHCHPAMADDM RLLPNPLVVDSHPGNHSDEP ASGTCFLAVCRLVPQKGIDL LIQAYAQLPEPLREMWPLLI AGDGPERASLETLASSLLPR GQVRFMGFQRNPQVLYHRDA VFVLSSRFEGMPNSLLEAMG SGLAVIVSDASPGPLEVVVH GKSGWVVPTEKVTLLAEAMQ AMAENPALRCRLGDAAAEFM DAYSWDTLDPIWSDILCLEK PG >_0062.000448_ OOEN_16SEP02_SCAFFOLD1_REVISED_GENE23 ooen_16sep02_Scaffold1_revised_gene23 MSELKIPGKLMLAGEYAVTL ANHLALVFSIDRFISKNYSQ LVNEKGVKYGLGSSGAYAVL MTKMENSSLSDKDIFRQALI LSRQTQPQNSGADIAASTYS GLLLYKNGSFPERIFFPENW NLIVGWTGKPAITSELVKKN QLSSFFVKESDMIVRKMVDF IKAKDFEKFNQEIFLAEKNL EKLSGVLTDKLAKAIEIAKN FGIEAKISGAGGGDNVIAFT RDPKISQQIKNNWQEAGIIP LDLHVYYKK >_0061.007001_ NPUN_22DEC03_PLASMIDB_REVISED_GENEPNPBF153 npun_22dec03_plasmidB_revised_genepNPBF153 MQIIMYLTKVFYKVGDIPIF NPLYIHAYIIHQWLLNFRSQ PLSVNQKSAIIFPPHQDDET LGCGGLIALKREMGIPIKVI FLTDGQKSHTAIPWVQPLEN LIQVRKQEATTALHILGVEV SDIHFFDLIDGTLNHLSNEQ YQNTIKHLVELIQTFNPGEI YVPHRKDNHPDHECTYKFVK AAVEQSQIELEIL >_0061.000837_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF1648 npun_22dec03_Contig1_revised_geneNpF1648 LTQKRGFGRFNLAVYWKRKY RNKQETEPWYLLTNLPDLET AAQIYGARFGIEAMFRDCKT GGYNLEGSKANPDRLVRLIF LIALAMTSAWLHGQRTKLQK QESYICRTQEKSRTSKRHSN FWIGLYGQNWIVAWNECQAW VEELVASIRNKQSFYLRGLR AMKLIQKAL >_0056.000175_ SARO_25NOV03_CONTIG22_REVISED_GENE189 saro_25nov03_Contig22_revised_gene189 MDGISNVTAGAAKGVISKGR RTANRILDVAEELFARKGYG ATSLRDIASQVGLQQPGLYK HFSGKEDLYRQVYERALKPM IDLMDEILMRPNSDFSDLTD HITDLLAAHPNIARLLIRAA ISSDSEPDPVGLDWLHRMIG YGRKMNEKAGLPSSEEALGV QIVAIFNMLFGFFWASPLLE SLSGRKATAPQAMAIQRDLL RTFVRSLDQTSAPVLPHPTL AR >_0054.000223_ NP_988503.1 gi|45358946|ref|NP_988503.1| coenzyme F420-reducing hydrogenase delta subunit [Methanococcus maripaludis S2] MDLERHKLTYGNHKKNHFNG IDMEYSLTPSYLLKENMVLA CGNILFADDGFSVHVIEKLQ EILTEEEKENIALVDAGAGA PQQVLTLIDSESKTKKIIVV DIIDYGLEPGEIKILTMDDL PKPDHTKVDSHDWPLSTTMY RVVRDSPREIDFKVVGCQKK YVSEPDVVLELSEEVQNAVD VAVDIVLSELRGKPIN >_0049.001969_ NP_785729.1 gi|28378837|ref|NP_785729.1| phosphoesterase (putative) [Lactobacillus plantarum WCFS1] MQYFTSDTHFYHADLLGDND FAPRLFPDVETMNQAIVDHW NARVTDQDTVYHLGDVALYF TRPAKLSYERVFALLAQLNG KIIFIKGNHDSRAFFKYLAA HDPQLNGQPKFEFHDVGVLI KYDHRQYYMTHYPMMMGIVK QIINLHGHIHHYAVNVKETS TWGSIHQKSITWTIRCPLER HFHWPRSSRWLTVKPLILPS ASKRVLDRLW >_0043.001088_ JANN_22DEC04_CONTIG21_REVISED_GENE951 jann_22dec04_Contig21_revised_gene951 MDSTDRKILDLLQKNAKTSI QQLSETTGISTASVQRRMRA FREAGIIKREVAILDPTQIG LGITAIVSVELERDRLDQID AFKRKARNDRQVTHFYCIAG EADFILVVMAENIAGYEAFT HRFFFADKNVRKFRTSIVVS TEKATSELPI >_0043.000038_ JANN_22DEC04_CONTIG11_REVISED_GENE39 jann_22dec04_Contig11_revised_gene39 MALSDRIPYQAIVDRPRLHL PGDKRVAVWVILNVEEWRIE NPMPRTVLPTTMGQPLLPDV PNWSWHEYGMRSGFWRQWKA LVDRNIPVSLAINGNVCTSY PRVAGAALEAGWEFMGHGFL QGPMHRLDDQEGAIAQAMDG IEAFCGTRPRSWESPGLTET EDTLDLLRAAGVDYVADWVI DDLPQDIDTPHGRITTLPYS VETNDIAVYALQGHRSDEFL TRGRDQFDRLYAEGAENARV MAISIHPYITGVPHRIRYLE ELLDYVGGHEGTAWMTASEI GDWYRAEMARIDGGT >_0037.002456_ NP_953432.1 gi|39997481|ref|NP_953432.1| methylcobamide:CoM methyltransferase-related protein [Geobacter sulfurreducens PCA] MTSFERVMAAMGGVETDRPP FTLTLSLYGARLIGASPEQY YTSPRLYAEGQDRVAELFGP DILFAPFALAREAEAFGSTV VYHRHGPPNVVKPVVKNAAD FMRLADPDPDAHPALLYLRE SVRLLADGYKGRVPLAGILT APVDLPAIIMGIDAWLETLL FEPELASAVLEKTGRHFVAM ASRLLGDGADFVAMPVMFFN TALVTSRIAAERIVPALAAA FAQVPGPLVFHHGGNRIAPF LGELAELPGVAGFVIDPRDS FQDARARVGDGRVLLGKLNG PLLGLLTPDDACRVTAEILA DRRDDRHFIFASSAADVPWN TPPETISAVADTIRQWQRHG Q >_0036.002616_ EXIG_01APR05_CONTIG285_REVISED_GENE2617 exig_01apr05_Contig285_revised_gene2617 MKASLLLQTALRHFAQHGYE GASLQEIAQDVGIKKPSIYA HYRGKDDLFLTAMRYALDTQ KTHLATYFISTRHLSLEQSL KGFFDWFLEESTQNDQLKFI LRIAYFPPVKLEREVTDLIN PFFDTMQRHLTRLLRERNRT EQILYSDDYASAALAYLTVT EGTMTEFVFNGVAAYERRFT AVWPIFWRGLVR >_0035.002689_ NP_814093.1 gi|29374940|ref|NP_814093.1| transcriptional regulator, GntR family [Enterococcus faecalis V583] MLLEIDEQSEQPIYQQLIDQ IIVGIAKGELVPNESLPSIR QLADEIGVNMMTVSKAYNKL KQSGYIVTDRRNGTKIAAKL PATAVWQQQLHERLELLLAE SFLHQQSEAEIQALIQKIYQ NFDSKGRAL >_0032.002843_ YP_012257.1 gi|46581449|ref|YP_012257.1| glycosyl transferase, group 1 family protein [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MHILLFHFDESCPASVRQTF LLARDLSAGTAGDVTLVCTA GGFLEREARSVDVPVLPVGG LTGKPLTLMRLVRAVRSREA FVLHCCDTASAALVRQLKRL SGGRCRTVQTWRVVGGDAAR HVERVCRHADAVLCHGEMVC DRLERHGVPRSRLFAVSPAV DAGHYAPRRPRGDGRCVFVV DAPLAASSGHGIVLAAMRLL ATMPGLPEWEVRLVGDGPDF ASLLATARQTGVDTHLAMLG TQDERRILPDADVLLAPSLD GEHGADAIRAAWCVGLPAVA SSLDVHTGMVRHDETGLLVP VGDADALAAVMARLARDVAG DGAMAAGLVAGGRAEVLRRT PERLAAAHLDIYREQAGAVR RSLQSAGV >_0030.000886_ DHAF_12NOV03_CONTIG1044_REVISED_GENE987 dhaf_12nov03_Contig1044_revised_gene987 VLRLSGISRGFLRWNYGKSL KKKILNGVRKVTSETNNYLL ELTQPEAEAILRRSQLAIIP NGSVEQHGPHLPCGTDHYCI MAIARKVASGLDGLLLPFSH TGVTPFHASFAGTLTLRQET YVNLLLDTAVSVINHGVQKI LFVNWHEGNTTSINYAASQL QKEYGVTCVVAQACYITEQL YKGEADLTHAGALEVLPVMA YRPDLLKLERATNPSPYEAA KEVDALRRSKSVYPILKDIR QIAPTGWYGDLDIVSEEKAA ELVERVSGEIITAAQQVFER MQ >_0028.000591_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE0592 ddes_06jun05_Contig143_revised_geneDde0592 MPEPVLVIVKYCDPSTGGEG VAYRFCRYMQSRGVPFQLVC GRNKDPHGPLAEHVVELGML RPTRLLKYASFFRRAERYIA CTRGVPFSFEYVRGAAIVRQ TGVHAIFLRRSLEGLPEKEK RRKMRSRWWNLYNRYVPAQE RRVLDSAELQRILVPSGMTR DEVCQSYPQHCHKVTVIHNG VDTERFHPADEAARAAARER YWPGAGARRIVGFAGNIFMR KGLAHCIGSLRGLPEDVVLL VAGGDNAAPYRQQAAALGVE HRVRFAGAVADMPQFFHALD AFCLPTRYDPFGLVIAEAVA AGVPVVTSHLAGSAEIVQDG VTGAVCRSLDDSAVAQAVDK ALRLDRSGIAASAPDERDMF ARYLALADNVRAARRPVRP >_0024.000361_ CHUT_08NOV04_CONTIG199_REVISED_GENE1323 chut_08nov04_Contig199_revised_gene1323 MNLHMKNIVLGLTLLIVAWS CNKESGKNTPDNVFYTCSMD PQVMEKKPGKCPICKMDLTK VIIDKNDMQSGLKLNNEQEQ SANIKTLIISEGDLGREKTV NGRIVINENKRFKISSRVSG RIEKLYIKSLGETISKGVKL YDLYSEDLLVAQREYLIAIE KSKEFSGSELDYIQLADGAK NKLILWGMNENQIKSLINKK ELSNTVTIYSPYKGVVTSVN KREGDYVMDGDVVFETADLS TLWVEAELYAGEVNEFDVND PVAIKIIGFPGKIWNSRIDF FAPQLQTQSKVNLVRAEISN TDMSLRPGMQANIVLKENTK KALFLPLNAILQDSKGATVW IKEKDGSYHSRMVITGIESN DNIEIRSGIEAGEEVVINGA YLLNSEYIFKKGTNPMAGHD MSNM >_0021.003048_ NP_421967.1 gi|16127403|ref|NP_421967.1| glycosyl transferase, group 1 family protein [Caulobacter crescentus CB15] MTTILHAMLGKGLGGLEQVF LDYQPILEAWAARRGGRCVG VVRKGGKMAVAQANRTPPLS AMPALTDWDPITVGAARALV KTYRPALIFSHGQRPARVFD KAAPADVVRAVCLHKPSFDV TPGTHYVCVGQHLAALAIER GAPADHVWLVPNAVKPPGVE AQPFAEAGRPIRIVAAGRLH PKKGFDVLIHAVGKLRAWDY EVTCEIAGEGDERGALEGLI RDLDLEASVTLKGWTGDVAG FLATGDLFAFPSHQEGFPLT LLEAMAVGLPVVASEIDGPL EILTDGRDGRLVPDNDPDRL AEALAELISDRETAVRLGAA ARQQVLTEYSPQELARRLEA ALDGMTSRA >_0021.002926_ NP_421763.1 gi|16127199|ref|NP_421763.1| conserved hypothetical protein [Caulobacter crescentus CB15] MRHTLCVVHPMPTDTDIFES LRQELRRGTLILAVLAQLRE ERYGYSLRQALSGVGVEIDE GALYPMLRRLEAQGLLASEW REEDKRKKRFYQLSSEGRAV LARLADEWRAINAALEPLLE PAPSDGHQKG >_0020.001336_ CAUR_25MAY01_CONTIG1108_REVISED_GENE1979 caur_25may01_Contig1108_revised_gene1979 MIVSHLANLQRSYHHIVLSP HLDDAALSCGGSLAAAVAAG EPVLVVTICTATPPTDMQFS ALAQQFHADWGLSPTMVMQT RCREEETALSILGADGVWVG ALDAIYRMPEVYNSRAALFG EPAAGDPLFTTLHDLCHQLR HRFPHATVYAPLGVGNHVDH QLTCLAAATIEGPVMWYEDF PYVVRPGALEQRLDILDWPL RPLVRTIDERMTTRLAAITA YASQLAELSRSQLGHPIANG EAVDVMADAVRTYARQVAPP GTLYGERFWVRSGDCRYT >_0019.002945_ NP_346893.1 gi|15893544|ref|NP_346893.1| Molybdate-binding protein [Clostridium acetobutylicum] MENKTLTPLDVAKILRISKN TVYELIKRGDLNCYRVGKKI RIDSKDVELYKHKCKTNTKK APNSLQNTRKNINENFLQSN GNIPNGSFIICGQDILLDIL SRYLQIHPSGTTALRSYVGS YSGLLGLYFGKIQIATAHLW DGDSGDYNIPYVRRLVPGIH TIIINLAFRTMGFYVAKGNP KNITGWEDFRRDDLTMVNRE RGCGTRILLDEHLRLLNIDG NKINGYSRESLSHLAIASII SRNGADIGIGNEKTGLQVAN IDFIPIQRERYDLVIRKDDI EKPPFKAIIDILQSETFKSE LMGIGGYDLTQTGTTIAEM >_0018.001044_ BFUN_06OCT04_CONTIG480_REVISED_GENE678 bfun_06oct04_Contig480_revised_gene678 MNDVQGFIARLGDMPVISDP DVVRRRSRDMSMSFSPIIRR DAADKVAELIVRPRDKADVL GIASAAARTRMPLMMRGAGT CNLGQGVPLRGGAIVDMTAL AQVLWTKPGRVRAQAGTRLI DIDATTRASGWEIRMHSSTK RAATVAGYIGGGHAGIGSCT YGILRDRGNILGLEVVSVEE APQVIELRGDDVNLVHHAYG TNGIITEVELPLAPAWPWVE AVVNFRGFMDAVHFAYALAA SDGLIKKLISIDEFPNWQYM EAMRPFGRDGHSMVRCMIAE HCMEGFRGLVTEAGGAIAVE APEGQGPYGAPLWEFGWGHA RIQVNKTRPDIVNNIGLYLD PNLIDAVERSYRRFQGVGGM HLEVKRYGGRIAFQGSPYYG FVDEAQVAGVISGMIEDGAM VANNHTFFVKENGMKHVDER DADFKRRMDPYGLMNPGKFE ADDIEPKEGAGLALPTTGWS YDGASAPQTSASRG >_0018.000787_ BFUN_06OCT04_CONTIG480_REVISED_GENE446 bfun_06oct04_Contig480_revised_gene446 MSTPANAGNTNGTSHAFPLD GEDVPFEPGDTILQAARRAG RYIPHLCWHPDFHPQGSCRV CTVKVKGRPGSACTMHAADG QKVESNTDELNAERKMLLQM LFVEGNHFCPSCEKSGNCLL QATAYEMNMEGVLFDEFYPR RPIDASHPDVMIDFNRCILC ELCVRASHDIDGKDVFAIAG HGSGTHLVVNSVSGRLADTE MDLADRATSICPVGAILPKR RGFLIPIGERRYDPQTAADP DTQEAS >_0017.002570_ NP_810016.1 gi|29346513|ref|NP_810016.1| RNA polymerase ECF-type sigma factor [Bacteroides thetaiotaomicron VPI-5482] MPSPGYLNSKINIMRGFDFD KALVALQNELHCFAYKLTAD KDEAENLLQETMLRTLDNKD KFDSGTNFKGWMYTIMRNAF INNCRTKKIRGNLYVLSEPE YHFLLRDDSFIFVDNGHDAK EIREALKTLPKAHYVVFMLY RSQISGNSRKDRSVTEYDKK PYLL >_0016.003983_ YP_441364.1 gi|83719988|ref|YP_441364.1| transcriptional regulator [Burkholderia thailandensis E264] MGLTLMVAVPARHPLLAHKY LPLEEVLHYPLVLGDPHACE GFARQVDRILRRVDREPIVA EWVASIDLMMALVSAGFALG LAGTSQIVASREPGIVARPL AGRAPMLTTYLLRSDNEPSQ VLARFIERVSNLESLAAKKA VVSFDPDAREE >_0009.003367_ NP_244217.1 gi|15615913|ref|NP_244217.1| transcriptional regulator (Lrp/AsnC family) [Bacillus halodurans] MERKELDVLHILEENGRVPI PTLAKMIDATEEEVTAIIKK LENDHVILSYSAVIDWSKVK EVETVTAMIDVKVTPQRGVG FDEVAERIYRFPEVKALYLM SGAYDLSVVIEGKTMSEVAR FVSEKLSTIDTVLSTTTHFQ LKKYKHDGVVFKKDEDDKRI VVTP >_0006.003545_ NP_889419.1 gi|33601859|ref|NP_889419.1| hypothetical protein BB2883 [Bordetella bronchiseptica RB50] MGNLKDLHPPVARGQAHTAK GARVALQNLWGAAGLPDEAL DHVELHGAEPVLPSSFAVGT AAQASIAAAALAAAEIWHLR GGQRQRVAVDMRHAAQECRS YFKINGVTPNIWDKITGVYP CGDGGWVRIHANFPHHRDGA LALLGCPPGEAATREMVERA LARWRAGDFEQVAADAGMVV AAMRSFDQWDRHPQGLATAS QPVVRIERIDNADPRPLPKY GHEATPLQDIRVLDLTRIIA GPVCGRALAAYGADVMLINS PHLPNIDNIIDTSRGKLSVH ADLETADGRIALGNLLRSAH VFVQGYRPGGLQALGFGPED AARIRPGIVYVSLSAYGDSG PWAGRRGFDSLVQTATGFNH AEAQAAGQEAPKAMPVQILD HASGYLMAFGALAALARQRI EGGSWHVRVSLASTARWLRE LGRVPDGLACPMPPIEELLY AEESGFGELTAVRHAAQFSL TPARWTRPAMPPGSHPTVWP FR >_0006.003209_ NP_888734.1 gi|33601174|ref|NP_888734.1| phage-related exonuclease [Bordetella bronchiseptica RB50] MNRYILSPHEQGSDGWLLDR CGRVTGSRAADMLAMTAKKE WSTKRADYKFELAIEVLTGM PQGSDYTSKEMQWGIDQEPF ARMAYEEASGNVAIESGFMY LPDVAAGCSVDGLFVEDGRR GVLETKCPKSTTHIRYLEAG TLPDQYRPQCLHNVWVTGAE FADFVSFDPRFPEELQLFVC RFTPTAKELADHEKAVLQFL AERDELVAQLKRLAA >_0006.002379_ NP_887126.1 gi|33599566|ref|NP_887126.1| TetR family regulatory protein [Bordetella bronchiseptica RB50] MQNTVDESSLVTRRRAQLVK AAIKLFSRMGYHAATVKDIA DEAGVSAGLMYQYVSDKQDL LFLALQHIVQRNKEEMPLAL KGVEDPIARLYRAIDAYTRV IAANQQAVLLTYRETKSLKP EYIEQMKELELETNQLITEC VTDCIRAGYLAQTHEELLVY RIIVAAHAWPLKHWRLRHIV TLDEYLEQAIHAPWMGLLLP RGTSRYEELRRAGELCPTRI ASEPEVEDADQAPAPKAKKR ARRSRAA >_0006.001525_ NP_890103.1 gi|33602543|ref|NP_890103.1| putative decarboxylase [Bordetella bronchiseptica RB50] MGAIAAHETGDTAATAAWDE TVWRILKKEDIRLVTYVPDK VLKPLIDRVEADDHFQVVCP AREEEALGIVCGAQMAGMRS ILLTQTSGFATLANVLASLP VPYEIPVVMVISERGALGDR QLVQVRVWQTMRPILDSLGI PHHTLTRADEVEFVVEEAIR QAFATRSPAALILSPKLTKK SSD >_0005.002927_ YP_325300.1 gi|75911004|ref|YP_325300.1| Squalene/phytoene synthase [Anabaena variabilis ATCC 29413] MDLRSDALQILKDTSRTFYI PISILPSGLQEAVASAYLCM RAIDEIEDHPTLDNPTKAKL LRGISLTLQAGVDGFPVDAF AAGFSGYENTLAEVTMRIRE WSLLAPETIAPRIWDATAAM SDRMAYWAENNWKIYTESDL DRYTFGVAGAVGLLLSDLWT WYDGTQTNRTLAIGFGRGLQ AVNILRNHVEDLGRGVSFFP DGWDANNMQEYALRNLALAD AYTQALPDGPALNFCQIPLT LAHGTIDAIANGKEKLSRND VMALLEHLITFNLKAG >_0004.004940_ 17743724 gi|17743724|gb|AAL45972.1| ECF family sigma factor [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008687|:280231-280593, Atu5283 MLRGLRALHGFTPGTALKSW LFTILRNTFCTRYKVSKREC VGLPVGIEQTMSTPASREWH VQHQEAMRAIRDLDKDQKKA LLLVAGGTSYSDAAAICGCR VGTIKSRVNRARESLRRNLD >_0004.002836_ 17741413 gi|17741413|gb|AAL43868.1| transcriptional regulator, MarR family [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008689|:c49814-49329, Atu3052 MSKPTAKPPVSPTDINLDVL EDTLSFYIRTINLAVSRDLD NRLEGLDVAKGTGKITTLLL VDDYPGIRPSVIAHLTMRDR SAMGRVIDQMVSHDLIRREV SPDDSRAQELYITAAGSALA LKIREIVPQQSRDFFSFIPE DEQKQVIDILRRAYRHIVGL S >_0004.002752_ 17741321 gi|17741321|gb|AAL43784.1| Cobalamin biosynthesis associated protein [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:c2804886-2803531, Atu2803 MAAVADPVQPFGRRGLCPAL SAPMQTGDGFLSRVAFEADI SPHVMVTLCGLAERHGNGLI DITARGSLQFRGLTPESACA LAKDVLALDLPLREGLAVET SPLAGRDDAEIADGRFLAEE IRKGAGALALHDKLAAKTSV VIDGGGRLAMGDLLADIRLK AIRLEGRTFWQLLVGGPESK ALKAGLVEPVAAAGVVLSLL VFLAEKGPFARGRDVDQQTV TAICGERLLGWGSGGEARTP SLPLGLMMTGESRFAVGVAP AFGQIRSADLAHLCERADTF GIDALRPSLNHSLLFFGSSS ACEALREAAVASGFVTTAGD ARSSIAVCSGAPGCASAFLH THDLAAFAAEECAALLDGSF TLHVSGCGKGCAHPAPSLFT LAGTSDGLAFSISGRAGDPP AGILPFEQQQTALSRLARLY EKEHKPGENAAIFFARLGRE EIGAALRQDNQ >_0004.001731_ 17740205 gi|17740205|gb|AAL42763.1| methylenetetrahydrofolate reductase [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:1748130-1749089, Atu1764 MPQQTQTSAEKRCMVTKRIM LKLPDISAASLIPASIEASP AQVLGPASLAGIFPRGVRVY LTDTGAASQEKLVDAATHLR NLGYEPVPHLAARRIPSRVE FEEQVKRLAGEADVTDVLVV GGGVDRPAGPFASSMDMLSS GIFDRYGIKKIAIAGHPEGS PDFSEETAIAALRLKRDFGE RSDAAMRIVTQFGFDPARFI AWAEGLAASGIDLPVHIGVS GPAKITTLLKYAALCGVGNS IAYLKKNALSLTTLARGHSP DSIVGPIERHWQANPQGPIR QIHVFPFGGLQNSADWLVSR GSWQTRDAGRPAPADSMAG >_0120.001187_ YP_001301455.1 gi|150006712|ref|YP_001301455.1| hypothetical protein BDI_0038 [Parabacteroides distasonis ATCC 8503] MNKREIVGYALYQIGKFGRK EPDFLSIYFHDPAPKTFTSI LEWCKKKSYRFITLQECYAI LSGKIRQEGKVAYLSFDDGW QSNLNLIPIIDKYQAPITIF VSIDPLLSGNFWWEYALADG IELRNKMKLMSYKKFVSSLE SLKKKYSLERSVLTEQELVK IASHPLVSIQSHTMSHPILT NCDDEILSYELNSSKAYLEN LLHHKIEAFSYPNGNFSNRE IEAVKKAGYKMAFTTEYIPI QINHTNLYRIPRMAMNTYGG YYENLSKILGIWQDILKK >_0117.001119_ YP_795402.1 gi|116333875|ref|YP_795402.1| Transcriptional regulator, xre family [Lactobacillus brevis ATCC 367] MRLLGSKVKQYRKQKNLSQQ ELADGICTQATVSLMEKKDK ILSIKIILQVCRRLNVSLGD ILAGVGSGLDEQFDAICLDL RQDKYTEMAARLDQVDVSTE AASAFDRERYHYYRGFAELT VEQRPDEAIFHFNLVLRRSF HMAWDLYAIIGNVGMASAYL MRQDRDQARYYLGEATSYLT NFEWHDESSFQRLVWARYQI ALMYLKLEQPQQALTNVEAA LRVVKHRASLYLIDVLYEVK GLGEQALLARTAAKRSLMIA KTLALVTHNQALQERLNVTT DDLPAWSDQVNGFS >_0116.001280_ YP_194300.1 gi|58337715|ref|YP_194300.1| transcriptional regulator family [Lactobacillus acidophilus NCFM] MENIKKNKLHQLFENLEIGI GGVSSSIGVSQRQLRYWEKK GYIKPINEESGVRHYSLATV YLIAFIKDQLDAGYTLEAAV KKSKEVRVKSMIGRRLLHDA FDDVEITDEEKAYGKMKMGE VVLGDKKAEVIGIVDENGSH FELE >_0116.000275_ YP_194591.1 gi|58338006|ref|YP_194591.1| hypothetical protein LBA1749 [Lactobacillus acidophilus NCFM] MVAIAFYSITGQTERFIDKI QLKAHQISDANPKYDMGQKY ILIVPSYQDFMMDSVVDFLT YKDNKKNIIGIIGCGNRNFN DLFAQTAKKIAATLKVPILY LLEFSGTNQDVKNVRKIVHD LSAGQSTKEVQKPKELRGNI SFLSDYRD >_0113.002058_ YP_001088069.1 gi|126699172|ref|YP_001088069.1| MarR-family transcriptional regulator [Clostridium difficile 630] MDYSNELKELFLMNQTYATL FTLTNKIQIEGDKYFGILTS RQYMTILSILHLPEEETTLN NIARKMGTSKQNINRLVANL EKNGYVDVIPSPHDKRAINV KVTDLGKKVMVTCSRTGINF MADVFHEFTKDELETLWSLL KKMYRFNGEEQDGFEEDANF MEYEEIDKIKSEALEEFAKR RNRVNKND >_0111.004388_ YP_212951.1 gi|60682807|ref|YP_212951.1| hypothetical protein BF3340 [Bacteroides fragilis NCTC 9343] MEGINTPFVIDEHTAIVMTD PQNDFLSENGLGWGAFGENI QKNGTVENLRRIFEVAEAKG MLVFISPHYYYKHDHQWLFE GPIEKLMHDTGMFERRGQLT GEGFEGSGADWLDLYKPYIN EGTNIIVTAPHKLYGPENND LILQLRKRGVNKVVVCGMSG NLCAESHLRELQERGFEAAV VFDATASAKLPGMDADTAAF INFTLLAEKVYTTDEFVNEM WQR >_0111.003176_ YP_211426.1 gi|60681282|ref|YP_211426.1| putative ferredoxin [Bacteroides fragilis NCTC 9343] MDYKSKTANDKEISIYYFSA TGNSLKLSQDIAAAFGGAGL FRMTPSAGITASDSRIVGFI FPVYMGGLPGIVRRFLESYP FRKGVYYFSIGTYYTYKGCA VSVVDKIMSGKGVRLDYGYN LPTVGNCLKEYEVSSVRRTK ILERAELYTSRIIDDLKKGK RKKPFPYCRLSDLLHKGLFN AFFSRSHLNFSLENGCMGCG VCERVCPVNNITLKNGVPRW GTGCEACHACVHWCPRNVIQ IGKSKGRLQYHHPAVKRTML YRVE >_0111.000143_ YP_210751.1 gi|60680607|ref|YP_210751.1| hypothetical protein BF1070 [Bacteroides fragilis NCTC 9343] MNCKFEGSKVFFTSDTHFYH GNIIRFCNRPFKDVGMMNET IISNWNNTVGLDDIVFHLGD FCLGGSAEWTKILDRLNGKI YLILGNHDLKNLRQGYVDRF EHLAMQMHIEVNKQKIYLNH YPFLCFDGGYKDVWQLFGHV HTRNNNTGIDATRLQHLYPT QYDVGVDNNNFMPVSFAQVK TIIEKQIKQSKMEE >_0108.0000282_ YP_461465.1 gi|85859263|ref|YP_461465.1| 4Fe-4S protein [Syntrophus aciditrophicus SB] MLKTLRREVFSKMAKIKKKI KTIKIDESKCNGCRACEMIC SAFHSTPKYSTINPERSRIR MYRDVRNDLYLPVYAGEYTA AECMGRDVYVIDGREYEECA FCRAACPSRDIFHEPDSGLP LKCDMCEDDPPQEVPLCVKW CLNDALIYEEREEEVEEGIE MEDVETGLLAMVDKYGLQKV LDTAARVAKKGESVESKK >_0107.000684_ AFE_0704 AFE_0704 site-specific recombinase, phage integrase family {Acidithiobacillus ferrooxidans ATCC 23270} MTDVTLLGPWIRRFLLEYLV RERNLSINTQRSYRDMLSLF LPHVSKQLNKSVDRLTVSDL SADLIRQFLSDIEESRHCLA ATRNQRLGGLHALARFIGEN SPEHIEWCSQIRLIPFKKTA YPGITYLEKPEMDALLESPD RQTPQGQRDYALLLFLYNTG ARASEAADLRIVDVDWHAQC AHIIGKGNKRRTCPLWPTTL DQLRALATQRGLDQRVFLNR NGQPITRFGIHTMVERHAAR ACVQVPSMSTKQVSPHVIRH STATHLLRAGVDINTVRAWL GHVSLTTTNIYAETDLETKT RALATCAPPTAEGMATKHWR QQPDLMAFLHGL >_0102.001314_ NP_111110.1 gi|13541422|ref|NP_111110.1| Predicted ICC-like phosphoesterase [Thermoplasma volcanium GSS1] MRDIEILRDVFLSDLYCVYL RDISAVVVSDLHLGYEEEMN LHGLFLPRMQRDHVTHIMDK IVERYDPEKIIINGDFKQEF SKNLPSEWDDIIYFINRYDD RDLIFVRGNHDNYLATILSK KNKELLDYYEDDRYFIYHGD KDLSTKKITILGHEHPSLVL RDRVGGIYKLPAFAFNFKKN VIITPAMSFFSSGTDLSQSL LSEEHFTPSLKGMKASSFRI FAITDEFGLVDFGYLEDLRS SEGQQTRYR >_0095.002081_ NP_459861.1 gi|16764246|ref|NP_459861.1| putative inner membrane protein; homology to SgaT from Vibrio [Salmonella typhimurium LT2] MKDSVNILFVCGYGVGSSVM LQTVVKKALAKYDFSFDMEH TAAGEVGGFTDWADIYAISK KLLDVVSLDPKHGQYLIPIE NIMDGESIGKQIYDVVEKNF PHLLNK >_0086.001829_ ROSE_TM1040_30MAR04_CONTIG52_REVISED_GENE1831 rose_tm1040_30mar04_Contig52_revised_gene1831 MSDGRSGTGFGGESLLFLTD EQLRQGIEAMFFAYQGFTAD PDRILADLAYGRAHHRALHF INRSPGTTVNNLLSILGVTK QSLNRVLRALVEDGLVDSRV GTLDKRERNLYLTERGVDLE QRLSDAQRVRMRAAYKQAGP EAVQGFKKVLEAMMDADMRR AYTELREKSQ >_0081.003708_ SDEN_20JUL04_CONTIG99_REVISED_GENE3710 sden_20jul04_Contig99_revised_gene3710 MISFSTGLSNRGLISESFTE KIVKITTEELLRLFMFAPTQ VQKIPFAGRSLYIKRDDLIH PQFSGNKARKFQYFLTHEFA HITKLVGYGSAQANSLYSLA VLAKLKGWKFDYYVSHISDY LLKQPQGNYLAALDSGANII ALDKLVSQEVACSEFGAHAS MGLAPLAVPRDSQTQVMPKL ESVMQRLEATGKDNELYIPE GGRCEYAREGIELLGEEILQ WAKAQEIEHLVVFFPSGTGT TAVYLQAFFNQWASSQHKPF IEVHTCSCVGGDEYLLAQFK QLCPQLSHYPTILNTGKKYH FGKLYLECYQMWQKVCQTGI EFELLYDPIGFLVLEAALQH RYSHKDTILYLHQGGVLGNP SMLGRYQRKYPLG >_0079.003886_ SBAL_17SEP04_CONTIG249_REVISED_GENE3889 sbal_17sep04_Contig249_revised_gene3889 MEETKMDVVYIDARPLLGLS TRTNNRAEMSADSAKIGSLW QAFFESSQLTAMLNSPMYGV YYDYESDMNGEFAVLVGKAI DAPVEANHFTSLELEAGKYL KFTGQGDMPQCVIDLWGQVW GYFSANDCPHQRRYQTDFEV YLSATEVEIYIGIL >_0079.002481_ SBAL_17SEP04_CONTIG233_REVISED_GENE2484 sbal_17sep04_Contig233_revised_gene2484 MPEFIPFTHTDIASLVSLRT GETKLGQCVHLANHEHTLET ILATAKAHGASFAIFGVGED IGPRANLGRGGATDAFTTSM RQWLNLQSNRFLSGAECLVL GQVNAADLQQQTASNTTDNT NVTLDELRDAVEQLDERVIR IVSAILKAGLEPILIGGGHN NAYGLLMATYGHYQRQVAAV NLDPHSDFRLLEGRHSGNGF SYAADRGALGCYHVLGLHEL KNSEANLSQLSEFGGTWHSL QQIWVRREISLSQALLEIAA KLNDTGLPVGLELDVDAIAK MPSSASTAAGVPLLDAAHYV SYIARHCPCAYLHLAEAAPS CHEAGIEAGFRDVGQSLSEL IYAYVQARMQFLAQ >_0079.001431_ SBAL_17SEP04_CONTIG210_REVISED_GENE1433 sbal_17sep04_Contig210_revised_gene1433 VAHNIDVDELDSIAVAVEHN DIAQMRVANIKIASINLFNF IEPPLAYYDFENIYSHGQWQ KKCQWLSEFLAHRQPDIVGF QEVFSPEPLKRIASEQGLVH FAVIDAPTLISDYIYRSPVV ALASRYPIVEISSVEPDARL VAAMGLSSEFAFSRKVLRAT VEVPHIGKCDYYVVHFKSKR AGLALEPKLIEPKRFEHQPL ALDNLAPAASMKLHSETQLL TEQALGRWASTMQRGAEAAL LFNGILVRRQASKHPVIVMG DFNDSLTMGALDALTIQGES LHSNDIKAAGLGHLSDAALA AVFAQYQLKDAYELFIEANL RDSLTGYTAYHREHRAATHY YGPKGSVLDYILLSSEFDAS HGRSLAQVVDYQTCDRHLVR PEYERDAYSTDHAPVIVELA LRS >_0078.004065_ NP_822684.1 gi|29828050|ref|NP_822684.1| putative ArsR-family transcriptional regulator [Streptomyces avermitilis MA-4680] MTTAAPTARSRALAHPAREE IRLEAVLHALSDPMRLQIVR ELAADGDELSCSHFDLPVTK STTTHHFRVLRESGVIRQIY RGTAKMSGLRKDDLDALFPG LLDSVLTACAHQADRLGGA >_0078.003277_ NP_828093.1 gi|29833459|ref|NP_828093.1| putative TetR-family transcriptional regulator [Streptomyces avermitilis MA-4680] MNNSQQRGVVRPGARGTDRS AARRAELIAIGRKLFADTSY DALSMDDIARQAHVAKGLIY YYFKSKRGYYLAIIEDSVTG LVTRAAEGLELPPDQRVHRT IDGYLRYAEHNQAAYRAIIS GGVGFDAEVQAIRDGVRQAM VDAIAEGAYGRHEIAPLARM GLLGWLCSVEGATLDWTGRP ELPRDIMRKLLVKMLGGTLR AIEELDPAYPAPPAARRDT >_0077.000735_ NP_373138.1 gi|15925604|ref|NP_373138.1| putative regulatory protein [Staphylococcus aureus subsp. aureus Mu50] MMTRSQNKQQQLEQAKDIVI NSIGQTMDLYGTNRSVGNLY GTMVFEGSMTLDEMRHQLQM SKPSMSAGVKKLQEFDIVKQ QFTRGSRKQHFIAEKDFFIF FRNFFTKKFQREIDINVEAV KDAQAIINPLLESSDLTEAE TQEAQKIKAQLDHTHVYYEW LEQLTEAIESGEIFKYFPIP EQPSDSEN >_0075.001861_ NP_689018.1 gi|22538167|ref|NP_689018.1| hypothetical protein SAG2032 [Streptococcus agalactiae 2603V/R] MKRNKHLPLTETTYYILLAL FEEAHGYAIMQKVEEMSGGD VRIAAGTMYGAIENLLKQKW IKSIPSDDRRRKVYIITETG KEIVELETNRLRKLLNTANQ LGFGGDSYDKV >_0075.000186_ NP_687190.1 gi|22536339|ref|NP_687190.1| adc operon repressor AdcR [Streptococcus agalactiae 2603V/R] MTVLEQKLDHLVSQILLKAE NQHELLFGTCQSDVKLTNTQ EHILMLLSQEQLTNSDLAKK LNISQAAVTKAVKSLISQDM LKANKDSKDARITYFELSEL AKPIADEHTHHHDNTLGVYG RLVNHFSKDEKVVLERFLDL FSRELEG >_0073.005654_ YP_299252.1 gi|73538885|ref|YP_299252.1| Peptidase U32 [Ralstonia eutropha JMP134] MTISSSVRARHPQLVAPAGS LTALRAALEHGADAVYLGLR DATNARNFGGLNFTEADIRT GVAEAHARDAEVLFAINTFP QTGEVAKWQRAVDAAADLGA DAVIMADAGLMAYASERHPQ LRLHLSVQGSATHADAIELM RERFNVKRVVLPRVLSLAQI GKLARQTSVELEVFGFGSLC VMAEGRCLLSSYATGDSPNN KGVCSPAHAVRWTEQDGTMH ARLSGILIDSYAPGEPAGYP TLCKGRFEVQGERGYVLEEP TSLNALSLLPALIDIGIAAI KIEGRQRSPRYVADVVGVLR AAIDAACAEPARFTPRQEWQ GTLGRHAEGDQVTQGAYDRP WR >_0073.004948_ YP_297949.1 gi|73537582|ref|YP_297949.1| regulatory protein, MarR [Ralstonia eutropha JMP134] MATKKQESPVTITPSDEVDV ERLAHLVKDAARAYIRALQL RLAQHSVSYGHWTILRILWR HDGLSQRELSERAGVTEPTT FSAVKALEALGYVERMHLPG NNKKVHVFLSKAGRALRRKL EPLAEEVNAISVEGVTEEDV MTTRRVLISIAEKLVADEAA LAESGRRVPSTQEVGRLLAD L >_0073.001918_ YP_297582.1 gi|73543062|ref|YP_297582.1| regulatory protein, TetR [Ralstonia eutropha JMP134] MFCAAEPVSDANDIFGPLMP VSSLSPHANPKNRRGLETRQ RVLEVTRDLIRAHGYADVTL DQISASAGVAKSSLLWHFGS KEMLLAEAATSLFQQIAQEI EPDVRAGETPQRRLDRVFNQ VAQAFTANPEAKGVVLGMLF SGSVPASVRAEIRRGWDSHV RQLVDAFSQPHRPMPAAMAR LMLATFHGCYCHWYAGGCSE SVAEYLEPARELFRAWLGAD AG >_0070.000499_ PPEN_30JUL02_SCAFFOLD21_REVISED_GENE657 ppen_30jul02_Scaffold21_revised_gene657 MFFEVNSRQGVFNMENQIYI IYISVAGNTQSFVDDLADYA EKMHQNDTSNPLIISKEVTD QTDFADETQPYFAFVPTYLD GGNGIDNGVKELMTNTLGEY IAYHDNRKFCLGVIGSGNRN FNEQYCLTARRYAQDYGFEM IDDYELRGNSSDCKRIYDNM ANRVKNNI >_0070.000250_ PPEN_30JUL02_SCAFFOLD15_REVISED_GENE373 ppen_30jul02_Scaffold15_revised_gene373 MKITIGNILKQTRQSQGLTQ KAVADGICSQAMLSSLEHNK YIPNAQLIIALCQRLSISLE QLNLAENYAISNDKKLNQKL NFLCNQHQYQLLFELLHQDS TLNSLTTPSQIQSYYYYLAI SEFQLKLDVTHAVQHLKLAL ASESSKQSVLRRLCLGTLGL ILNLHTSSSGDVYLQQSLKN LQDNIYHPNLNILFYLQAFN AYHQQNFLECTHTIENGLSF ITNHDSHYMLANLYYLLAKV ASKTNNLNLQNEATQRSKIF TELFNERPYQDFKI >_0067.001781_ NP_142735.2 gi|33359332|ref|NP_142735.2| hypothetical protein PH0799 [Pyrococcus horikoshii] MTVVKMSKERMVELLQEHFE LNLYEARAYVALVAFGVLTP AELASVSEVPAPRTYDVLRS LEKKGFAMNQPGKTNKYRPV HPANVLEKFIQDWQERVKEE LEAKKKAKEELLELMAPLIE TEVPKYGVERVWVVRGIKNS TLKTKEMLEEVQREILLADD GFIAINLEDDIIKAVDRGVK TRIILTKNLLARIKTSKIVQ YAKEGKLELKVLDKFDLPML VCDEEVFFALEDLAARYFNY ETQVWIKDHRVVDLFKKRFE EYWEKAENA >_0064.000045_ YP_264178.1 gi|71065451|ref|YP_264178.1| possible transcriptional regulator, TetR family [Psychrobacter arcticus 273-4] MSRQHQFKVREENILAMAEQ LLLESGDGNITLDSLADQLD LAKGTLYKHFSSKDELYLRI IIRYEEQLFEINRIDDCPSA GVARMIFQQLFNPQKAMLLN QIEERLAASVTGLNRLFGEL YDIRRQRMKRLIDIISAYLK DEHSSLSTRDYLSSIWAMGQ GGAGLLNSSFYQRYLGRRDT LRYAFVQQMLELPSHYPAVD DEVMDEDMQELVEQIDTESE EHRNTNY >_0062.001707_ OOEN_16SEP02_SCAFFOLD8_REVISED_GENE2088 ooen_16sep02_Scaffold8_revised_gene2088 MNDKQYQEIASLLERLSRNN AMSIYGDWIKNSFSKELIAS ISVLTHNDLKILDSLFKQDL SISDVVSRTGLSQGGVSRRV NMMSEKGIIHKYQNDKNKKT VFLKLNPIGQELFDFHRKLH DHIKEIFFQKTQRFNEEQIR VVISFLEAILT >_0062.000951_ OOEN_16SEP02_SCAFFOLD33_REVISED_GENE1297 ooen_16sep02_Scaffold33_revised_gene1297 MKYKILLKGKQYFATINLFD GRIAMDVQVQFMAELSQLID KLSFLSRDQLKIALKGYKPA EVHTIEFIGEHSETNVTAIA SALYVTRGAVSKITKRLISR GLIVRYQKPDNQKEVYFKLT AQGQEVFEIHKKFTNYFMQR DEPIFRDSPDAVATTLAFLH QYNAFLDQKLTIKK >_0061.007109_ NPUN_22DEC03_PLASMIDB_REVISED_GENEPNPBR124 npun_22dec03_plasmidB_revised_genepNPBR124 MELVLVKLGGSLITDKDKPY TARREVITQLVEQVSLIKRE NPNLKLIIGNGAGSFAHQSA NKYNTINGFSGDEEKLGFCL VHQDALDLNFLLAKSFLQVG LPVVSLPPISMIITHNKKLL KSDFSGIESSLQAGLIPLVF GDVVLDQAIGGTVFSTDAML AELAKYFYLQGKFKVRLINV GNYAGVYDRAKVLNVKKNPI KTAQNMLNFGRKIKVYLFNS DYKFISQDCQRYLERTAKKR LSSIEQSSVL >_0061.003435_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR0442 npun_22dec03_Contig1_revised_geneNpR0442 MMMVGKLELMNPPTIEHFLI VDSLDTAFKAEIKRCSSPWL CFRESGVRTGRNKSRLTDLC VVTVEQARELLNASAVFQSP PLLIVEVVSPESVKRDYRHK RSEYAALEVPEYWIVDPLKA KVSVLLLEDGFYEETVFTGT >_0061.003313_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR0153 npun_22dec03_Contig1_revised_geneNpR0153 MRSIGCLMHWLFTGHLRVDK ITVKIAELPASLQGTTLVQL SDFHYDGLRLSEEMLEKAIA LTNEAEPDLILLTGDYVTDD PTPIHQLVHRLKHLQSRCGI YAVLGNHDIYYSHSKAEVTQ ALTSIGVHVLWNEIAYPLGK ELPFVGLADYWSREFYPAPV MNQLDSVIPRIVLSHNPDTA KILQQWRVDLQLSGHTHGGH IVIPGIGPLVFHYKKLLKKI PKKLRCWVTFLLGDCSKVVR YWEWAQGFHKVQENQLYVNR GLGTYKPGRLFCPPEVTVIT LVSH >_0061.002968_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF6064 npun_22dec03_Contig1_revised_geneNpF6064 MAELCKNYATQSMYEAGKSS SRVPKIKPRNNNGGIIIRFQ YQGKQYTLSPGGRHDDKLAI ANAERVASQIKTDILAGYFD YTLEKYQPRVKQADNVVSIN QQIVFNLKDLWEQYKVAKQA RLAETTKKSIWRDVDRMLGK LMANDLLPENACLLVGRLLG FYSASTLERLLIEIEACSNW AFENHLTHTHIWRRLRKQLP ERPKSERSKKAYSRKEVDYI IQAFGGDWYCNLNSAFKDSY YADFIEFLYLTGCRPEDAIA LTWDCVKDNVIVFDKAYSKG ILKSTKNNKARLFPITPQIR QLLDRRAIYVSTLLNKLVFP SQKGNRINLKDFTQRYTKRI IENLVSEGKVKQYLPTYNLR NTSITHYLRQGVDIATVAAL METSEEMINQHYWSPDEDII NNNVQLPEI >_0060.002608_ NMUL_10JAN05_CONTIG15_REVISED_GENE2609 nmul_10jan05_Contig15_revised_gene2609 MDPALDHLYEAAIIETIRRF EATGSPVITDGEQRKYHNFW TYCVDGLENTTPEGFIIPFS AGHSRRMPMLKEGPFRYRIY ADYYLETAMRYTNLPVKQAV ISPSALSLMYPPHEIEGYSR DQFIEDLLNEHETEVRRCLH KGAHKVQIDFTEGRLALKID PTGRLLDSFIDLNNMALSRF NSDELSRIGVHTCPGGDRDS THSADIDYSQLLPKLFQLKA KNFYIAFSGEKDRLRILNMI RAHLKPDQKVFIGVTSPINP EVERPEHICDLILEAAEFIP VEQLGTTDDCGFSPFCDDIS TTRETAFAKIAARVQGTALA SSILSGIAQSCTS >_0060.002136_ NMUL_10JAN05_CONTIG15_REVISED_GENE2137 nmul_10jan05_Contig15_revised_gene2137 MSTSWRIRIMLTIHAKLVEA MLAQAHKDHPFEICGVIAGP EKSNLPLRLIPMRNAAQSET FFKFDPQEQLQVWREMEARG EEPIVIYHSHTHTPAYPSRT DVQYASQPQSHYVIVPTDPA YGEEIRSFRILDGMVTEERI RMINSYKAEWELSMEAMVA >_0058.001544_ YP_209085.1 gi|59802373|ref|YP_209085.1| lipooligosaccharide glycosyl transferase G [Neisseria gonorrhoeae FA 1090] MKLKIDIATNNFKHGGGTER YTLDLVKGLNRQNITPAVYA TKFDHGIPEYAMIEPHLVDQ RRTLKKLRSFLFSSRLAQTR KNSAAKLIACHHADYADLLI CGGTHLGYLHHMAQKPNLLD RLAIRRNRSNYATAKLIVAH SHMMRRELVGLYGVPPERIQ VAPPPADTERFFPQPRETAA LRAKYGFADHETVFLFPSTG HTRKGLELLADFFEQTELPV KLAVAGSPLPRPMKNVVGLG FCTDMPELYRAADFTIMASL YEPFGLVGVESALCGTRVVL SENMACTEVMNEEAGFFFSR QNPETLAQAVAQAVSLKKQG GHRLSDPMRALNYNPSLSHH IDRLTDMLASV >_0054.000289_ NP_988584.1 gi|45359027|ref|NP_988584.1| hypothetical protein MMP1464 [Methanococcus maripaludis S2] MIDMVSKILKKYFDNLDINF ELDSQYISNKIEINPEICIL CNRCLEVCPVTAISSNFPEV PDIDNKCVYCNTCVETCPVD AIKITKTRVRVENGNLIIEN RLKSKKLDYNRKKCVMCLVC TKNCPFEAISESDDTISFNM DKCVLCGHCEEICPAKAIKL E >_0053.000315_ MMAG_12JAN01_CONTIG3650_REVISED_GENE321 mmag_12jan01_Contig3650_revised_gene321 MTEYYDSLETRSLDEREADH FEALPAQIALAKADSSAFAR LLADIDPEEVVDRAALAKLP VTRKSELIDAQQADPPFASL IATPKSELRRVFASPGPIYD PEGWDNDWWRIARALYAAGF RRGDLMHNCFSYHFTPGGVM FETGAHALGCPVFPAGVGNT DQQAKAIADLKPQGYGGTPS FLKIILERAEEMGLDHSSLT KACVSGEALLPPLRQSLKDL GVDVTQCYAT-DLGLI-YET KAREGLIVDEGVIRRDRASG TGDPVPEAKSAKSRHQLQQD YR >_0052.006176_ NP_102513.1 gi|13470944|ref|NP_102513.1| probable transcriptional regulator [Mesorhizobium loti] MTDHSSRPSPPPHIPIDALS EILQDFRLSGVNYGRCELRH PWSIEFPQQSLLRFHFVGQG PCWIHTEAEGWQELRDGDLA LLPQGIAHRLASAPDVAGGS LDDCRVTKLGSNVCEVVREG TGATSTLFCGSMALGACALN PLITLMPPIIKGCDVAGNDP VVGPLLTAMTVEAAQPQMGS ATILSRMADLLTARLIRCWV NCTGASTTGWLAAIRDPHIG RALAAMHRDPGHNWTLESLA SLAGQSRSIFAERFSAVLGE GAARYLARLRMQLARELLGQ NGLSVAEVAARLGYDSEASF ARAFKRITNVSPGVVRRTIP GRMDMNFGL >_0052.002364_ NP_107178.1 gi|13475611|ref|NP_107178.1| transcriptional regulator [Mesorhizobium loti] MANAAPAGWNAHAILEPPAN PAAEPTLVVPEQTPIPCFEF STGDLPPKDQFAAWRQSFAT MLDFVEPGDTEAGFAGTQVI WDLGDLALAQVSTNGLDFTS LAGHVRRDPVDHWLVTLIQD GRSQTITPSKSFDGDAGSVQ VHPLGRVFEGSVTDSEMLLL FVPRDFCRDMSWGLAAAEFS VLRGGMGPLLADYLASLVQR LPTMGVNDLPGLLSATRAML QVCIAPSPDHLTEAHDPISA TLLERARRFVQHHLFQPDLS IEQMTRELGISRSRLYRLFE ASGGIVHYIQHRRLLAAHAA LADPNDRRRILDIADEYGFG DGAEFSRAFRREFGYCPSEV RTGVKNGPSHWHMDEVETAA PGLRLGQLLRRLQAQ >_0052.001932_ NP_106458.1 gi|13474888|ref|NP_106458.1| ferredoxin 2[4Fe-4S] III, fdxB [Mesorhizobium loti] MTGAFVTRDGSPWTPHYLTA IDGATCIGCGRCFKVCSREV MHLHGVDDAGEILGICAGED DDFDGEPNRMVMIVDHAGRC IGCGACGRVCPKNCQTHLAA DQLAA >_0051.001546_ NP_247712.1 gi|15668908|ref|NP_247712.1| coenzyme F420-reducing hydrogenase, alpha subunit [Methanococcus jannaschii] MKIRGFESSMMGKDIDFIPP AMTRLCCLNEISHALAGVMA VEKAYNITVPNEGQYLREIA RLGEIVEVDAIKLREFKNTD DLADIGNKIKSVLGKKAKYL AVGGVLENISDKRKEKLINL AKEGLNLVDKDFVKLVDERK AKIPLPDVELIDAYNFDANK VETNGLPKTALYDGKVVYSG SLARMYKEGLINSKNLWDVL SSRMIEIEFCLNKIIELLNK LKLTHPYMEPIIKDGKAIGE AVIEGGEGIVYHKVELLGRE ILDYTILTSENFNKAVLDSV DNDEAKRIIQLCERCYYL >_0051.000428_ NP_247228.1 gi|15669880|ref|NP_247228.1| conserved hypothetical protein [Methanococcus jannaschii] MYPKRIDIIKKIVENVGEKE IIVSNIGIPSKELYYVKDRE RNFYMLGSMGLASSIGLGLA LNCEDKVIVIDGDGSILMNL GSLSTIGYMNPKNYILVIID NSAYGSTGNQKTHTGKNTNL EEIAKGCGLDTITTESLEEF EKEFKNALNEEKCKVIIAKT IPYNEKCSNIEIPPVVLKYR FMEAIKRS >_0050.002021_ MFLA_01DEC03_CONTIG130_REVISED_GENE2245 mfla_01dec03_Contig130_revised_gene2245 LSIIIGTFRILPMALPSFSY LQCSFAGMDFILRIETALLS PAILSGNRILETMPAPAFCQ AVPAPYFEAIEGNKLYKLAY YLENLASLPPAKRIWSYGSA QSNAMLALSALAQLMGTEFH YVLPYLADALKTQPKGNLAL ALDRGMQLHIDSKLYRRMTQ QAFVPGEHEWIFAEGANHAH VAHGFGYLGAEISEIAGTLG FKQVFLPSGTGSSACHLAAS LSGMEVMTTPVYGDTPYLQQ VFSAMQPQHHPVILTHEPHY RFGSLYRELWELQAELVAET GISFDLLYDLPAWACILQHR ERFKQPWLYLHQGGMKGSAT MRERYVRMFGSERPSKAGAR SEIQGKAEVD >_0049.002700_ NP_786695.1 gi|28379803|ref|NP_786695.1| aldose 1-epimerase [Lactobacillus plantarum WCFS1] MILETKKTVFDQLDGQDVMR YTLVNDHQTRISVLSYGGTW QEFVVNEDGVEHPLIWGLDN MTDYQRVGYCLCQSIGRVAG RIGGAKFKIDDQSYQVDMNE QTHSLHGGNHGFNTLNFEGA FSQTNDSVSVTLKKHINASD DNYPGNMDVAIKFTLDNQDQ VSIAFTGDTDAATLFNPTNH VYWNTTDDRTSLAQQKLQIT SAAHLEFDDEKVPTGKKLAV KGTAYDFNQAQPVEKALDQL KTENGGIEFDDAYEVAPSAA EPIAIVGDTDDKRQVKIYSD RNSLIIFTANPFDADKEDAH QYNALATEAQTLPDAINHPD FGDIVLRPGAPVTHTIRYQY ERLN >_0049.002246_ NP_786059.1 gi|28379167|ref|NP_786059.1| glycosyltransferase (putative) [Lactobacillus plantarum WCFS1] MNRVYVERSDWLFNRQSTML MKILITVENLVMDGVKRAAT VLGNALTSQAEVAFYSLAQP RSFYELSAPLITARRPASAT VLNYFGADPLTVYAPQIDDL LTTIGSGAYDAVILPGGLLT SFAPAIKQRFPRVNVIAWMH NNVDIYLNQYYVQMQDELVA GLLAADTVVTLTDYDWEGYS RFNAHTVKFYNPPTMQAHGQ QADLAQHIIAYTGRIDLQHK GLDYLLTVARALPDDWQIAV AGTGPEDQLATFNRLMAELD VRDRIIYRGALKDTELRAHY QQASVFMMTSRWEGMPLVMG EAMTMGLPIVSMWNTGSAEY LQGGQYGVLTPARDVNALIT GLMPLLTDFDCRAEYAARAR QRSHDFTLSKIVRQWLALLN RQYVPQTVVESNWPADLSSG GH >_0045.001322_ LBUL_20SEP02_SCAFFOLD7_REVISED_GENE1825 lbul_20sep02_Scaffold7_revised_gene1825 MAQSKGTDVVIQLLNQALKA GSTAKYVMFDTWFSNPHQIV QIRQRGLNVIAMVKKSSKIT YEFEGKRMNVKQIFNACKKR RGRSRCLLSVPVKVGDPAKD GAQIDARIVCVRNRSNRKDW IALICTDMTIDENEIIRIYG KRWDIEVFFKTCKSFLKLGT EYHGLSYDALTAHTAFVFLC YMFMSVEKRDDEDDRTIGEL FYCMVDELADITFNHSLQIL VEAMFESVKEIFQPTEEQME RFTNAFISRLPKYMQEAISP SLAA >_0045.001230_ LBUL_20SEP02_SCAFFOLD6_REVISED_GENE1679 lbul_20sep02_Scaffold6_revised_gene1679 VITEKAPGKLYIAGEYAVLE QDCPAILVALDQYVRVSIKP SSSDTGLIHSKQYSQDSIHW VRRGSKIVIDNRDNPFEYIL SAISFTERYCLEQKVKMDVY DLFVNSDLDSADGKKYGLGS SAAVTVATVKAILRFYGVQA SKDLIYKLSTISHYSVQGNG SAGDIAASVYGGWIAYQTFN KLWLKEELASKSLSAVVGEA WPGLKIQPLVPPKGMELLIG WSQQPASTSRLVDKTNANKN NLRTEYAKFLADSRKCVLQM IRGFEEANIALIQKQIGINR RLLQHFAAINNIAIEIPRLT ELIEIANKFGGAAKTSGAGN GDCGIVIVNKDTDIDRLRKE WVKNDILPLEFHIHQSELTY >_0044.001886_ LCAS_20SEP02_SCAFFOLD5_REVISED_GENE3163 lcas_20sep02_Scaffold5_revised_gene3163 LTTKHQILHREIVLDAARTM VAQDGIRDLTFQTLAKELNI CSQSLYNYFPNLPAVIEALG TEFMHNLYQELIENVSGISG KEAIRAFAEVAHRYFERQQS LDEIIYFVHQFPESSPFVQG TGDVINLLKRLIVHTELKQM AKESFVQDFISSVLGFTVLE VMGFLPDNKASRDTSFESLL DMYLNEIKE >_0043.003043_ JANN_22DEC04_CONTIG27_REVISED_GENE3044 jann_22dec04_Contig27_revised_gene3044 MGNTKPPDGAPVAVLPCVCR MGLGHQIDTKSQYGVAMKIL FVHQNFPGQYRELFSWLRAQ AGHELVFLTQRKDMPAIDGA RVITYTPHHKPAKDAYALTQ YWEECAGNGFGCAQAAGKLR GEGFRPDIILGHVGWGELTF LKEIWPNTPIIGYFEYYFLA QGGSVGFDPEFPASPHAPFT MHARNAVNFANIQTVDQGHS PTAWQRDTFPDSFKQKIYVK HDGIRTDQLRPDPTAEVALG RLGRAVTKEDEIFTYMARNM EPTRGFHSFMRALPHILDAR PNARALIIGGSDVSYGKKSG SEGGYRAEMEREVGDAVDWS RVHFLGRVPYGDYQKIIQVS RCHIYLTVPFVLSWSCLEAM SMGATIVASDVAPVREVIEH GKTGLLADFFDPKDIARNVV DVLERPGTYSHLGPAARNHV VQHYDFQTICLPEHLRQINA LVPPAKRIPLPA >_0037.000889_ NP_953468.1 gi|39997517|ref|NP_953468.1| hydrogenase maturation protease [Geobacter sulfurreducens PCA] MVVCVGNDLVADDAVGYEIH RRLAGAPLPAGVRLHYAAVG GIALLDYLTGGEAAMIVVDA VQFGAPAGTIHQLAWDELPN FGGGSISAHSIGLKETMDIG KLLYPERLPPTVTLVGIEGR CFDRTRDAMSPEVASAIEPA VALIRNYLHTLGQGI >_0037.000458_ NP_952558.1 gi|39996607|ref|NP_952558.1| heptosyltransferase family protein, putative [Geobacter sulfurreducens PCA] MLCKMRREGRRVIVITLLEH LGDIVACEPVARYVRSQEPN SFLIWFVSPTYREIVETYAS LDRVRTINCITSWSRLRRSV RFDRVVDLHIDGRVCPTCRI PHGNFDGDRSINLSNYFYYG SLLGSFCKAAGLPALDDQPV LHLPKRVKGCVASLRLAEQY LVIHCNSNELCKDWSSQKWN KLLDRLALDDIKVIEVGLQS DLGRSQSEHYIDLCGKLSLL ETAEVIRGSKLFVGIDSGPA HIANAVGTYGIILLGQYRAF NCYQPYSGAFADGSMASVIY SKSHVRDISVNTVYQEISRV LTLSATLSAVASNNKSRANA >_0036.000378_ EXIG_01APR05_CONTIG262_REVISED_GENE379 exig_01apr05_Contig262_revised_gene379 MAYNEQDHLLVIFPHPDDEA FSSAGTIIEHAENRGPVTYA CLTLGEMGRNMGRPVFTNRE ELATIRKRELIAASEKMKIS DLQMWGLRDKTVEFEDEAML AERIGELIAKTQPTRVISFY PGYAVHPDHEATARAVVRAL RAMDENERPEFLAVAFANNT RDELGEGTFVHDVSAYTDQK IAALQAHASQTGGLMKVISE DSNIRDLLVKERYYHYPL >_0035.001539_ NP_815090.1 gi|29375936|ref|NP_815090.1| transcriptional regulator, Cro/CI family [Enterococcus faecalis V583] MTLISRIKQLAQSKQLTLAQ LERNVGISNGQIRRWDTSSP KVENLLKIADYFSVSLDYLM GRTQQTEINHQAPMKSTQKE LHIHITTEELTEEEITQLEE EANRFLRFRKFEMTNS >_0032.001888_ YP_010559.1 gi|46579751|ref|YP_010559.1| transcriptional regulator, Fur family [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MNAREQLDAHGVEATPRRLA VLEGLAAFPHAPTAPELLEE VRKTVHLDKVTLYRTLDLLV EAGIVGRHPGGDGLLRYCLV HEETRPFHAHFYCTRCRRMS CLPATSTIPPRPEGLAEGMT AERVEVRFDGICNHCGDAPS SGSDSQSGSTTPARRPDTHG APRP >_0030.001035_ DHAF_12NOV03_CONTIG1049_REVISED_GENE1185 dhaf_12nov03_Contig1049_revised_gene1185 MSKDNAQYGILIDYEYCTNC HTCEVACKKELGLPVGQFGI KVLEDGPRENVYGKWSWTYL PLPTELCNLCEDRVKEGRLP TCVHHCQSGIMYYGTLEELS KKLVEKPNMVLFSR >_0028.001557_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE1562 ddes_06jun05_Contig143_revised_geneDde1562 MIEGKGAPALKKILLLAAFL GVWQACCLPSAAQSAAAVFV VEPAHRVVTLTGFTRPRTRL ALVSEVPARCTQVYADVGDP IGEKGVFATLDTTFIRLELQ ANRADQQRLKSDVAYFDKET SRYRNLVGGKHAAQSQLDGI VRDLVTARQQLERLRIDEMT LAERLRRHTISAPSGWRVIE RSVEPGEWVTQGQKLAVIGD FDTLLVPLALSVREFEALKA MERIELTLPDSGTRISAVVE RVSPDFDPATRKINVDLQVT GETGVLRGGIRTELALRMPD EGGAVAVPQSALTRAYEDWF LVRPDGERVRVLVLGPGEQA GTMRVRSAQVHAGDVFLVTP EAG >_0028.000736_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE0737 ddes_06jun05_Contig143_revised_geneDde0737 MSLYFCTPFKPPDHPRPSGD VTIAQDLCAFMRQQGQHVET VPYLPTDRIWRRPAMWPRLA WRRAALGLSLCRRRPGAWFT YHSYWRAPDLLGPAAAAHGV PYFIFAGAHSPKRAHDRASR WGHDKNLAALRAAAQIFVNK HRDLPALRSVLPESRLSFIA PGIRPDDFAFDRSSREELRR QWRVSTASVVLSVAMLRKGV KTEGIKQVISACARLRARGE DIILVVAGDGPEKEALERLA GYELAGNARFTGFVPREALY KYYSAADLFAFPGINEGLGM VYLEAQAAGLPVVAWDHDGA PELVKTGTTGIITPAWDTAA FAHAIGWLATDGRMRRTLGD NARNHVRAQHNLHTNYSRML DIMNTLCGETRPDGTCTHPD FK >_0019.000007_ NP_149271.1 gi|15004811|ref|NP_149271.1| Predicted HTH containing transcriptional regulator [Clostridium acetobutylicum] MVVIKLSSINLIILGYLQHK EKSAYEMVKDFDKWNLTKWL KISNPSIYKNIIKLCDNGYL NSRIVKEGEMPEKTLYSMNE KGNAYFNELMEESSKNIGNV YLEFNAFLVNIENLPKEKRR EYLENFKNKAEERSAFVESV YNYEKQHTEKPGSEFLIMDL YNEFYHILQKWSEKVFDHYN >_0018.002129_ BFUN_06OCT04_CONTIG481_REVISED_GENE2130 bfun_06oct04_Contig481_revised_gene2130 MPGFSITFQVMDRQRGLAII VGMNTLSLAQSVPPHMPAAS RTVGDLLREWRQRRRMSQLL LAAEADISTRHLSFVESGRA VPSREMVMHLAERLDVPLRA RNALLVAAGYAPLFRERQLS DPQLAAAREAVELVLKGHEP YPALAIDRHWTIIAANGALA PLLDGASPELLKPPVNALRL SLHPEGIAASIVNWHAWREH ILARLQRQIDVSGDDTLSAL HEELAAYPAPPGADAAEHDT ATAAQIAVPLRLRTPIGVLS FLSTTTVFGTPVDVTLSELA IEAFFPADQQTTAALRESAG NQRADAVRQP >_0017.001180_ NP_812138.1 gi|29348635|ref|NP_812138.1| hypothetical protein BT3226 [Bacteroides thetaiotaomicron VPI-5482] MKYFCVTEAVALYIQKYVQY RKRKIFFGATGKIEDLVPSI VKHKTEKYLVPMSDVHNDDV KNLLDKNNIQHTEAVMYRTV SNDFTSDEEFDYDMLVFFSP AGVTSLKKNFPDFDQKEIRI GTFGSTTAQAVRDAGLRLDL EAPTVQAPSMTAALDMFIKE NNK >_0013.003299_ NP_878974.1 gi|33591330|ref|NP_878974.1| oxidoreductase [Bordetella pertussis] MMQQEQRILFKNDVGDYAAV AAGEESVLQAGLRQSVPLNY HCASGSCGSCKARLIQGALK VYTGTDFIQVSHSAGQACEC PEVHLCQSHAVSDCVFEALY DKNVPPDLAAPKHYAASLND VRPLGSGLYRLLVDLDDSIR FLPGQYVMLATKAGGRARAY SVANFAQDSRQLEFILSCNP NGAMSPQLCDINNIGMQLQG YGPLGKAYIRPKKDNELVML VGGSGVSVALSTLEWAISSH YIDDRHLTIFWGVRDTSPID LIGVFNRYAAVHSNLRVAVC SDISPSVQDRGRFPYIEFFT GYPADHIVNDASISWEGKEV YISGPPPMVDHTIRQLMINT EIDALDIECDSFV >_0011.003091_ YP_105811.1 gi|53716990|ref|YP_105811.1| transcriptional regulator, LuxR family [Burkholderia mallei ATCC 23344] MSDIAVQEDAAVAASARPAS RANPGGTARDAQTIDVVWCG HARRAADADPRFAPRLGVAE MLASARGTAECGRIATSLLR LMGFATFAYFALEFTRDDAE CLYLHEAFTPAAYRGDYVRR RHHDVDPRMLGTRASSMPVV WDLRQLAREHAARGCDALDG FLRVMHDDDMCSGVMYSMPV PGTRVHAFASFTAPRRSRDW ISSATLEQVLSLGLSVHRFA APQLIASARERAAERLTPFE RELLVGIAEGASDKEIGRRL DTSAHNVDYHLRKLRKRFGV ANRIQLTYFASARGLI >_0011.001188_ YP_102393.1 gi|53725253|ref|YP_102393.1| hypothetical protein BMA0605 [Burkholderia mallei ATCC 23344] MQDLSTELAPELKPELKPEL KLDDPFVDAVHTEFVQLLDA AARADDAHFLQAYDVWMDHC RQHFEREERWMASTKFGPQY CHAADHNEVLKVAADVREKL ARDAQFELGRRLLVEVAEWF DQHVRTMDSMMVSHLKMLNF AMVDVPTEA >_0009.002559_ NP_243215.1 gi|15614912|ref|NP_243215.1| BH2349~unknown conserved protein in others [Bacillus halodurans] MLLEDVLKEYEYHCQARNFT KKTMTNKRQEYNQLKQFLET KRGITELESIYHQDLKSYIR SKQMSGLKPQSIHAIAKQIK AFFNWCVSEEYLKENPMDKV ALPKVPKEVLTGLTTDEVVK MMDSFKGDSYLEIRNKAIIA MMSDCGLRAMEIAGLKECNV RDTDIKVFGKGNKERMVFIS PALKKILLKYERAKKNYFKD KIQYSDSYFLAYQGKAMSTS TTL >_0008.001153_ NP_978715.1 gi|42781468|ref|NP_978715.1| 4'-phosphopantetheinyl transferase, putative [Bacillus cereus ATCC 10987] MIESKVVDSIPILNENDCQI WWGRISDLQSWHYNLLNDVE REKANSYHHSADRARFIIGC VISRLVLGKMLSMSPVQVPI NRMCPVCKLQHGRPQLPEGM PQLSVSHSGEWVVVAFTKSA PVGVDVEQMNPNVDVMKMAE GVLTDIEIAQVMKLPNEQKI EGFLTYWTRKEAVLKATGEG LMIPPVDITISAPNDSPNLL VFKDRQELVENTVMRDLRPS VDYMASIAIFSKEVTEIIQL DAVALLNYK >_0006.004449_ NP_890852.1 gi|33603292|ref|NP_890852.1| putative dioxygenase [Bordetella bronchiseptica RB50] MSTTNAASTEQPLKFAGYRK RDMPAHDAELTSTMPGTPMG EYMRQYWQPVCLSEELRDVP KAIRILGENLVAFRDKSGNI GVMHRHCAHRGASLEFGVIQ QHGIRCCYHGWHYGVDGTLL EAPCEHDDTRLRQTVCQGAY PAFERDGLVFAYMGPPENQP PFPEYDTYTLPKGTKLVPFS NVYPCNWLQVHENIMDHMHT AVLHNHMVVEGVDAEVSAGV TLEGFGDMPVMQWHPTRSGN GIVFVAGRRLPQDRVWVRIT EMNLPNYLQIGSLVPTAARE RHSTSGCTRWHVPVDDTNMI IFGWRHFNAEVDPDGMGCEA DCGVDKIDFLVGQTGNRSYD EAQRAPGDWEALTSQRPIAV HALENPGASDVGVYMFRKLL RDALHGKTKPDPAYAQLLAS GQSLPLYAQDSVLYAKQRED TAEDRKLINELGRKVLDIMR AADGLAAAERDAFVRAELDR IDDGVGKQIVAQALAMEA >_0006.004161_ NP_890349.1 gi|33602789|ref|NP_890349.1| AraC-family regulatory protein [Bordetella bronchiseptica RB50] MRSLYIDAALVRGMPDDCAV LDMTALMRELVVRASTDAPS TRDAPQRHEASDGALDEEYG LLSALMVAELRRLPRCGLDL PLPESAALRGLCERALADLS AFDSAQAQARRLKLSARTLY RRFLDETGLSFARWIQQARL LDAVRRLGDGQAVTDVALDL GYQSPSAFTAMFNRALGCSP REWRRLGEGRE >_0005.004734_ YP_324071.1 gi|75909775|ref|YP_324071.1| LmbE-like protein [Anabaena variabilis ATCC 29413] MEPKLSHKIYLKELGKLIPI VWLEKVQYIHSWLLLKWLLF CNSKPLLSNQKSAMIFSPHQ DDETFGCGGMIAYKREHGIP VVVVFITDGKGSRSLDLDCQ NQIVQTRKQEAISALGILGV ESSHIHFLSKPDGCLPELSN EEHQQTIREISELIQHYQPE EIYVPHRKDCHRDHEATYNL VKTVIREANIPVEVFQYPVW LFWRAPLFILLKIQDIAAAY RFSITSVQDKKKRAIASYSS QIENLPRGFIKQFLNSHEIF FKSEL >_0005.003164_ YP_321162.1 gi|75906866|ref|YP_321162.1| Protein of unknown function DUF820 [Anabaena variabilis ATCC 29413] MFTISDLEQLQAEHPEWQME LVDGSILVMGPSDYISEEIG VEFARQLANWVRPRKLGRVT GSSAGFILPRLETENGIEAD IEKRNLRAPDVSFVRAERLK ISKRDFVELVPDLMVEIKSK SDHIKPLEEKIQLFLQLGCT VGILIDPDKLTLTVYRINQE PVILQNNDKLTLPDLLPGWE LTVSEIWPPVFE >_0004.002311_ 17740837 gi|17740837|gb|AAL43343.1| transcriptional regulator, MarR family [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:c2329523-2329017, Atu2354 MALRLAHSNGPRTEKSESGS AEAAHNTAIDYGLLTTAISY HLRHSQLAVANGFADVLAEQ GLRPADFSVLVIVGGNPGLK QSDVAEALGIQRANFVAIVD SMEEKGLLVRRKSEEDRRVH FLDMTEEGSSLLDRLSNTWR DREEKLIDRIGGKKARDQLL ALLGRLRD >_0004.000482_ 17738843 gi|17738843|gb|AAL41514.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58 (Dupont)] (gb|AE008688|:482512-483240, Atu0495 MLSRLTLSDRFSNGFECLGS ETAIHGVAAWCDPLGGLYLP DLSLLVVSDLHLEKGAAFAR RGRMLPPYDTIATLKILSSL VSRYDPKIVVSLGDNFHDRV GSQHLPLPLRELIREMARGR EWIWINGNHDPDGTVDLPGS SVDEMFYGNLVFRHEPKVGE AAGEIAGHLHPSATVRRREK TVRRPCFATDGSRLLMPAFG VMSGGLDLRHQAMRGLFDHT ALVAHLMGRDRIYSVRFSNL LG >_0003.004029_ ARTH_26JUL04_CONTIG47_REVISED_GENE4035 arth_26jul04_Contig47_revised_gene4035 MEFPAFPDSAHPITWPEGFR AAASFTFDVDAESCTIAHDP TSTRRMSLMTHQSYGPKVAV PRLLQILQRQDIQATFFIPG FTAESYPDVVRRIVDAGHEI AHHGYLHEPMQGIDAATEAR YIDRGLEALAKVAGVRPIGY RAPWWELNWHSPALLADRGF LYDSSLLDGDAPYRFSVGEG DSRDLVEIPVDWALDDWEQY AFYPGVTGSGVIESPAKVLE MWTLEAEAHHAAGSCFVLTN HPFISGRPSKAVALEQLISR VKDMDGMWVTTMAGIAQHIR DTVTEVHSHARIEVPGFPDA GARFTPAQVRQPALAAEPAG R >_0003.001808_ ARTH_26JUL04_CONTIG41_REVISED_GENE1813 arth_26jul04_Contig41_revised_gene1813 MQIGVFSVSDITTDPTTGRT PTEHERIKASVAIAKKVEEI GMDVYAIGEHHNRPFFSSSP TTTLAYIAAQTERIILSTAT TLITTNDPVKIAEDFAMLQH LADGRVDLVMGRGNTAPVYP WFGKNIQDGIELAIENYSLL RRLWDEDTVNWSGKHRTPLQ NFTSTPRPLDGVAPFVWHGS IRTPQIAEVAAYYGDGFFAN NIFWPKEHYQQLIGLYRERY EHYGHGKADQAIVGLGGQFF MRKNSQDAVKEFRPYFDNAP VYGHGPSLEDFTSQTPLTVG SPQEVIEKTLTFREYFGDYQ RQLFLIDHAGLPLKTVLEQL DLFGEEVLPVLRREYAALTP AHVPEPPTHAGRVAARMAAQ VQEDSLTKPTAQDA >_0002.000879_ NP_148058.1 gi|14601518|ref|NP_148058.1| 3-phosphonopyruvate decarboxylase [Aeropyrum pernix] MSLLTLAFNPQPRAKTYCSG PILRVSGVWAVKILYIVLDG AADSPTSPRKTLEEASKPNI DSLGSHAVCGMVYTVKPGVA PQSDYATLSLLGYNPDEYYP GRGPLEAFGAGIEMRRGDIA LRANFATVDPGTLRIIDRRV GRSLTSREARELASAVDGME LEDGEGTALFRATIGHRGVL VLRHRSKPLSDAISNTDPAY ERRGRFSVALEKYEPFIKLS NPLVEDEAAVLAARMLNEFT LKAVEILDSHPVNLAREKRG LLKANAILSRDAGGLPEEKP PSFQERFGLRGASIVEMVVE RGISRYIGLDDIRVEIEGRA REEVYREEAARAVEALETHD LVYVHLKGPDEPGHDGSFEG KIRAVEDIDKHFFAPLLDRL SSAGLEPAFVVTSDHATPWD VGAHSGDPVPLMISHQSIQG SIGKFSETVCLRGRLGTIIG GYRIIPKTLSLLAG >_0001.001556_ NP_070625.1 gi|11499386|ref|NP_070625.1| phosphate regulatory protein, putative [Archaeoglobus fulgidus DSM 4304] MLENRKVYFSGKSSYILTLP KRWAMENGIEPGSEVVLSVG KNFITIYPKNPGLKIKRVEI DAKDVKHEALLRRIISYYLA GVDSLRIRVYDEEQRAAISR ASESLMGVEIVEDTGKEVIL EIFLDNSRLKPEGIIRRMGN TCVGMVSDFCTALRKYDRYL CSSIIFREREVDRLYFLLLR QLKQATLHADVAAQLDIPIE TIQEYRTVVRTLERVADHAA NMAESLIELGKPVDYLCDFV EIDLEMLRTAMVAFIENELE LAEVVLEEFDDIQDIERKYY SSILNEDVEEALNLKSIVDS LSRIAGYSADIAEVVVNMVV V >_0000.002643_ ADEH_23AUG04_CONTIG87_REVISED_GENE2646 adeh_23aug04_Contig87_revised_gene2646 MAEGEARMLLRVIDYVANPG GGVRFTVEMLRGLHRMYGPQ IELVSHGEALQRYASLLSRD RGVRLVDVPPVEAWRSRTLM AGIPGAGPLNLLLGTARFHH AVAGDLLEGADVVWFPWLHR HRIPWARAGRVVATLHDVIT LQFPEIAPAWIRRDERGTVR AWLASGARIAITSNATAATL HSMFGTDAERLEVIPLSGSH ERPAAPVTRGDLAFKGREYL LAPINITPHKNAEVLLRALG RTGARHPLVLTGSGMDLWPP RSTRARKLTELAEASGFVRD ASLFAPGYVDDRAYEQLLDE AWALVMPTLAEGGGSFPVLE AMTAGIPVIASDIPVMREMV ERTGGQVLWFDPRDPADLAA KLAELEARYPEYRGRAERQA KTLRLRTWDDVAAGYARLLG LT >_0000.000262_ ADEH_23AUG04_CONTIG40_REVISED_GENE264 adeh_23aug04_Contig40_revised_gene264 MDGSVATRRIGPPRAGIAER IWDAARAEFSKRGYHGARVQ GIARGASCNVALIYRHWASK RALYLDILRAVWLSAANELS RFVENGSGGPEAVIAAYLDA MMHDAMGAQILVREYLDGAP YLSQLTTAEPALLDPLRRAA AVIAEGTDGGVDPVLAVVTV GGLAALVASSREAMLPMVGQ QLAPEAWRAHVLHVLANGLA PRAGGPRG >_0118.002204_ NP_266594.1 gi|15672420|ref|NP_266594.1| prophage pi1 protein 03 [Lactococcus lactis subsp. lactis Il1403] MKKIRLPEMIDYFRKENGWT MKEFGEKLGKSESAISKWIK GVRSPMVEDFDKMVNLFNTD PDTLMYGASDLSTTLSEINK ISSQLEEPRQKVVLNTATNQ LDEQNQEKKKESKVIPINKI PDDLPPYISRKILENFVMPT NTMEYEPDEDMVDVPILGRI AAGLPLDAVENFDGTRPVPA HFLSSARDYYWLMVDGHSME PKIPFGSYVLIEAVPDVTDG TIGAVLFQDDCQATLKKVYH EIDCLRLVSINKEFKDQFAT QDNPAAVIGQAVKVEIDL >_0116.001589_ YP_194668.1 gi|58338083|ref|YP_194668.1| transcriptional regulator [Lactobacillus acidophilus NCFM] MARRKEFSKDKILDTAYKMA IKDGIEGLTARSIAKAGHFS TQPLYLEFDNMDDLRAQVLE RISADLRNHTLQQEFTGEPL IDLDLSYIEFAKTHVNLFRA MFADGKFGSKVIADTLLGLG TEKFKEQYPDTNYDEDKIRN IVIANWISTTGMAALVVNKI ASFSQNQIINVLNAQIHDAM LNDHLSETQENPMFAADEEA SLKDNLA >_0113.002083_ YP_001088182.1 gi|126699285|ref|YP_001088182.1| putative flavodoxin [Clostridium difficile 630] MNTIIIYSSKYGCTKDCANI LKNKLSDNVTFVDINNNNNN KIELSKFDKIIIGSSIYVGS VSKKIQVLCNDNVELLNKKQ VGIFLCCGFSEQADKYLKSN FPSSLLESANAIGIFGSEAR LEKMKFLDKLIMKAVSKGNY DSFRISQDNIDNFLINLNS >_0112.000789_ YP_909581.1 gi|119025736|ref|YP_909581.1| hypothetical protein BAD_0718 [Bifidobacterium adolescentis ATCC 15703] MTDALFLLDTDHDDVPVNSD ELNAGWRLTLPSSVRRHAIQ AMRLKDGDELQLSDGRGLRI HAVLRDVQQGIAEVLRFGKE PQPVTRLALVQALAKTGHDE QAIDMATQIGVDQVIPWQAD RSIAKWKAGRTDKKWRQVLE SATEQSRRSWTPQLDDCVTS KQVVAICKRACVHGDMVIVL HQDATKTWEQLETGIAKLSD TCLKDGRPRTIYVLVGPEGG ISDAEVEAFTKAGAEVCVLG SNILRASTAGPVALSLLARA LGRFV >_0111.002418_ YP_213855.1 gi|60683711|ref|YP_213855.1| putative glycosyltransferase [Bacteroides fragilis NCTC 9343] MNILIQQTKAFPHRANSFYW FYAQLTEWLTAHGVDSKLYF SYLELADSEFENGLLLPDHY SQFYTPRNIEAICRFIIDKQ IDVILDYSHVITGDTRKYYL EIKKRNPGIKICTMIHNCPS HTTQLKQYELSTLRFKDVHG PKKLFQWMLPQLYISLLKKV VSHQNRSAYDTLDEVVLLSP AYIPEFKKLIGKKDAWKLSA IPNAIKPVHSNIPIEEKDKE IIFVGRMATEKALPKLLKIW GMVQDKLPDWKLTLVGDGPQ FGTCRQIIAEKKLKRVCLTG HQMSIPYIDRARILCLTSVI EGLPTVFTEAMSLGVIPIGF DSFNAIYDMIDDGIDGFIIP DNNYEQYAETILRLAQNDTL RCRIAYKAQKRKNRYDIEQV GPLWMETFRKHGLIK >_0110.000433_ YP_001298825.1 gi|150004081|ref|YP_001298825.1| hypothetical protein BVU_1516 [Bacteroides vulgatus ATCC 8482] MIMTIVEGFDRIEKKLDRMG RVKDFMNGDELLDNYDIARL LNVSLRTVARYREKGLIRYY QTDDNGKNFYRSSEIQEFLL KRGKKK >_0106.001700_ SWOL_07JAN05_CONTIG134_REVISED_GENE1701 swol_07jan05_Contig134_revised_gene1701 MSVSGINFVCAGRDELEIPR EIVEQGGFHFPDLHSDSQEM ARLALAIKHNKQNSICMLPF CLTVEAEALGARVNLGNASS GPRISSYSYESIEQLKDLRM VDFNSGRIKSVLDAVETLTQ SGEVVALNVSGPFTIITSLF DPMVLYKAVKKNREGIDDVL RIIEGIILRYILEAIKMGAR IISYADAVGTADMVGPKIYK DIAGIASLNLLKRLQTSDEL DGCFVHICGKTSVSLENHGF ARSYPIEVSSNQTYGQAITE LLYKSQGSRFIGHNCMKRTV NKLKDNTVWRIEF >_0105.002007_ YP_054992.1 gi|50841765|ref|YP_054992.1| putative NADH dehydrogenase [Propionibacterium acnes KPA171202] MRHPQRVFSSMRKIMFVIGS MPLTHPSQSADGDPGKKYEV TRLDLGHLHPTRSGLVTIAT TVDDDVIISSQVNVGTLHRG DEKLFEVRDYRQIPMLASRH DWTAPFIGETGAAHAIEDAM GITIPTRVAWIRTLLAEFSR ITSHFTFLSWVGHHCDDAGL ENAIATAIENARRIWQRCSG NRIHPMITRIGGLRIHPESE WQPALGEWLDDANALVTRLR TALDATVGVRGVAVIPTNMI DRYGLSGPVSRASGVDLDTR RRTAVYDELKWPEVVIEPRG DAYSRLATLCNDAAVSSSLC RQILDRLPSLDGPLETRLPT AIKAPRGRTWTTIEALWGRA GYLLESRGSTTPWRMALRTP TFANVQVLEAVLPGTRVDDV EATIASLGWTLGDLDK >_0105.001925_ YP_056916.1 gi|50843689|ref|YP_056916.1| putative MarR-family transcriptional regulator [Propionibacterium acnes KPA171202] MCLSPLILSDKGGTVTEDSA VWLDDHQQRVWRLWLEVSTT LPAVLNRQLSRDSQLSVQDF EVLVRLSETDGGDMRVVALA EHMGWERSRLSHHLTRMEKR GLIERRVCCSDGRGAIIHLT DQGTAALQQAVPGHAALIRR VVFGGLDCGHLNDLEAILSR ICQSLAEEGKVNPTDHS >_0105.001081_ YP_055544.1 gi|50842317|ref|YP_055544.1| putative TetR-family transcriptional regulator [Propionibacterium acnes KPA171202] MASPTAIRTQGVIRRAAIEL AIEKPVDDVTVDDIAERAGV SRRTVFNHFPSKYDVYLPPL VTYPQEVLDAFATDVETPLA TLIRDLLEARWRATNVDFDD LCILMKIGRESSELRMAFKE AVEGQRAALVAASARRTGKS EDSLEVLALVGTVQAMERCA VQSVVARPGGSSAEVADTFD RVVDVWARLSAELSADMSST KP >_0104.001179_ NP_638326.1 gi|21232409|ref|NP_638326.1| phage-related lytic enzyme [Xanthomonas campestris pv. campestris str. ATCC 33913] MMFTDTQLASIMQCSPQRAQ RWHGPLLAAANRFGITTKRR AAHWLGQVGHESLSLSRMEE GLTYTTSARLLEVFGTRITP AQAPKFLRNPVGLANFVYAD RLGNGNTASGDGHRYRGRGP MQHTFRGNYRRIGELIGLPV EEQPDLLLQVEPSALGAAAY WHDNGLNVLADTGDVLGLGR KINLGNVRAKRLPEGHSDRV TRTQRALQILGVS >_0104.000516_ NP_637049.1 gi|21231132|ref|NP_637049.1| glycosyltransferase [Xanthomonas campestris pv. campestris str. ATCC 33913] MHVAHLNLLPAPSGWDAVQV FAQWPSLADIAEAVASTGTQ VTVLQVAAHTEQLTRQGVAY HFLDPGKRGSAVRARRLADL LQQLDVAVIHVHGLEFAGDA RRLARLLPQVPIVLQDHANR PPRWWRRPVWRARYAAAAGI AFTSLALAQPFVRARLFGPR TQLFAIPESSSRFTPGDRIQ ARAVTGLAGDPCVLWVGHLS EGKDPLCVLDAVAMAAARLP GLRLWCAFHEAPLLEAVRQR IHDDPRLASRVHLLGKVTHA QIQQLQRAADVFVSASHAES CGYAALEAYACGTLPLLSDI PAFRALSDNASMGALFPVGD AARLSDLLVAHARQRPGRAR VRAYFDARLSFAAVGRQWRD AYTQVLRTAAGQRQ >_0102.001253_ NP_110560.1 gi|13540872|ref|NP_110560.1| Predicted regulator of amino acid metabolism [Thermoplasma volcanium GSS1] MLLILSEYFKEHPIKQRIIE GLYRSGISVKEGRFFANDVE ISISEIAKTFGVNRKTVYDT VKLVEGNSHLKRVMESIMPM ADVSNVALLTGNQIITVYTT LGHYPTVWKDVFMAISKYGC YIREIFSRNMSQDESFIRII FYRPIPQKIIDQIKAIDGAR NVEVKVSSDVDEILCRTCDI KICPTKYISDVEESKEDQV >_0102.000959_ NP_111784.1 gi|13542096|ref|NP_111784.1| Predicted transcription regulator [Thermoplasma volcanium GSS1] MNSDPTKERILHGLITLYLL KELVRGPMHGYELQKTLSSV IGNPLPQGSIYVLLKTMKER NFVTSENAKNEKGQTLTKYY ITEEGKKFLCSHSESLLIAR KIIDDLLKTVDLIDEE >_0102.000616_ NP_110590.1 gi|13540902|ref|NP_110590.1| Threonine synthase [Thermoplasma volcanium GSS1] MEELLNSETKPPGRTPTIKA YALGKELRLNNLFLKFEGAN ATGTQKDRISEAHVLNAIRD GYNGIAVGTCGNYGASIAYY ANLYGIKSYIGMPSAYTHDR EDFMLSSNAEILYYNGKYEE TVSFIHDYATDNSLYDASPG SDHSYIDTKMYAGIAFEIYE AILKVPDYVIVPVGNGTTLS GIYEGFHLMYLNGTIDRMPR MVGVSTRGGNPIVSSWKRGY KKVVDLDPARIRETAVNEPL VSYRSFDGDKALAAIRETHG KALYVSDDEMVRFSELMYRE EGLNPLPASASAVAALVHLH LNPDDYVVSVITGRRM >_0102.000251_ NP_111975.1 gi|13542287|ref|NP_111975.1| Ni,Fe-hydrogenase III large subunit [Thermoplasma volcanium GSS1] MKYYRFGKPEGRFVGNTEKY SVYEHILSDQRREIPSESYR NLPGEFTFNYGPATGGLIES VAFDFHTPGELIKHVDVYPY IKTRKIKVRGLSPEDALLLV ERINGFHAASHAVAFEMAVE DALNIDVPEEVQYSRIIMLE LERMRSNLEVVKRLCEPAGF GVPQNQIAYLRENIARIITS FAGHRYFFSSSYVGGCQFNG LVLESVIEDVKEEFEGIFND LLQSKIFLNRLQNNGKISSS DMLGPAARAAGIEVDARIDE KRLPYSKLGFKPIVFDEPDA FGRFYVRSKEILSSADLISK AIRFSGNSPPAKIEKTNGEG AARVESPQGDLFYYVKINDG KISDVQISSPSLLNIEAFKK SMIGNIFTDYHFNWESFGIW VSELAVVIQ >_0099.000387_ TFUS_04MAR05_CONTIG93_REVISED_GENE1347 tfus_04mar05_Contig93_revised_gene1347 MDVRPLVSALDRALCTDPRL AELPGRFLFGLDDGRGDLAG MPIDLGIYAIDERSAQIRVG TFSGPVISSTEIVDIILDLA HRFLDISEGKWNVVQLPKKG RELLNNVEFSNFLRHTKSGD RWSVERFRRNILHGIITIST AESAVAGSVPLGVLTPPMQR ALLDAAEHSDSTIILTPWRG VIVTPVPTDQASNVAALLQE AGLVLDPASPWSRISACIGS PGCARSRGDTRSQAVKLVRR LSSPDTSDDGSPYHIAGCER ACGAPHTPHTLVLLGSST >_0095.001375_ NP_462677.1 gi|16767062|ref|NP_462677.1| putative helix-turn-helix protein [Salmonella typhimurium LT2] MSAKTKFKSPAFEPIHSAAS GLFSVDAIPQETMRSFDTAC LSSIKDLQPLEIKALREKLN VSQPVFARYLNTSVSTVQKW ESGAKRPSGMSLKLLNVVQK HGLKVLV >_0093.002557_ NP_341886.1 gi|15897281|ref|NP_341886.1| Transcription termination antitermination factor (nusG) [Sulfolobus solfataricus] MGCNFVEKPNMRNYYAIKVV GGQEINVALMLEERIKTNNI KGIYAIIVPPNLKGYVVLEA EGLHIVKPLIAGIRNARGLA QGLLPRDEILKIVSRKTVGP TVKPGDVVEVISGPFRGTQA QVIRVEEAKGEVVLNILESA FPLQVTVPLDQIKVSKR >_0093.002539_ NP_344516.1 gi|15899911|ref|NP_344516.1| Acetolactate synthase large subunit homolog (ilvB-6) [Sulfolobus solfataricus] MIKYLVMNAGRLFLSLLKES GVNKIFIVSGTDYASLIEAK VEDSSLPEFEIVPHEITAIS TAIGYALGNKLSAVAVHTTP GTANALGGIMSAFTSRIPLL VIAGRSPYTEKGNTASRNLR IHWTQEARDQGELVRQYVKY DFEIRMADQLPAVVSRAIQI MMSEPRGPVYIVLPREVSIQ EVNEARRIPMDYYEPAPSPD KINKAKEMLEKSERPLIITW RAGRRKEWFESLRRFADNYN IPVLNYAGEVLNYPSSGPMA LDRFDLRNSDLLLVVEAEVP YFPKKIDLDIPIVKIDVDPS YSYIPYYGFRCDLCIQSTPS NFFDYISIRPKSYDEIKELR AKQEEYKKQEIERLKDKKPI HPKYLSYEIGIVASEYNLVI FNEYQFNPRYARLNEFGSYF ADLSVGYLGFALGAGVGYKI ATNKDVLITTGDGSFIFGVP EAFYYVASKYPTMVVIYDNG GWLASAEAVDEVFPEGLAKS KKYYPGADFDRRFEIGKTVE TFHGYYELVEDPWEIKPALI RGLEKMRRENKIAVIQVIVD KVR >_0092.000488_ NP_268850.1 gi|15674676|ref|NP_268850.1| putative protease [Streptococcus pyogenes] MEKIIITATAESIEQVKALL AAGVDRIYVGEANYGLRLPH NFSYDELRQIAKLVHDAGKE LTVACNALMHQDMMDQIKPF LDLMIEIAVDYLVVGDAGVF YVNKRDGYNFKLIYDTSVFV TSSRQVNFWGQHGAVESVLA REIPSAELFTLAENLEFPAE VLVYGASVIHHSKRPLLENY YHFTKIDDEVSRERGLFLAE PGDASSHYSIYEDNHGTHIF INNDIDMMSKLGELYAHGLT HWKLDGIYCPGDDFVAITKL FIQAKTLLEAGQFTQEEAEK LDQAVHAHHPAGRGLDTGFY EFDPKTVK >_0090.001423_ YP_168774.1 gi|56698401|ref|YP_168774.1| transcriptional regulator, MarR family [Silicibacter pomeroyi DSS-3] MTHETDQLYQAVQATRPLLR NITAAVERGTLREGVTVGQR AILEGLSLTPGATAPQLGAA LQMKRQYISRILQEVQRAGL IERRTNPEHARSHRYWLTPR GEAIITAIRADEMAKLALFS EGFSSVELTAYHKVQLALTR FFADLAKEA >_0086.001141_ ROSE_TM1040_30MAR04_CONTIG49_REVISED_GENE1195 rose_tm1040_30mar04_Contig49_revised_gene1195 MTGGHHRVVPLCAKAQTALT HDISNARRFLMAQAALQPQN ACVLLSFSDGTTAQYPYIWL RDNDPEGFHPDTQERITDLS AISPDITVADVELNDSQLLI HWEGADSATSRFDLDWLRSY VPGTRTADPARTGFQHWRCD LGAGGIPRATAQEILSSDLA LRTWLEQTQIYGISIVEGLA DSTEAGMDVARRIGFLRQTN FGVTFEVKSKPNPNNLAYTP IALPLHTDLTNQELPPGFQF LHCLANEARGGGSLFCDGYA IAEDLRRDDPESFELLSTVS VPFRFHDQDTDIRNRKKVIT LDEDGRVIEICFNAHLADIF DLEPALMQRYYRAYRKFMIL TRSTNYLVTLKLKGGEMVVF DNRRVLHGREAFDPQTGYRH LHGCYVDRGEFESRLRVLHR GQ >_0086.000393_ ROSE_TM1040_30MAR04_CONTIG45_REVISED_GENE394 rose_tm1040_30mar04_Contig45_revised_gene394 LGRRFRGESHHKVDSKGRVS IPASFRRVLEASDPNWQPGD APELVIVYGDHRRQYLECYT MEAIEEVDAKIAALPRGSKG RKILERIFNGQSLPTTVDET GRLVLPAKLRQKIDLDKEAF FIASGDTFQIWKPETYEEVE MAEAEKLMDELPDDFDPLEF LDGAGGA >_0085.002706_ SHEW_20DEC04_CONTIG154_REVISED_GENE2708 shew_20dec04_Contig154_revised_gene2708 MSQVNRNSKAKGVNRRQFLA TTAKAGCAMGLLGLGVSGVA KQSSHLDPMAIRPPGALDEQ DFLSACVRCGLCVQACPYDT LKLARWFEGAPTGTPYFTAR TVPCEMCEDIPCVKACPSGS LDHALTNIDEAKMGIAVLID EKNCLNFKGLRCDVCYRICP LIDNAITLERQHNQRSDHHA MFLPTVNSDTCTGCGKCEHA CVLEEAAIKVLPAHIALGKT ATHDTYINTEETTLEMLNKG LTL >_0081.000345_ SDEN_20JUL04_CONTIG104_REVISED_GENE346 sden_20jul04_Contig104_revised_gene346 MSDIGATEIGIDTLGSTSIG SNAVSSVIIGLVNPKTPVNV GGIMRAAGCYRVDSVCYTGR RYELAAKSGDAQYDVDTKDA AKTIPLTGVESLLDQVPVGA KIICVDLVVGATPLPHFVHP EHAFYIFGPEDGTIPQVLID AAHEVVYVPTVGCMNLAASV NVLLYDRLAKSAQMLAGDEL IKQSRDNNNRTKVKHWRNKE >_0079.003423_ SBAL_17SEP04_CONTIG245_REVISED_GENE3426 sbal_17sep04_Contig245_revised_gene3426 MVISQLLVEENMTQLLKNWL TQGPFAQQLISFNHHDIVTG ALFTNQVAYFYEQLLAAPEK KWLLASDTSDLFAVGLCAAL LAGKEIILPPNTQTGTLSEL THQFDGILSDKPLCECQAFV LLKKELSLPNKPWPASEAIG ELVLFTSGSSGEPKAIRKTL EQLDVEVSVLEHTFAEHLPH CSVVSTVSHQHIYGLLFKIL WPLAASRPFLSEQIEYPETL SYYTALLPNLCLISSPAQLS RLPKALEHERQLRSPSLVFS SGGPLSFAAAKGVNQCYGHL PIEIFGSTETGGIAYRRQHE ADEPWQVFDRISIDQDPTDG ALLLKSPYLADDEWLRCEDK IEPTENGQFRLKGRLDRIVK IEEKRLSLAQMETLMSSHAY VEQAALVVLPQFKSQLGAVV TLSELGKNVLQEQGKLSINN ALKAHLLTQFERVTLPRRWR YPDILPLNTQGKRVTAQLVE LFDHD >_0079.002995_ SBAL_17SEP04_CONTIG240_REVISED_GENE2998 sbal_17sep04_Contig240_revised_gene2998 MKKKFATIFNNFENVHLTKD VGMIPCAFTMLEGYDKSIIF YWDKKGKEEKIELNEVILHP IKARFRLFYYLKLFIEIFLE NVTAINLYHDSIQTALFCYV AKLINVKVYLKLDLGDRGCE ALLKRKYSKNFIDKMRLVGL NLATCISTETLWIFDNLNKN NVFKSDRFLQIPNAILESTV KSEPKPYCQRFNRIVVVGRV GAFEKNHELILNALTEIKML NGWTIDFVGPIEESFNEKID LFYKNNESYINIVRFIGNKN RDELMDIYSTSKVFLLSSLW EGFSLAMVEAAYLGCYIIST KVGGVLEVTDNGSLGNIYEK EELAEKLMQLDDSVVGAGYD KRLTFCQDTFKLGKYSKNIA KRLG >_0076.001908_ SAMA_14OCT04_CONTIG72_REVISED_GENE1911 sama_14oct04_Contig72_revised_gene1911 MTLFFLDNQRFHAESGETVL SALKRVGHPINYSCTKGQCR SCLLRLDEGKIAPKAQKGLE PDLKAQQLVYACQCVAKNGM KLSSPAEDTYVSARLLDKLS LSEDVTRLIIKPDTRTPYIP GLCTGIRGPEGLGRTYGIAP GQGEGTFAVHVRRKRNGRFS RWLCDEVKVGGSLSLTRPWG RCNYDGAYGNDELIVVAFGT GIGPALGLVQSALDSGHQRP ISLYHWGKYLDDLYLHRTLL KLMLEHKVLHYQGLISASSD ADRIDNHRVRLMDVPQILKE RHELGRDKRLFLFGEPDMVA KVTEYAFLCALDMDRIHAQS FEYRDLRRRPRNQDGQ >_0074.002349_ RRUB_10JAN05_CONTIG98_REVISED_GENE3112 rrub_10jan05_Contig98_revised_gene3112 MNGRETMPESDTRTIHHVDA HVGQRVRQRRTALILDQETL ARRIGVSFQQIQKYERGRNR ISASRLYDIAKALAVPIDYF FSDLERGDPRHDGALAEDMG RLAQGGSAPPDPLRLTQSLD LAQAFWALPDDGMRQSFIAL LKAMSSFED >_0074.001834_ RRUB_10JAN05_CONTIG98_REVISED_GENE265 rrub_10jan05_Contig98_revised_gene265 MDTVSVFTRVGGPTNRASEA DRHVGKRIRERRVMLGLSQQ QMADMIGVTYQQAHKYERGI NRISAGRLFEIAQVLHVPVN YFFDGLDDEASETLSPRQRM CLELARNFAMIQNERHQEAL SQLARALAAQVSVVEVIEEC QAQRA >_0073.006290_ YP_298265.1 gi|73537898|ref|YP_298265.1| regulatory protein, ArsR [Ralstonia eutropha JMP134] MVRESPKQRAPDSASAKPEL DVDAIHKALANPMRRKILAW LRTPEAYFSAQEHPLDMGVC AGLIDTHCGLSQSTVSAHLA TLQRAGLIRARRIGQWVFFK RDEETIQAFLDHIHQDL >_0073.005949_ YP_299832.1 gi|73539465|ref|YP_299832.1| regulatory protein, TetR [Ralstonia eutropha JMP134] METSTASAGEDTPVKASRRL LPEEREQQIVEKAIEHFTRN GFGGSTRELARQIGVTQPLL YRYFQSKDALIERVYNEVFQ WRPGWEGQIADRSLPLTERL HAFYLDYSSVILREEWIRLF IFAGLTHEGINKKYLSKLRS KVFLPVLAEVRAEFGIPAPR NAAETEAEIEMIWSLHAAIF YIGVRKWIYGLKVPADMDAL IRRQVDMFLNGAAAAIRAMR AGTP >_0073.005659_ YP_299261.1 gi|73538894|ref|YP_299261.1| hypothetical protein Reut_B5069 [Ralstonia eutropha JMP134] MPATNNKARQVMVYGCARYG QPDSHQAMTLEQIGRNVATI GGYGFSGAYEHSATARTPAG GTAAPYLVPLDTVVGDAQAQ ALHLRSADDLFGGVVPFAFV ATKVISHGLVAPDAVAPAGW NPDFAQRAGAAVLPGFTAFS LEDARSAGLRLMAGGVVRLK DPCGIGGLGQQTVDSEAALD AALAAMSATDIESHGLVLER DLDSPDTYSVGQILVAGVLA SYCGTQDLTVNNTGEEVYGG STLAVVRGGFDALLAQPFGP RALSAIRAAMAYHDAALACY PGMVLSRCNYDVAFGVPVGA TGEGAARALTGVLEQSWRVG GATGAELAALRALKADPARE RVVAATREVYGPDAHLPAGA ELYFQGEDRHVGLLTKYAYL ESDNDGHP >_0071.003444_ NP_744479.1 gi|26989054|ref|NP_744479.1| hypothetical protein PP2330 [Pseudomonas putida KT2440] MFGLIKTWKALEARGIMGIN RRNADYVLKYNKRNLYPIVD DKIITKERALAAGIHVPEMY GIIETEKEIEKLDQIIGGRS DFVIKPAQGAGGDGILVIAD RFEDRYRTVSGKIISHEEIE HQISSILTGLYSLGGHRDRA LIEYRVTPDQIFKSISYEGV PDIRIIVLMGYPVMAMLRLP TRQSGGKANLHQGAIGVGVD LATGVTLRGTWLNNIISKHP DTTNAVDGVQLPNWDGFMKL AAGCYELCGLGYIGVDMVLD QDKGPLILELNARPGLNIQI ANDCGLTQRTHAIEAHIEAL AKEGVTEDAEQRVRVAQGLF GHVPNR >_0071.002963_ NP_743401.1 gi|26987976|ref|NP_743401.1| hypothetical protein PP1241 [Pseudomonas putida KT2440] MISKKISPQDMPILDLDLSK TKRFEASRFLDSPETIAAFL AEAMKANDAQTLMHALGEVA KAKGVNQFAQDAGVNRESLY KTLKGEEKTRFTTIQKLMVA LGVELTVRPLEKLPGS >_0071.000585_ NP_744020.1 gi|26988595|ref|NP_744020.1| ISPpu8, transposase [Pseudomonas putida KT2440] MAKLALEQAIAPEWVDQVFE EHRQRQYSRELLFSTIIKLM SLVSLGLKPSLHAAARQLDD LPVSLAALYDKISRTEPALL RALVTGCAQRLAPTIHELGC SAMLPDWQVRVVDGSHLAST EKRLGALRQERGAARPGFSV VVYDPDLDQVIDLQPCEDAY ASERVCVLPLLAEAKTNQVW IADRLYCTLPVMEACEQVKT SFVIRQQAKHPRLIQEGEWQ APMPVATGTVREQSIEVKGG HRWRRVELTLHSPNDSGDNS LMFWSNLPESISAQQIADFY RRRWSIEGMFQRLEAILESE IETLGSPRAALLGFTTAVLA YNVLALLKRSVEQAHRDALP ENWEASIYHLAVQIRGGYEG MQIALPSEYMPVVPMENLAQ RLLELARNIQPRQVAKSPRG PKVLKAKAWVQGTAVHAHVS TDRVIKAAKTKRP >_0071.000089_ NP_743181.1 gi|26987756|ref|NP_743181.1| hypothetical protein PP1020 [Pseudomonas putida KT2440] MPEHPLHRFFSSQRPRPTFE WERYQQRDVLIIDHPRCQAV FSRQGAQLLHFQPAGERPWL WCAEQWPQVGAIRGGVPVCW PWYGRHPSEDLWPAHGWARL LDWKLVDSREDEEGVTLKWR LDLCDWQVDLHARLGSRMEL SLSTEHQDSEPCQLSHALLA YWRISDVSEIALSGLEDIEG YDRLNRQACREDGALKLKGG CQKVYPGTPRVQLQDPAWQR ELCIDTGDSDDTVVWHPGNR PLMGVTGRESQRFVCVEATS GSGEGLSLAPGQRAHLRLQA HRLS >_0070.000031_ PPEN_30JUL02_SCAFFOLD10_REVISED_GENE132 ppen_30jul02_Scaffold10_revised_gene132 LTKLYFIRHGKTEWNLEGRY QGANGDSPLLKESYTEISQL ASFLAPNKFQHIYASPLRRA RVTATTLQHELDELQGYPTP ITISSRLKEFNLGIMEGMKF VDVEREYTDEVDAFRNHPDR YDPTKIKGETFQHLVKRMKP TILRICEKYPAKNDNVIIVS HGAALNALINSLLEVPLADL RKRGGLANTSTTVLASNDLG KSFELVDWNNTSYLKKRIDP TDVI >_0069.001073_ NP_894501.1 gi|33862941|ref|NP_894501.1| phosphoribosylaminoimidazole carboxylase [Prochlorococcus marinus str. MIT 9313] MVDSLCMSSVKSPMIGVVGG GQLAQMLAQAAKSRAVDVVV QSGSAIDPAAVEATRLVLAD PVDVEATGKLVQGCCGVTFE NEWVDIEALIPLEQQGVCFS PSLTALAPLVDKISQRQLLR DLDLPSPDWTLLSSISFAQP ELPREWNFPVMAKSSRWGYD GKGTKVLKSVEDLSQLLRSV DPTQWLLESWVPFEKELAIV VSRDAQGRVRSLPLAETHQF QQVCDWVIAPASVDHAVEMM AYNMAASLLTELNYVGVLAV EFFYGPDGLQVNEVAPRTHN SAHFSIEACSSSQFDQQLCI AAGLPVPETHLHAPGALMVN LLGLQKGGESSLDERLAKLR SCDRFHLHWYGKDCETPGRK LGHVTVLLNGVDAPSRQLEA ETALKHIRSIWPTQDTVCA >_0063.004564_ NP_252917.1 gi|15599423|ref|NP_252917.1| transcriptional regulator PchR [Pseudomonas aeruginosa PA01] MTITIIAPPQADAAAPAPGN RPGVAHIDPNMKLVTGTFCS ASEDWFEEPLERGLRLILVQ SGQLRCRIPGQPEHLIEGPS LCTIANDGDFTSAQIYGTDK PLRYTIVQLGVEALDSRLGW LPEQLIRRPGGDPRIMSCPA PRAMQALASQIATCQMLGPT RDLYLGGKALELAALSAQFL SGEGRPVEEPRITCSEVERI HAARDLLVGALQEPPSLDTL ASRVGMNPRKLTAGFRKVFG ASVFGYLQEYRLREAHRMLC DEEANVSTVAYRVGYSPAHF SIAFRKRYGISPSEIR >_0062.001150_ OOEN_16SEP02_SCAFFOLD3_REVISED_GENE1186 ooen_16sep02_Scaffold3_revised_gene1186 MDKNTVNQAGIVFQKLRKQR HLSTKKLAKGILSVSSVNKF ETGKTKLSFFKLGSLLKRMD ISLDDYYEQIESETLSDSYN HFLNKIEPIYQGNDLLTLKA IAKEETSKNNEEFINLIKTA SLISFIKKIDSNYLVEDMIV KKNKPIFNERRILEFIRTGH FKKLFNYFDLSND >_0061.006372_ NPUN_22DEC03_CONTIG1_REVISED_GENENPR6117 npun_22dec03_Contig1_revised_geneNpR6117 MTEDKWISSDPRSDKLLLRF RVKGFPKQFQISTGLKNLKR NRDIVRLRRDAIVTDISLGR FDPTLERYQFRPSAIATPPS RLHFLGEGNLGELWKMFTEF QSRQIEATTIYSTYRVVSRT IDRLPSRSLLDAPKIRNWLM ENTSQFMASKYLLRFDQCCR WALEEGIIKANPFESLKKIK RIKNNTDDCRAFTREERDTI IGAFESDERCSHYSNLIKFL FWTGCRHGEAFALTWKDISR DCLQISISRSKNLLGIEKGT KNGKSRIFQCSAGAKLHELL LSMKPTASGDSRLFSTKRGC SMTSFGVKPCLEPRREKPDR SSEKVVKSGKNSLLEPLRYQ AHFCNLGDRFW >_0061.003236_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF6602 npun_22dec03_Contig1_revised_geneNpF6602 MMTNMAVNDELWLKDPEFTP SPSRLFHLEPIGLGTCYVES LTSYVARLAAAHSVLPGTLL AREIKPIVGHNHTTNPLNSK SIVSLYGQASVKALNGTQIG AKQLVFALEILTKRTDLQFL TMLPWAKVFPVLGLLKHSHA WCPYCYQDWLNNQQVIYSPL LWALKPANICPIHYRFLESK CPHCDLEFLHLWHNSRPGFC LKCGGWLGMNYEVSLPDKRL FEEIPNLQQEIWIVKTLGDL IAQAPEFPCPPPRETIKTIL EAYVYQYTQGNVSAFGRWLG LSRYEILHWYSGVAIPNLDK LLKICYALSTNLVDFLQLKI LPLTLKQLVSLPAPKQKKLS SQSALVTSNSSPKCERVIQA MQLAQEEEPPPSLTQLAVRL GFKTPSSLTACSKSLSASLA TRYSEYQQQLRFLHIRDILE LALFSDEYPPPSLRKIAKRT GIGLATFYHYCPIVCHAISL RYKDYRKFVHKLAIEQGCRE VRQLASVLHAQGITPTAKNL RKFMPHPSLLWQAEVIDVLR QIRLDLSQ >_0061.000540_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF1021 npun_22dec03_Contig1_revised_geneNpF1021 MAASVEHGLNNNASIPPLES GDRLTRHEFERRYTAMPNKK AELIEGVVYVASPLRFRSHG QPHGQLITWLGVYQVCTPGL ALGDNATVRLDLDNEPQPDV LLLIDKPARGQAQISKDDYV EGAPELVAEVAASSASIDLY DKKRAYRRNGVQEYIVWQTL ENKLDWFCLQNGEYLSLVAD ADGVIKSRVFPGLWLDITSL ITGNMTKVLAVLQQGLNSKE HAEFVEGLT >_0061.000183_ NPUN_22DEC03_CONTIG1_REVISED_GENENPF0314 npun_22dec03_Contig1_revised_geneNpF0314 MHITKVTQNLKVGDKVNQKT NHVFVFLEIFAHEGGIQSYI KDIFRAYLGLSQAYKAEVFL LRDSPDRLNLFEDENLKFHY FKNQSPHLGRLQMTAALLKC LLQNRPQQVFCGHINLAVLI QTLCQPLGIPYTVLTYGKEV WEPLKNQERRALTSADKIWT ISRYSRDRACLANGINPKMV EMMPCAIDGDKFTPGSKQPE FVQKYRLTGSKVLMTVARLW SGDIYKGVDVTIRALPQIAQ VFPQVKYLVIGRGDDQPRLA QLAKDLGVSDRVVFAGFVAT EELMQHYHLADAYIMPSQEG FGIVYLEAMACGVPVLSGDD DGSADPLQDGKLGWRVPHRN PDAVAAACIEMLQGNDQRCD GQWLREQAIALFGIDAFQQH LQKMLLSSVFTPNNK >_0058.001902_ YP_207980.1 gi|59801268|ref|YP_207980.1| hypothetical protein NGO0867 [Neisseria gonorrhoeae FA 1090] MEVHDKIRTLREVNQWTQEE MAEKLEMSVNGYSKIERGKS GINLDKLRQIAQIFNIDVVE LLAEQNRSFFFSIGDNTNNH HNIIGSDEMLVFENEKLRSL LDAKDELIRQKDSEIAVLKK LVILLEEKK >_0056.000451_ SARO_25NOV03_CONTIG24_REVISED_GENE482 saro_25nov03_Contig24_revised_gene482 MPFEPPSFPSTDQLDRDIAQ MLADDARLSFRKIAADLGVT EGTVRGRVKRLQNAGLLKLV PILDIDRARDCGPGEGGQHM MFVTVKCANGKLDQVRHGLL GLPQVSALYDANAAPRLVAI CILSSLREAAEVTDKVIALD GIREVETELVLQSVKYNAAI APIAALEDTGQAVAERED >_0056.000378_ SARO_25NOV03_CONTIG24_REVISED_GENE405 saro_25nov03_Contig24_revised_gene405 MSATEAPETKAAATPAPTDA RTPPVAVWQILEKMRGQELI DWRTAPVEGRASVEERNLDI GFPLGWYAIDLSANLAVGEV RPLRYFSKDLAMWRGEDGQV RVIDAYCKHLGAHMGHGGKV HGNLLECPFHAWRYDGEEGT VKDIPYSKSIPPQVKRKCTR TWHVTEANRWIWLWYHPEDV APLFEVVHLPEATDPEWTDY DIYEWNVYGSIQNMAENGVD VAHFKYIHGTANVPLGDLRW GDWGRGADVKAKMGTPWGEV DGQISYDTMGPGQSWTRFTG ISETLLVACITPVELDHVHV RFCFTQPRSQAEGERAGVAK AIIRDICKQFDQDKIIWDRQ KFEPNALICEGDGPIAQFRK YYSRYYVK >_0056.000161_ SARO_25NOV03_CONTIG22_REVISED_GENE174 saro_25nov03_Contig22_revised_gene174 MARQRIPGNERRAQIVTAAR RVFSRHGYDGAKTLQIAREA NVSEALVYRHFPSKLALYRA VLRQVFIEQDERWREQGIRA DGTMALAAAIHTFIAASVHD ADDPQRIDTHRMTLASLAGD GSYASLIYRRSQRRNISAME AAFASSRADGGMQGEPISVS ASAMFVEHVGSMIAAIRALP QGAQPYGIEGDELVRQATWF CLRGIGMTDAAIATYFAKAA A >_0055.001759_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA1129 rgel_26jun05_Contig562_revised_geneMpeA1129 MLRPSLVEAEMSPSLLLRLS FALVLGLSAGLAHPAPDRQG NGEAAEFPGQRLALKRASDA VLGVQSTATEGASTIDTLGE YRAGSGVVIASDGLVLTIGY LILEAEEVELVLDSGKRMPA RVVAYDLATGFGLVQAVLPL GIAPAQLGQAHAVAEGEPLL FVSGGDDGALSAAELVSRRG FSGYWEYHIDGALFTSPARR DHSGAGLFNAQGELIGIGSL LVPSAPGDGSRRAGNMFVPV DLLPPIFAELRERGVSRASM RAWLGVNCVEQDDGLRVVRV SRDSPAEMAGLQPGDLIRRL DGAPVGGLESFYKMLWNGGS AERDLTIEVLREGRMQSVPV HSIDRTQTLRRARGI >_0055.001039_ RGEL_26JUN05_CONTIG562_REVISED_GENEMPEA0408 rgel_26jun05_Contig562_revised_geneMpeA0408 MSMTIAPPSLDALARQIQTA QDATRQIEPFTSQLDGFDVP AAYAVAQLVHEARLREGARA IGRKIGFTNPDMWSLYGVRD PVWAHVYDRTVVRVAGGRHV CRLAPFTEPKIEPEIVLHFH RAPPAGADAAALLDCIDWVA HAFEIVQSHFPGWKFRAADT VADAALHGALLVGEPQPVAR LQPGLRSALAAFSLELVCDG AVREVGVGANVLGSPLAALA HLAGVLSRQPQALPLQAGEL VTTGTLTTAQPVRAGQTWHT VLRGIDLPGLAVEFVD >_0053.003424_ MMAG_12JAN01_CONTIG3871_REVISED_GENE3442 mmag_12jan01_Contig3871_revised_gene3442 MKILVNAISARMGGIVTYTR NLLASFNERRVKSTFAVSAR FPDEGPGVLRLAASDLRPLR RFAWEQTIWRETVRRIAPDV LFSSANFGLFRSPVPQVLLV REGGLFDPFYLANCTPEQGP VAGFQRGIRRRLIIGSARSS DLVITPTATMRDMLLAWAPD LEGKVEVNSYGTLSDAFAPP CGASRPWRGDGTLRLLYVSV YYPHKQPGLVCQAVRGLLAE GIETRATITMSRAETRDFLG GIHDHILLEQAETDGLVELG RRRYEDLPGLYHGHDVFVFP SISETFGHPMVEALSSGLPV VAADTPVNREVCGDAALYFD PLRPSALVECIRRLDGDAEL RGKMVRMGRERVVQAFTWAS HVDRLIGMFERVAAGRRP >_0052.007138_ NP_109407.1 gi|13488400|ref|NP_109407.1| hypothetical protein [Mesorhizobium loti] MKMAEAGDLMSRGALPSPVG TAIAIAHHGELIQGVFKDDD GRLHRGLVTLPIAGLRSEAR FDRSDGDAVVVTPDHKIKAA AAARLTLDFLNVTGGGALTL QSSIPVGHGYGSSTADVVAA IRAVAAALNVQLRPSSIARL AVSAERASDAIAFDDHAVLF AQREGRVIENFAGSLPPLLL LGFKANGGVPVDTLQLPPAR YDSAEIQEFGVLRALVARAV RLQDPHLLGRAASASAVISQ RHLPKQGFDEIADIAERAGA CGVQVAHSGSLLGLIFDLFT PNLKRRAALVAQQIRRAGFK DVEVHLVNAEGWPW >_0052.004501_ NP_104857.1 gi|13473290|ref|NP_104857.1| hypothetical protein [Mesorhizobium loti] MTKNGKAEENKKINIALQGG GSHGAFSWGVLDRLLEDGRL EISAVSGTSAGAMNAVALAD GFVRGGVDGARQKLDDFWRA VAAKGRFSPVQRMPWDVAWG NWSIENTPGYVFFDTMSRVF SPYVANPLGLNPLRDVVAQE IDFKNVRACKSMELFISATN VETGQLRVFSDGEIDLDTVM ASACLPQLFRAVEIKGVPYW DGGYGGNPALYPFFKTAATE DVLLVQINPVVREGTPRSAN EIQNRIDEITFNAGLLREFR SIAFVKELIAAGRLPHGEYR DIRMHRIDADEAFKDLSASS KVNAEWAFLTYLRDLGRTAA SDWLEENYDAVGKRPTLDLS GELDDGFKPVRGPAPGRRVK EFLAVRKNPEAERRRA >_0052.003384_ NP_102856.1 gi|13471287|ref|NP_102856.1| hypothetical protein [Mesorhizobium loti] MSESIAALPMYDWPEVRGEV DAQWALLCDVFRQKGIDAPQ TIVRRNGDLPPVPGGIHDAE GAWIAPDPATLPPDELDFHK LWLHPALLFAQTCWGPMELG LSSHVQVIGQPSYDAYEGGQ GELYSSALVMRTGEGSEARS PADGKALLPLDLMRGKRFTF NSLDSMSGMIGLTRDLQAAG ESLDIFSSRSESGGHRASIV TVAEGRADVAAIDCESWALA QRFEPAARKVAIVGWTARRK GLPFITARTTPEKTVRAMRE ALAGLAEQPRIQRVG >_0052.001408_ NP_105452.1 gi|13473884|ref|NP_105452.1| hypothetical protein [Mesorhizobium loti] MFSTRRDIEVASAGTNHDAD TPLTRELVAWADIIFVMEKV HRTKLQKKFKASLKKARVIC LDIPDNYEFMDLELIDLLKA RVSRYLA >_0050.002480_ MFLA_01DEC03_CONTIG131_REVISED_GENE2767 mfla_01dec03_Contig131_revised_gene2767 MQDLAVAHVYRTYFPDPPGG LQEAIRQIALTTGMQGVDNT VFTLSPQPRPAILQRPEACV VRCRSWAAPASCDLGGIGAF LAFSRLAEKSDVLHYLFPWP FADVLHTVVRPGCPAVMTYI SDVVRQQLLGRLYAPLMWKV LRQMRLVVANSPAYARTSPV LSHPEIREKVRVIPLGIEES SYPKSGDEEILQRLGIGKDE PYFLFIGVLRYYKGAHFLVK AAKRIGAKVVFAGAGPEGAR LKALAGEVGAENVIFAGMVT NAEKVTLLQCCRALVLPSHL RSEAYGMVLVEAAMFGKPLI SCEIGTGTSYVNHHEETGFV VEPASPEALASAMAVLLSED RLATEMGAAARQRYELLFSG PALGKAYAALFREVDAS >_0049.002461_ NP_786370.1 gi|28379478|ref|NP_786370.1| transcription regulator [Lactobacillus plantarum WCFS1] MGTELQFVQYLHQMRDYNDQ HPTTKGLILLGYLERDFIRF YRAGKIEEGIQFAKKNLTRS RDLLNKLSSEEKQLQLSALV DVLAFEGIQNHAEIYEFVKL RNDYHRWLVKLPVTTERYQP LVVQIIRDFAAIAKPNALAY NAHLESTLDVMYYTNAHLHD QLSVKKVLAHVNARCNPESV RRSFSQEMHMSIRDYINAKK IQEAEHMLLATDLTIRAIAA ELCFYDAADFSKRFKKETGQ TPLEFRQLNTRTD >_0049.000493_ NP_785386.1 gi|28378494|ref|NP_785386.1| hypothetical protein lp_1833 [Lactobacillus plantarum WCFS1] MTAKFPYAKSFLASLETAGK QASTIEQYELTLADFFNYEQ HFNETFAKDQLLADLTENDI QAYLAMLREQRQFKTSTLNK CLSNLNGYFSYLFSHRIITT LPTFTIKGQPLTNRQQTTTW PEQLAAWLAMDDLHPYTRLF LLLTTKGYTATEMLTPGFYQ QLNAITFTAVEQAFLVKLRA YLQPLQTQSGSSDLFLKQRQ RGTDPHLTLAALHKYLAGDS QRLGVPLKPVALRQDFMLWF LNQHRTTEPTEIMQQLRLDE TSLEYYQNLLRQRDLRTLKA TKTD >_0041.000985_ NP_208076.1 gi|15645897|ref|NP_208076.1| conserved hypothetical protein [Helicobacter pylori 26695] MDFIGFEDLKCKDKENSQKV FVIRNDKLGDFILAIPALIA LKHAFLEKGKEVYLGVVVPS YTTPIALEFPFIDEVIIEDN HLSATLKSKPIDALIFLFSN FKNARLAFSLRKSIPYILAP KTKIYSWLYQKSVRQSRSLC LKTEYEYNLDLIHAFCKDHN LPNAQLKKIAWKLKDKSKER SIIASKLNADVGLLWIGVHM HSGGSSPVLPASHFIKLIDC LHNNLSCEIILICGPGERKA TEELLKKIPFAHLYDTSHSL VDLAKLCANLSVYIGNASGP LHVNALFDNQSIGFYPNELS ASIARWRPFNERFLGITPPN GSNDMGLIDIEKEGEKIVGF INYRKM >_0038.001742_ NP_281184.1 gi|15791360|ref|NP_281184.1| glucose kinase; GlcK [Halobacterium sp. NRC-1] MHYAGVDLGATNVRGAVSDE SGRIVGVDRRKTPSGPTGIA VTETVLAVMRAAAADAGVAP TAIEAAGIGSIGPLDLANGA IDSPANMPASVETIPLVGPL ANLLGVDTERVFLHNDTVAG VIGERFYADRTPDDMAYLTI SSGIGAGIAVDGTVIGGWDG NAGELGHMVVDPRGRRTCGC GRDGHWEAYCSGNNIPEYAR LLADDADGVETALPLDSGGF TAKDVFECAADGDTFAAHVV EQLGVWNGIGVTNLVQAYAP LVVYVGGAVALHNPEQVLGP IRAYIQERVFSNVPEIRLTT LGDDVVLKGAIASALTGGTG DSTHAP >_0037.001541_ NP_951708.1 gi|39995757|ref|NP_951708.1| hydrolase, carbon-nitrogen family [Geobacter sulfurreducens PCA] MDFTVALAQIKPKLGCVADN CLMVRQAVERGIDEKADLVV FPELALTGYFLKDLVPDVAL RLDAPEINALRELSRHISIA VGLVEVSADYRFFNTSLYLE GGEVRHVHRKVYLPTYGLFD EQRYLARGEHFRAFDSRFGR MGLLICEDMWHLSAPYILAM DGATTVICLSSSPGRGLTED DSLGSTIAWQKLTSTTAMFF NCRVLYCNRVGYEDGVNFWG GSEVVAPSGAVTSRARILEE DFLVAGVDEGALRRERIFSP MMRDESLLITLSELRRIERE RGR >_0036.001463_ EXIG_01APR05_CONTIG280_REVISED_GENE1464 exig_01apr05_Contig280_revised_gene1464 MQKKALVVIDIQNDMTKNYK EVIGTINQSIDWAVEQDMHV VYIRHENLSAGRRTFKTGTR GAELAPDLKIVSDHIFTKYK GNALTSEAFAAFIEQNDITD FYLTGGDATACVKSTCFNLR KLDYGVTVITDGITSYDKKK LPEMIEYYAKKCSHIIQFND LLSIN >_0035.002363_ NP_816235.1 gi|29377081|ref|NP_816235.1| transcriptional regulator, TetR family [Enterococcus faecalis V583] MPTNTFFHLPEEKQQRLLDA AQIEFSRHSLQEASIANIVK LAGIPRGSFYQYFENKEDLY FYYFATLRKNSERDLEKQII AENGDLIEAMDVYFSKMIVE VLTGENASFYRNLFVNMDYR ASRRVTDNLATGEEEKNRKQ HCHKHRGRKGHAAYAEHLYQ IIDHSKLTIETPKEFTWFMQ TAMHAVFSTIVDGYRQQREN TAYDSTEAVKQLKMKLSWLK NGAYK >_0035.000702_ NP_816116.1 gi|29376962|ref|NP_816116.1| transcriptional regulator, Cro/CI family [Enterococcus faecalis V583] MKENFLKQKRMEKNLTLEQV GNFVGVGKSTVRKWETGMIE NMGRDKIVALSKILDVSPLE ILGIPETEETPFVVPTNIMN IYKQLTTENQEAVFRYAQKK LTEQNYF >_0035.000545_ NP_815531.1 gi|29376377|ref|NP_815531.1| PTS system, IIA component, putative [Enterococcus faecalis V583] MVIKTSIFSLDTTYISNKKT QEEVFEEVYLDLLKKKLVTP DFLTNLLEREHNYPTGLSLT PIDPALPNIAIPHTESTFVR TTRIIPIKLKQSLSFHNMII PDETLSVSFLFMILNENGIE QTGLLATIMDFINTTDRSSL LAFFQCEDKEEVYRFLQKNF KGEI >_0034.004295_ YP_026242.1 gi|49176397|ref|YP_026242.1| pyrimidine-specific nucleoside hydrolase [Escherichia coli K12] MKLIITEDYQEMSRVAAHHL LGYMSKTRRVNLAITAGSTP KGMYEYLTTLVKGKPWYDNC YFYNFDEIPFRGKEGEGVTI TNLRNLFFTPAGIKEENIQK LTIDNYREHDQKLAREGGLD LVVLGLGADGHFCGNLPNTT HFHEQTVEFPIQGEMVDIVA HGELGGDFSLVPDSYVTMGP KSIMAAKNLLIIVSGAGKAQ ALKNVLQGPVTEDVPASVLQ LHPSLMVIADKAAAAELALG >_0034.002996_ NP_416025.1 gi|16129467|ref|NP_416025.1| persistence to inhibition of murein or DNA biosynthesis; regulatory protein [Escherichia coli K12] MMSFQKIYSPTQLANAMKLV RQQNGWTQSELAKKIGIKQA TISNFENNPDNTTLTTFFKI LQSLELSMTLCDAKNASPES TEQQNLEW >_0033.002198_ YP_049059.1 gi|50119892|ref|YP_049059.1| TetR-family transcriptional regulator [Erwinia carotovora subsp. atroseptica SCRI1043] MSKPQARQRLSREERHVQLI QAAWQIIREEGTEALTLGHL AEQAGVTKPVVYDHFTNRSG LLAALYKEYDARQTAIMDGI IRKTEPVLDKLAAVIATSYI DCVLLQGREMPNVMAALTGT PELEHIRQEYDAIFTEKCRA LFAPFCADQPPHDAALRAMI GSAEGLSYAAVKGIITEQQA KDELFHVIISMVKRCSAGR >_0032.002876_ YP_009504.1 gi|46578696|ref|YP_009504.1| glycosyl transferase, group 1 family protein [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough] MRIAIVHYWLVGMRGGEKVI EALCRMYPQADIFTHVVAPE RLSSMLAARHIRTSFIQRLP GSVRHYQKYLPLMPLALEQF DLRDYDLVISSESGPAKGVL TRADTAHVCYCHSPMRYLWD FQQDYLDNASPLMRPFMRIF FHYLRQWDVASSQRVDRFVA NSRNVARRITKHWRREATVV HPPVECDLFTQAPDVADLPD DEVADIVRQGGYYLCLGQLV GYKRVDIAIEACRKAGRRLV VVGDGEQRAMLERNAPEGVT FLGWRTQDEIRALYAGCRAL LFPGEEDFGIVPVECMASGR PVIAYGKGGALETVLDGETG VLFHEQTATACHDAIMRHER MERDFVPDSLRRHARTFDTS VFERKFRREVEAALHAVRG >_0031.001234_ NP_294185.1 gi|15805489|ref|NP_294185.1| hypothetical protein [Deinococcus radiodurans] MGEPFGGPLMGRLVSLLPSA TDLIFELGLGEQLLGVSHCC DHPGAASLPVLTRSIIGSDL PQAEIDRAVSEAVRAGRALY TVDGPLLDRLRPDLVVTQGV CEVCAVTPGTVAEAVRFLPG CLPAEQVLSLEGKTFAGILA DLRALARAAGVRERGEALAA ESERRWNAIRPVGVQVPEPP RVLTLEWVDPPFYGGHWVPE QVAQAGGKDVLGHAGRDSGR TSWEDVVQLDPDVIVVMCCG YGLSDNAEFARQVLSHPELR AVRAGQVWGVDANAHFSRPS LGVVRGAEVLAALLRGEESA GESVRVRAE >_0031.000070_ NP_051594.1 gi|10957469|ref|NP_051594.1| transposase, putative [Deinococcus radiodurans] MIDAPRVNHHDLSAHMPGMS TPQGKKRRADRTFRDEQLDR GFFIALLVVHLPPGKVLLSL DRTNWEHGETPINFLVLGAV VHGFTLPLIWVPLDQSGNSH TYARMWLVLKLLRAWPAKRW LGLVADREFIGAEWFRFLRR QGIKRAIRIRQTDMLDDMKG KEWFEHVQHGHFHEIGEKVF VFGELMRVVATRSPVGDLVI IATDFSARKTWRLYKQRWSI ECTFSSFKKRGFDLERTGMT ERSRLQRLFGLVTLAWMFCL RLGVWLSQTWPIPVLKHGRR AVSLVRHGAQHLVDALRWKP QQFMAILEVFIQAFCPPGAA ESEVVTY >_0030.003713_ DHAF_12NOV03_CONTIG944_REVISED_GENE4239 dhaf_12nov03_Contig944_revised_gene4239 LNYRKEIEKCIEFIENHIKE DITIEEIANQSGYSLYHFCR VFSLCKGISVMEYIRGRRLA LAATELFKGRKIIDIAFDYG FETPSGFAKAFRKAYGYSPT QYMMRMPQYADTKTTFEIGG YIMKPVMVKRPAFKVAGYGI KTNITGVTYTKDIASYWSNY EGKNLESKMYKILNPAEHGE VGLCVPLSEDGNVIYLNFRS AKICSNSVYYWIRLKIFCSL TEKHLSNW >_0030.003263_ DHAF_12NOV03_CONTIG1087_REVISED_GENE3740 dhaf_12nov03_Contig1087_revised_gene3740 MPRFSDTEKEMIKEKLLREG ERLFVAHGVKKVTVDDLVQA AGIAKGSFYAFYTNKEHLYM DIIEQCQMKLWQELDDFLAD HKDLRPQELTKRSFGRMLEG VKEYPILVKTDSAVMSYLQR KLPQDVLDAHGFEDSKALVK LQEYGVRFAYELPVVAKALQ ALYVAIAYLQQEEADHAVVV NLLLDSLIEQIVR >_0030.002778_ DHAF_12NOV03_CONTIG1084_REVISED_GENE3159 dhaf_12nov03_Contig1084_revised_gene3159 MTTWVIVIYLNIELDRNEVA VIYPKFYSLEAEKRERIINA ALKEFARNGYEKASTNEMTK EADISKGSLFSYFNTKKELY LFLLDYVVEVIESIYDEVDW QETDLFERMRKIGLIKFKIY KKFPHAINFLKVAAHEDAGE VKAEIAETGRHLIADGLGRG YENIDWTKFREDMEREKMLN IITWTILSFAEQQRDRVESF EDLSLDLLREWDGYFDILKR CFYK >_0030.000593_ DHAF_12NOV03_CONTIG1033_REVISED_GENE676 dhaf_12nov03_Contig1033_revised_gene676 MLEQNQSRKPELLAPAGDYE KLKFAIAYGADAVYMGGPAF GLRAYAGNFTMEQMAEAIQY THHAGRKLYVTVNIFAHEQD FEEMAAYLKQLESLGADGAI VSDPGIIALAQEAAPKLPLH LSTQMNSTNSYSINFWLKQG LERIVLARELTLAEIRAVRE KVPGELEMFIHGAMCMSYSG RCLLSNYLTGRDANRGECTQ PCRWGYGLVEEKRPGQVFPV EEDERGTYIFNSHDLCLLPY LPMLKPLGIDSYKIEGRMKS IHYVSSTVKVYREAIDTLWE QGEEAFKAKLSSWLEEMDKV SHRDYSPGFLFGKPGAESHN IESSNYIRDYEFVAFGLAAD NREHPQIPTLVKDEFSQGYW VEQRYHFQKGELIEVFSPHE EPWTFEVKGIHTVEGEEVDV ARHAKEILKLELPRPLPPFA ILRRAKKDKK >_0030.000080_ DHAF_12NOV03_CONTIG1005_REVISED_GENE102 dhaf_12nov03_Contig1005_revised_gene102 MDALDRALLNIVQTDFPITS RPYETLAGLTGTTEEEAWQR IQRFRQEGVIRRLGGVFDSH RLGYKSTLCAARVPEDKIQI LADLLMELPGVTHNYLRSHD YNIWFTLIASSQKEVEKILT TIKGLIGTDEVYSLPALRLF KISVDFDFNTEEQAGEQRHS EECECTANPRGSQKLEPYPV DDMDKLLIRELQGNIPESLT PYAEIAAKLMWQEEKVLRQT RALVDNQVIRRFGAVLRHQK AGFTANAMGVWQVPEAEAEA IGKVMASFREVSHCYQRPTL KDWPYNLFTMIHGRSEEECG EVMARIAQATGIKDYAMLFS IKELKKSSMQYYREEDD >_0030.000039_ DHAF_12NOV03_CONTIG1003_REVISED_GENE46 dhaf_12nov03_Contig1003_revised_gene46 VRVHTPSGLALKVVFVQNRN NRREWLAILTTDLSLETTEV VRIYGMRWSIETFFKMAKSH LKLGTEFQGRSFDMMVSHTT IVFTRYLILEWERRENNDER SLGGLFYLFADEVMDLDLKT ALRQLMTFVLNLLPNKPENN ESLSQLQKWIAALPSYIKAL FPQLGCES >_0028.001157_ DDES_06JUN05_CONTIG143_REVISED_GENEDDE1158 ddes_06jun05_Contig143_revised_geneDde1158 MKATDFSESELRMQEYVEHA ISRLGGVREVAVAAGVSRRT VYAWKNGERFPSRSNLNRLS GVLMRCAPSESPVPEFPAAD GRESAAGLGVHENDGTEYGR RAAFGATGAAAHAAHPLHDF SDMPDLRGVLDEFVFVSKAD ARPSAGGGSLQTGAENVERH AFRLDWLLSKTHDTSSLRLM EVMGRSMEPTLHNGDDVLVN EGDTYLVEDKVYVVRVQDEI YIKRFARTPGRLLFRGDNRD LAYQDIEIDPQDVSCDWTVI GRVIWAGKEL >_0026.001716_ NP_662969.1 gi|21674904|ref|NP_662969.1| UDP-N-acetylglucosamine 2-epimerase, putative [Chlorobium tepidum TLS] MSSVKKIVLIAQDRAAFLHV APLVSVFRKNGVFESVLVRV LTPGNRAEHDALAAAFGLSD ELRTIELEPCTPVAETASLM LALERVLSELEPAFVVPGGH DSASLAGAFAAAKMGIPVVS LDAGLRSYDRAEPEEISRLV IDSVAALHFVSEHSGIYNLM NEGVADERILFVGNTAIDSL VTLMAQANQSGVLETLSLAP KKFVTVLLKPEPFGNRDLLC KVLESLAATSTVLLPGSQSP EDALVGVSGLRMIDMPGYID LLRLLKESALVLTDSAEFEA ELTVMNVPCITLRQSTARPS TVELGTNVLIDPDEAEILER ATAILSGKQLKKTLIPEKWD GAASKRIAEVLERGA >_0024.003275_ CHUT_08NOV04_CONTIG199_REVISED_GENE770 chut_08nov04_Contig199_revised_gene770 MSQDTILLIWDRIGDYHLSR VKACEKLLGAPVFTADLAGT DNLYKWDSIAKTTHTVLSSK PAEQSDIWNRFRTFRRIIKT HSIAVVAMPYGRTEYHIFLL YARLKGIRTIIFSESWYSRG KLKDFLKSLLLKSLGNYFFV SGKRAFDHFTKNYKINPEKI ETGYSVVDNNHFQRRLFTEK KYVLTIARYSEEKNLSFLID SYAKSVISNTYNLMLIGEGP LRPALQSQIDILGLSDRVQL AGWVPYAALPKAYAEAVVFV LPSSFEPWGLVVNEAMSAGL PILVSDACGCMPELVAEGVN GWSFNSNTQQELIERLNAFA ALSEKGIEITGKNSMQLIRN YTPDTWAGSIARLAQAGR >_0024.002459_ CHUT_08NOV04_CONTIG199_REVISED_GENE3211 chut_08nov04_Contig199_revised_gene3211 MKPKETVCFNIKYSWHAISR MYNERASKYGTTAAVGFILL NIDVEKGTAATHIGPALGME ATSLSRVFNTMEQDGLIERR NDSSDKRKVTIFLTALGKDK RQIAKDFVLDFNNKVREQVS ESKLNTFFEVLHEIQSIIDN KNTSLT >_0023.002578_ NP_602033.1 gi|19554031|ref|NP_602033.1| hypothetical protein NCgl2743 [Corynebacterium glutamicum ATCC 13032] MVALADHLQMAVKRNELEPE LTSNPNPLSAEVHHLYPEET RLATEILERTNNWLAEKGIP PLPPAEVVAISLHLVNAGFR TEDLAETYVMTGVFEQLFEV IDSSFGITLDRQSVNAARFI THMRYFFVRVHHDGQLNDGM SVLRNSLEISHPDSVACAER LSQILSLRLGAELSSDEQTY LALHVARLAEDRGTTAD >_0020.000489_ CAUR_25MAY01_CONTIG1048_REVISED_GENE693 caur_25may01_Contig1048_revised_gene693 MSRFRVGVQLHPQHTTWESY RQAVQYAESIGCDTIWNWDH FFPLYGPPDGPHFEGWTLLT AMAVITQRAEVGCLVTCNSY RNPALLSNMAKTVDHISGGR LILGLGAGWFEKDYTDVFGW VPNRKFLGVTPPGGDLLGPS GSRLRALRQALPIIKERWAA DQPPPLRRIPILIGGGGEKV TLRITAEHADIWHGFGPIEN FKRKNAILDQWCEQIGRDPA TIERSVSPRAGEPIEPYLEA GATHIIIGMGEPWNFAPVEE LLKLRG >_0019.002006_ NP_347468.1 gi|15894119|ref|NP_347468.1| Fusion: transcriptional regulator and conserved domain [Clostridium acetobutylicum] MVKALVLYFLSVKPTYGYDI QRFIEIDGMDQWAKVKSGSI YYALNKLEKDGFIFTLREER TGARIRKIYAISDKGVEELR RVLKEELLKPIDNVEADKFM IYLMFNRLERDEIIDLTRQH IQSLEQRKKWWEDGRKIKVS EATLKVEILHFDNVIANLDN QIKWHKTLIEEIDEIIKFSK GVEQLIRKIDFGALEDVQYK SKCEDGDVVKQISNVADDIM KNPTDVEEKIETLIKLLRKH >_0018.007238_ BFUN_06OCT04_CONTIG482_REVISED_GENE7239 bfun_06oct04_Contig482_revised_gene7239 MRQVTYVANQRRYLQTPARS SKYCLDALYQPRDIDRKAVI EMQGESRFARAAQKFVSIRE FYDIFRPITRTLAAPTPKRM ASDKELADFLAGVERRAFKQ TVYAVRDDDASLDIVQDAMI KLAEKYGDRPAAELPLLFQR ILQNAMHDYFRRAKVRNTWV SLFSSLGNADDDEFDPLETF EAQQGSAGAESNEQKLEREQ VLQLIDDEIQKLPARQREAF LMRYWEDMDVAETAAAMGCS EGSVKTHCSRATHTLAQALK AKGITL