FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0454, 294 aa
  1>>>pF1KE0454 294 - 294 aa - 294 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9253+/-0.000863; mu= 11.9189+/- 0.052
 mean_var=67.2663+/-13.426, 0's: 0 Z-trim(106.3): 17 B-trim: 0 in 0/50
 Lambda= 0.156378
 statistics sampled from 8920 (8931) to 8920 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.653), E-opt: 0.2 (0.274), width: 16
 Scan time: 2.350

The best scores are:                                       opt  bits E(32554)
CCDS53830.1 GPN3 gene_id:51184|Hs108|chr12         ( 294) 1985 456.6 9.7e-129
CCDS9147.1 GPN3 gene_id:51184|Hs108|chr12          ( 284) 1809 416.9 8.4e-117
CCDS53831.1 GPN3 gene_id:51184|Hs108|chr12         ( 323) 1805 416.0 1.8e-116
CCDS289.1 GPN2 gene_id:54707|Hs108|chr1            ( 310)  657 157.0 1.6e-38
CCDS1760.2 GPN1 gene_id:11321|Hs108|chr2           ( 388)  288  73.8 2.2e-13
CCDS46248.1 GPN1 gene_id:11321|Hs108|chr2          ( 362)  277  71.3 1.2e-12


>>CCDS53830.1 GPN3 gene_id:51184|Hs108|chr12 (294 aa)
 initn: 1985 init1: 1985 opt: 1985 Z-score: 2424.7 bits: 456.6 E(32554): 9.7e-129
Smith-Waterman score: 1985; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294)

               10        20        30        40        50        60
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE0 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
              250       260       270       280       290

>>CCDS9147.1 GPN3 gene_id:51184|Hs108|chr12 (284 aa)
 initn: 1809 init1: 1809 opt: 1809 Z-score: 2210.4 bits: 416.9 E(32554): 8.4e-117
Smith-Waterman score: 1884; 96.6% identity (96.6% similar) in 294 aa overlap (1-294:1-284)

               10        20        30        40        50        60
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
       ::::::::::::::::          ::::::::::::::::::::::::::::::::::
CCDS91 MPRYAQLVMGPAGSGK----------STYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
               10                  20        30        40        50

               70        80        90       100       110       120
pF1KE0 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
               60        70        80        90       100       110

              130       140       150       160       170       180
pF1KE0 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
              120       130       140       150       160       170

              190       200       210       220       230       240
pF1KE0 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
              180       190       200       210       220       230

              250       260       270       280       290
pF1KE0 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
              240       250       260       270       280

>>CCDS53831.1 GPN3 gene_id:51184|Hs108|chr12 (323 aa)
 initn: 1805 init1: 1805 opt: 1805 Z-score: 2204.6 bits: 416.0 E(32554): 1.8e-116
Smith-Waterman score: 1805; 99.6% identity (100.0% similar) in 269 aa overlap (26-294:55-323)

                    10        20        30        40        50
pF1KE0      MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEH
                                     .:::::::::::::::::::::::::::::
CCDS53 SPYRSNVCTQTTDRSTWRKDAELYLLSVITQSTYCATMVQHCEALNRSVQVVNLDPAAEH
           30        40        50        60        70        80

          60        70        80        90       100       110
pF1KE0 FNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 FNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILF
           90       100       110       120       130       140

         120       130       140       150       160       170
pF1KE0 DCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 DCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISL
          150       160       170       180       190       200

         180       190       200       210       220       230
pF1KE0 EIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDY
          210       220       230       240       250       260

         240       250       260       270       280       290
pF1KE0 SMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 SMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
          270       280       290       300       310       320

>>CCDS289.1 GPN2 gene_id:54707|Hs108|chr1 (310 aa)
 initn: 684 init1: 399 opt: 657 Z-score: 805.2 bits: 157.0 E(32554): 1.6e-38
Smith-Waterman score: 681; 40.5% identity (70.5% similar) in 264 aa overlap (4-264:10-261)

                     10        20        30        40        50
pF1KE0       MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAE
                ..: :.:: ::::          .:::  : .  .::.: : ::::::: :
CCDS28 MAGAAPTTAFGQAVIGPPGSGK----------TTYCLGMSEFLRALGRRVAVVNLDPANE
               10        20                  30        40        50

           60        70        80        90       100       110
pF1KE0 HFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYIL
        . :   .:. ::. . :::  :.::.::::::..::::.  :.:::.  :  ..  :.:
CCDS28 GLPYECAVDVGELVGLGDVM--DALRLGPNGGLLYCMEYLEANLDWLRAKLDPLRGHYFL
               60        70          80        90       100

          120       130       140       150       160       170
pF1KE0 FDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMIS
       ::::::.:: ::  ..... .:. ::..:. .: ::::.. ..  :::: . ..:..:.
CCDS28 FDCPGQVELCTHHGALRSIFSQMAQWDLRLTAVHLVDSHYCTDPAKFISVLCTSLATMLH
      110       120       130       140       150       160

          180       190         200        210       220       230
pF1KE0 LEIPQVNIMTKMDLLSKKAKK--EIEKFLDP-DMYSLLEDSTSDLRSKKFKKLTKAICGL
       .:.:..:...::::. . .:   ... . .  :.  ::.  .::   .....:.. .  :
CCDS28 VELPHINLLSKMDLIEHYGKLAFNLDYYTEVLDLSYLLDHLASDPFFRHYRQLNEKLVQL
      170       180       190       200       210       220

             240       250       260       270       280       290
pF1KE0 IDDYSMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQEC
       :.:::.: :.: . .:.::.. ::: .: :  :
CCDS28 IEDYSLVSFIPLNIQDKESIQRVLQAVDKANGYCFRAQEQRSLEAMMSAAMGADFHFSST
      230       240       250       260       270       280


pF1KE0 QDE

CCDS28 LGIQEKYLAPSNQSVEQEAMQL
      290       300       310

>>CCDS1760.2 GPN1 gene_id:11321|Hs108|chr2 (388 aa)
 initn: 183 init1: 85 opt: 288 Z-score: 353.6 bits: 73.8 E(32554): 2.2e-13
Smith-Waterman score: 304; 24.8% identity (58.1% similar) in 298 aa overlap (2-294:30-307)

                                             10        20        30
pF1KE0                             MPRYA--QLVMGPAGSGKVRICGDKERKSTYC
                                    ::.    ::.: :::::          .:.
CCDS17 MRCLYGRVGGARRKMAASAAAAELQASGGPRHPVCLLVLGMAGSGK----------TTFV
               10        20        30        40                  50

               40        50        60        70        80        90
pF1KE0 ATMVQHCEALNRSVQVVNLDPAAEHFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFC
         .. : .: .    :.:::::...  . .  :::. ..  .::.. .:  :::::.:
CCDS17 QRLTGHLHAQGTPPYVINLDPAVHEVPFPANIDIRDTVKYKEVMKQYGL--GPNGGIVTS
               60        70        80        90         100

              100       110         120       130       140
pF1KE0 MEYFANNFDWLENCLGHVED--DYILFDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVF
       .. ::. :: . . . ....   :.:.: :::::..:       ... : .  : .  ..
CCDS17 LNLFATRFDQVMKFIEKAQNMSKYVLIDTPGQIEVFTWSASGTIITEALAS-SFPTVVIY
      110       120       130       140       150        160

      150       160       170       180       190       200
pF1KE0 LVDSQFMVESFKFISGILAALSAMISLEIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSL
       ..:..  ..   :.:..: : : . . ..: . .:.: :.....    .: . :   .
CCDS17 VMDTSRSTNPVTFMSNMLYACSILYKTKLPFIVVMNKTDIIDHSFA--VEWMQD---FEA
       170       180       190       200       210            220

      210       220       230        240       250       260
pF1KE0 LEDSTSDLRSKKFKKLTKAICGLIDD-YSMVRFLPYDQSDEESMNIVLQHIDFAIQYGED
       ..:. .. ..   ..::...  ..:. :: .: .  .     ... .. ..  : .  :
CCDS17 FQDALNQ-ETTYVSNLTRSMSLVLDEFYSSLRVVGVSAVLGTGLDELFVQVTSAAEEYER
             230       240       250       260       270       280

       270       280       290
pF1KE0 LEFKEPKEREDESSSMFDEYFQECQDE
        :..   ::  .: .  .   :. : :
CCDS17 -EYRPEYERLKKSLANAESQQQREQLERLRKDMGSVALDAGTAKDSLSPVLHPSDLILTR
              290       300       310       320       330       340

>>CCDS46248.1 GPN1 gene_id:11321|Hs108|chr2 (362 aa)
 initn: 147 init1: 85 opt: 277 Z-score: 340.7 bits: 71.3 E(32554): 1.2e-12
Smith-Waterman score: 277; 23.5% identity (58.8% similar) in 272 aa overlap (26-294:20-281)

               10        20        30        40        50        60
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
                                :  .   .. : .: .    :.:::::...  . .
CCDS46       MTGHTRSSLPRCTGIVLLIKLRFSERLTGHLHAQGTPPYVINLDPAVHEVPFPA
                     10        20        30        40        50

               70        80        90       100       110
pF1KE0 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVED--DYILFDCP
         :::. ..  .::.. .:  :::::.:  .. ::. :: . . . ....   :.:.: :
CCDS46 NIDIRDTVKYKEVMKQYGL--GPNGGIVTSLNLFATRFDQVMKFIEKAQNMSKYVLIDTP
           60        70          80        90       100       110

      120       130       140       150       160       170
pF1KE0 GQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIP
       ::::..:       ... : .  : .  ....:..  ..   :.:..: : : . . ..:
CCDS46 GQIEVFTWSASGTIITEALAS-SFPTVVIYVMDTSRSTNPVTFMSNMLYACSILYKTKLP
            120       130        140       150       160       170

      180       190       200       210       220       230
pF1KE0 QVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDD-YSM
        . .:.: :.....    .: . :   .  ..:. .. ..   ..::...  ..:. ::
CCDS46 FIVVMNKTDIIDHSFA--VEWMQD---FEAFQDALNQ-ETTYVSNLTRSMSLVLDEFYSS
             180         190          200        210       220

       240       250       260       270       280       290
pF1KE0 VRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
       .: .  .     ... .. ..  : .  :  :..   ::  .: .  .   :. : :
CCDS46 LRVVGVSAVLGTGLDELFVQVTSAAEEYER-EYRPEYERLKKSLANAESQQQREQLERLR
         230       240       250        260       270       280


CCDS46 KDMGSVALDAGTAKDSLSPVLHPSDLILTRGTLDEEDEEADSDTDDIDHRVTEESHEEPA
          290       300       310       320       330       340


294 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 07:44:28 2016 done: Thu Nov 3 07:44:29 2016
 Total Scan time: 2.350 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]