FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8146, 373 aa
  1>>>pF1KB8146 373 - 373 aa - 373 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6112+/-0.000829; mu= 15.2875+/- 0.050
 mean_var=71.7126+/-14.411, 0's: 0 Z-trim(107.8): 31 B-trim: 549 in 1/48
 Lambda= 0.151453
 statistics sampled from 9762 (9790) to 9762 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.674), E-opt: 0.2 (0.301), width: 16
 Scan time: 2.510

The best scores are:                                 opt  bits E(32554)
CCDS30659.1 MECR gene_id:51102|Hs108|chr1    ( 373) 2472 549.2 2.2e-156
CCDS30660.1 MECR gene_id:51102|Hs108|chr1    ( 297) 1948 434.6 5.2e-122
CCDS11451.1 VAT1 gene_id:10493|Hs108|chr17   ( 393)  331  81.4 1.5e-15
CCDS665.1 CRYZ gene_id:1429|Hs108|chr1       ( 329)  274  68.9 7.3e-12
CCDS32492.1 VAT1L gene_id:57687|Hs108|chr16  ( 419)  269  67.9 1.9e-11
CCDS44162.1 CRYZ gene_id:1429|Hs108|chr1     ( 295)  264  66.7 3e-11

>>CCDS30659.1 MECR gene_id:51102|Hs108|chr1 (373 aa)
 initn: 2472 init1: 2472 opt: 2472 Z-score: 2921.2 bits: 549.2 E(32554): 2.2e-156
Smith-Waterman score: 2472; 99.7% identity (100.0% similar) in 373 aa overlap (1-373:1-373)

               10        20        30        40        50        60
pF1KB8 MWVCSTLWRVRTPARQWRGLLPASGCHGPAASSYSASAEPARVRALVYGHHGDPAKVVEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 MWVCSTLWRVRTPARQWRGLLPASGCHGPAASSYSASAEPARVRALVYGHHGDPAKVVEL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 KNLELAAVRGSDVRVKMLAAPINPSDINMIQGNYGLLPELPAVGGNEGVAQVVAVGSNVT
       :::::::::::::::::::::::::::::::::::.::::::::::::::::::::::::
CCDS30 KNLELAAVRGSDVRVKMLAAPINPSDINMIQGNYGFLPELPAVGGNEGVAQVVAVGSNVT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 GLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYRMLMDFEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 GLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYRMLMDFEQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 LQPGDSVIQNASNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSDRLKSLGAEHVITEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 LQPGDSVIQNASNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSDRLKSLGAEHVITEE
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 ELRRPEMKNFFKDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYGGMAKQPVVASVSLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 ELRRPEMKNFFKDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYGGMAKQPVVASVSLL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 IFKDLKLRGFWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACSQVPLQDYQSALEAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 IFKDLKLRGFWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACSQVPLQDYQSALEAS
              310       320       330       340       350       360

              370
pF1KB8 MKPFISSKQILTM
       :::::::::::::
CCDS30 MKPFISSKQILTM
              370

>>CCDS30660.1 MECR gene_id:51102|Hs108|chr1 (297 aa)
 initn: 1948 init1: 1948 opt: 1948 Z-score: 2303.9 bits: 434.6 E(32554): 5.2e-122
Smith-Waterman score: 1948; 99.7% identity (100.0% similar) in 297 aa overlap (77-373:1-297)

         50        60        70        80        90       100
pF1KB8 VYGHHGDPAKVVELKNLELAAVRGSDVRVKMLAAPINPSDINMIQGNYGLLPELPAVGGN
                                     :::::::::::::::::::.::::::::::
CCDS30                               MLAAPINPSDINMIQGNYGFLPELPAVGGN
                                             10        20        30

        110       120       130       140       150       160
pF1KB8 EGVAQVVAVGSNVTGLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 EGVAQVVAVGSNVTGLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGV
               40        50        60        70        80        90

        170       180       190       200       210       220
pF1KB8 NPCTAYRMLMDFEQLQPGDSVIQNASNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 NPCTAYRMLMDFEQLQPGDSVIQNASNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSD
              100       110       120       130       140       150

        230       240       250       260       270       280
pF1KB8 RLKSLGAEHVITEEELRRPEMKNFFKDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 RLKSLGAEHVITEEELRRPEMKNFFKDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYG
              160       170       180       190       200       210

        290       300       310       320       330       340
pF1KB8 GMAKQPVVASVSLLIFKDLKLRGFWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 GMAKQPVVASVSLLIFKDLKLRGFWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACS
              220       230       240       250       260       270

        350       360       370
pF1KB8 QVPLQDYQSALEASMKPFISSKQILTM
       :::::::::::::::::::::::::::
CCDS30 QVPLQDYQSALEASMKPFISSKQILTM
              280       290

>>CCDS11451.1 VAT1 gene_id:10493|Hs108|chr17 (393 aa)
 initn: 287 init1: 130 opt: 331 Z-score: 392.6 bits: 81.4 E(32554): 1.5e-15
Smith-Waterman score: 331; 32.2% identity (58.2% similar) in 273 aa overlap (23-286:26-286)

       10 20 30 40 50
pF1KB8 MWVCSTLWRVRTPARQWRGLLPASGCHGPAAS--SYSASAEPARVRALVYGHHGDPA
       :: . :::: . .:.: : .: :: :
CCDS11 MSDEREVAEAATGEDASSPPPKTEAASDPQHPAASEGAAAAAASPPLLRCLVLTGFGGYD
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KB8 KVVELKNLELA--AVRGSDVRVKMLAAPINPSDINMIQGNYGLLPELPAVGGNEGVAQVV
       :: .:.. : : ... ... : .: .:. :: : :: ::.. : ::.. :.
CCDS11 KV-KLQSRPAAPPAPGPGQLTLRLRACGLNFADLMARQGLYDRLPPLPVTPGMEGAGVVI
       70 80 90 100 110

       120 130 140 150 160 170
pF1KB8 AVGSNVTGLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYR
       ::: .:. : :: :. : . : :. :.. . .: . .. ::.: :: :::
CCDS11 AVGEGVSDRKAGDRVMVLNRS-GMWQEEVTVPSVQTFLIPEAMTFEEAAALLVNYITAYM
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KB8 MLMDFEQLQPGDSVIQNASNSGVGQAVIQIAAALGLRTI-NV-VRDRPDIQKLSDRLKSL
       .:.:: .:::: ::. . . .:::.:..:. ::. :: : . .: . ::
CCDS11 VLFDFGNLQPGHSVLVHMAAGGVGMAAVQLC-----RTVENVTVFGTASASK-HEALKEN
       180 190 200 210 220 230

       240 250 260 270 280
pF1KB8 GAEHVITEEELRRPEMKNFFKDMPQPR---LALNCVGGKSSTELLRQLARGGTMVTYGGM
       :. : : . . .. . .: . .:. .... .::..... : : .::::
CCDS11 GVTHPI---DYHTTDYVDEIKKI-SPKGVDIVMDPLGGSDTAKGYNLLKPMGKVVTYGMA
       240 250 260 270 280

       290 300 310 320 330 340
pF1KB8 AKQPVVASVSLLIFKDLKLRGFWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACSQV
CCDS11 NLLTGPKRNLMALARTWWNQFSVTALQLLQANRAVCGFHLGYLDGEVELVSGVVARLLAL
       290 300 310 320 330 340

>>CCDS665.1 CRYZ gene_id:1429|Hs108|chr1 (329 aa)
 initn: 274 init1: 231 opt: 274 Z-score: 326.5 bits: 68.9 E(32554): 7.3e-12
Smith-Waterman score: 284; 23.3% identity (59.0% similar) in 317 aa overlap (43-358:8-313)

       20 30 40 50 60 70
pF1KB8 PARQWRGLLPASGCHGPAASSYSASAEPARVRALVYGHHGDPAKVVELK-NLELAAVRGS
       .::. . : : .:..:. .. . .
CCDS66 MATGQKLMRAVRVFEFGGP-EVLKLRSDIAVPIPKDH
       10 20 30

       80 90 100 110 120 130
pF1KB8 DVRVKMLAAPINPSDINMIQGNYGLLPELPAVGGNEGVAQVVAVGSNVTGLKPGDWVIPA
       .: .:. : .:: . . .:.:. : :: . :.. .. . :::.:....: :: :. .
CCDS66 QVLIKVHACGVNPVETYIRSGTYSRKPLLPYTPGSDVAGVIEAVGDNASAFKKGDRVFTS
       40 50 60 70 80 90

       140 150 160 170 180 190
pF1KB8 NAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYRMLMDFEQLQPGDSVIQNA
       .. : . :. ..... ..: . ....:..:. :::: :. .. :.::. ..
CCDS66 STISGGYAEYALAADHTVYKLPEKLDFKQGAAIGIPYFTAYRALIHSACVKAGESVLVHG
       100 110 120 130 140 150

       200 210 220 230 240 250
pF1KB8 SNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSDRLKSLGAEHVITEEELRRPEMKNFF
       ...::: :. ::: : ::. .... . ::. :.. ::..:....:. . . .
CCDS66 ASGGVGLAACQIARAYGLKILGTA-GTEEGQKIV--LQN-GAHEVFNHREVNYIDKIKKY
       160 170 180 190 200 210

       260 270 280 290 300 310
pF1KB8 KDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYGGMAKQPVVASVSLLIFKDLKLRGFW
       . .. ... . .. : :..:: ... : .. . . . :. .. :
CCDS66 VGEKGIDIIIEMLANVNLSKDLSLLSHGGRVIVVG--SRGTIEINPRDTMAKESSIIGVT
       220 230 240 250 260 270

       320 330 340 350 360 370
pF1KB8 LSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACSQVPLQDYQSALEASMKPFISSKQIL
       : . :. .:.. .: .. : : :: ::. : :
CCDS66 LFSSTKE----EFQQYAAALQAGMEIGWLKPVIGSQYPLEKVAEAHENIIHGSGATGKMI
       280 290 300 310 320

pF1KB8 TM
CCDS66 LLL

>>CCDS32492.1 VAT1L gene_id:57687|Hs108|chr16 (419 aa)
 initn: 131 init1: 107 opt: 269 Z-score: 319.0 bits: 67.9 E(32554): 1.9e-11
Smith-Waterman score: 269; 25.7% identity (56.7% similar) in 268 aa overlap (19-286:21-276)

       10 20 30 40 50
pF1KB8 MWVCSTLWRVRTPARQWRGLLPASGCHGPAASSYSASAEPARVRALVYGHHGDPAKVV
       : :: : : .. . . : .::.: . : :.
CCDS32 MAKEGVEKAEETEQMIEKEAGKEPAEGGGGDGSHRLGDAQE---MRAVVLAGFGGLNKLR
       10 20 30 40 50

       60 70 80 90 100 110
pF1KB8 ELKNLELAAVRGSDVRVKMLAAPINPSDINMIQGNYGLLPELPAVGGNEGVAQVVAVGSN
       ... . . ....... : .: :. . ::: :. : : : : . : :.:..
CCDS32 LFRK-AMPEPQDGELKIRVKACGLNFIDLMVRQGNIDNPPKTPLVPGFECSGIVEALGDS
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KB8 VTGLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYRMLMDF
       : : . :: :. : .. ..: . : . ..:.:. .. ::.. .: ::: ::..
CCDS32 VKGYEIGDRVM-AFVNYNAWAEVVCTPVEFVYKIPDDMSFSEAAAFPMNFVTAYVMLFEV
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KB8 EQLQPGDSVIQNASNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSDRLKSLGAEHVIT
       .:. : ::. .....:::::: :. ... :. . . . ..: . : ...
CCDS32 ANLREGMSVLVHSAGGGVGQAVAQLCSTVPNVTVFGTASTFKHEAIKDSVTHLFDRNADY
       180 190 200 210 220 230

       240 250 260 270 280 290
pF1KB8 EEELRRPEMKNFFKDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYGGMAKQPVVASVS
       .:..: .. :. .:.:. : .. . : : ::.. ::
CCDS32 VQEVKRISAEGV--DI-----VLDCLCGDNTGKGLSLLKPLGTYILYGSSNMVTGETKSF
       240 250 260 270 280

       300 310 320 330 340 350
pF1KB8 LLIFKDLKLRGFWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACSQVPLQDYQSALE
CCDS32 FSFAKSWWQVEKVNPIKLYEENKVIAGFSLLNLLFKQGRAGLIRGVVEKLIGLYNQKKIK
       290 300 310 320 330 340

>>CCDS44162.1 CRYZ gene_id:1429|Hs108|chr1 (295 aa)
 initn: 248 init1: 248 opt: 264 Z-score: 315.4 bits: 66.7 E(32554): 3e-11
Smith-Waterman score: 264; 27.4% identity (66.7% similar) in 201 aa overlap (43-242:8-203)

       20 30 40 50 60 70
pF1KB8 PARQWRGLLPASGCHGPAASSYSASAEPARVRALVYGHHGDPAKVVELK-NLELAAVRGS
       .::. . : : .:..:. .. . .
CCDS44 MATGQKLMRAVRVFEFGGP-EVLKLRSDIAVPIPKDH
       10 20 30

       80 90 100 110 120 130
pF1KB8 DVRVKMLAAPINPSDINMIQGNYGLLPELPAVGGNEGVAQVVAVGSNVTGLKPGDWVIPA
       .: .:. : .:: . . .:.:. : :: . :.. .. . :::.:....: :: :. .
CCDS44 QVLIKVHACGVNPVETYIRSGTYSRKPLLPYTPGSDVAGVIEAVGDNASAFKKGDRVFTS
       40 50 60 70 80 90

       140 150 160 170 180 190
pF1KB8 NAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYRMLMDFEQLQPGDSVIQNA
       .. : . :. ..... ..: . ....:..:. :::: :. .. :.::. ..
CCDS44 STISGGYAEYALAADHTVYKLPEKLDFKQGAAIGIPYFTAYRALIHSACVKAGESVLVHG
       100 110 120 130 140 150

       200 210 220 230 240 250
pF1KB8 SNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSDRLKSLGAEHVITEEELRRPEMKNFF
       ...::: :. ::: : ::. .... . ::. :.. ::..:....:.
CCDS44 ASGGVGLAACQIARAYGLKILGTA-GTEEGQKIV--LQN-GAHEVFNHREVNYIDKIKVV
       160 170 180 190 200 210

       260 270 280 290 300 310
pF1KB8 KDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYGGMAKQPVVASVSLLIFKDLKLRGFW
CCDS44 GSRGTIEINPRDTMAKESSIIGVTLFSSTKEEFQQYAAALQAGMEIGWLKPVIGSQYPLE
       220 230 240 250 260 270

373 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 19:59:01 2016 done: Fri Nov 4 19:59:02 2016
 Total Scan time: 2.510 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]
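
The rows of the "best scores" table in this report can be read programmatically. The following is only a minimal sketch, assuming each row follows the layout seen here (accession, gene symbol, a gene_id annotation, library sequence length in parentheses, then opt, bits, and E-value). The names HIT_RE and parse_hit are hypothetical helpers introduced for illustration; they are not part of the FASTA package.

```python
import re

# Hypothetical helper, not part of the FASTA package. It assumes each hit row
# looks like the rows of the "best scores" table in this report:
#   accession  gene  gene_id:<id>|<build>|<chr>  ( length)  opt  bits  E-value
HIT_RE = re.compile(
    r"^(?P<accession>\S+)\s+(?P<gene>\S+)\s+gene_id:(?P<gene_id>\d+)\|\S+\s+"
    r"\(\s*(?P<length>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\S+)$"
)


def parse_hit(line):
    """Return a dict for one best-scores row, or None if the line does not match."""
    match = HIT_RE.match(line.strip())
    if match is None:
        return None
    return {
        "accession": match["accession"],
        "gene": match["gene"],
        "gene_id": int(match["gene_id"]),
        "length": int(match["length"]),   # library sequence length in aa
        "opt": int(match["opt"]),         # optimized alignment score
        "bits": float(match["bits"]),     # bit score
        "evalue": float(match["evalue"]), # expectation value, E(32554) here
    }


if __name__ == "__main__":
    row = "CCDS30660.1 MECR gene_id:51102|Hs108|chr1 ( 297) 1948 434.6 5.2e-122"
    print(parse_hit(row))
```

Lines that do not fit the assumed layout, such as the table header, yield None, so the function can be applied to every line of a report and the results filtered.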