FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA0089, 351 aa
  1>>>pF1KA0089 351 - 351 aa - 351 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6619+/-0.00102; mu= 13.6478+/- 0.061
 mean_var=55.9099+/-11.053, 0's: 0 Z-trim(102.5): 31 B-trim: 0 in 0/49
 Lambda= 0.171526
 statistics sampled from 6976 (6982) to 6976 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.584), E-opt: 0.2 (0.214), width: 16
 Scan time: 2.520

The best scores are:                                       opt bits E(32554)
CCDS33729.1 GPD1L gene_id:23171|Hs108|chr3         ( 351) 2295 576.3 1.3e-164
CCDS8799.1 GPD1 gene_id:2819|Hs108|chr12           ( 349) 1733 437.2 9.6e-123
CCDS58229.1 GPD1 gene_id:2819|Hs108|chr12          ( 326) 1292 328.1 6.4e-90

>>CCDS33729.1 GPD1L gene_id:23171|Hs108|chr3 (351 aa)
 initn: 2295 init1: 2295 opt: 2295 Z-score: 3068.7 bits: 576.3 E(32554): 1.3e-164
Smith-Waterman score: 2295; 100.0% identity (100.0% similar) in 351 aa overlap (1-351:1-351)

               10        20        30        40        50        60
pF1KA0 MAAAPLKVCIVGSGNWGSAVAKIIGNNVKKLQKFASTVKMWVFEETVNGRKLTDIINNDH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MAAAPLKVCIVGSGNWGSAVAKIIGNNVKKLQKFASTVKMWVFEETVNGRKLTDIINNDH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA0 ENVKYLPGHKLPENVVAMSNLSEAVQDADLLVFVIPHQFIHRICDEITGRVPKKALGITL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 ENVKYLPGHKLPENVVAMSNLSEAVQDADLLVFVIPHQFIHRICDEITGRVPKKALGITL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA0 IKGIDEGPEGLKLISDIIREKMGIDISVLMGANIANEVAAEKFCETTIGSKVMENGLLFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 IKGIDEGPEGLKLISDIIREKMGIDISVLMGANIANEVAAEKFCETTIGSKVMENGLLFK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA0 ELLQTPNFRITVVDDADTVELCGALKNIVAVGAGFCDGLRCGDNTKAAVIRLGLMEMIAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 ELLQTPNFRITVVDDADTVELCGALKNIVAVGAGFCDGLRCGDNTKAAVIRLGLMEMIAF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA0 ARIFCKGQVSTATFLESCGVADLITTCYGGRNRRVAEAFARTGKTIEELEKEMLNGQKLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 ARIFCKGQVSTATFLESCGVADLITTCYGGRNRRVAEAFARTGKTIEELEKEMLNGQKLQ
              250       260       270       280       290       300

              310       320       330       340       350
pF1KA0 GPQTSAEVYRILKQKGLLDKFPLFTAVYQICYESRPVQEMLSCLQSHPEHT
       :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GPQTSAEVYRILKQKGLLDKFPLFTAVYQICYESRPVQEMLSCLQSHPEHT
              310       320       330       340       350

>>CCDS8799.1 GPD1 gene_id:2819|Hs108|chr12 (349 aa)
 initn: 1733 init1: 1733 opt: 1733 Z-score: 2317.1 bits: 437.2 E(32554): 9.6e-123
Smith-Waterman score: 1733; 71.8% identity (90.5% similar) in 347 aa overlap (4-350:2-348)

               10        20        30        40        50        60
pF1KA0 MAAAPLKVCIVGSGNWGSAVAKIIGNNVKKLQKFASTVKMWVFEETVNGRKLTDIINNDH
          :  :::::::::::::.:::.:.:. .: .:   : :::::: ..:.:::.:::..:
CCDS87   MASKKVCIVGSGNWGSAIAKIVGGNAAQLAQFDPRVTMWVFEEDIGGKKLTEIINTQH
                 10        20        30        40        50

               70        80        90       100       110       120
pF1KA0 ENVKYLPGHKLPENVVAMSNLSEAVQDADLLVFVIPHQFIHRICDEITGRVPKKALGITL
       :::::::::::: ::::. .. .:..:::.:.::.::::: .:::.. :..  .: ::.:
CCDS87 ENVKYLPGHKLPPNVVAVPDVVQAAEDADILIFVVPHQFIGKICDQLKGHLKANATGISL
       60        70        80        90       100       110

              130       140       150       160       170       180
pF1KA0 IKGIDEGPEGLKLISDIIREKMGIDISVLMGANIANEVAAEKFCETTIGSKVMENGLLFK
       :::.::::.::::::..: :..:: .:::::::::.::: ::::::::: :   .: :.:
CCDS87 IKGVDEGPNGLKLISEVIGERLGIPMSVLMGANIASEVADEKFCETTIGCKDPAQGQLLK
      120       130       140       150       160       170

              190       200       210       220       230       240
pF1KA0 ELLQTPNFRITVVDDADTVELCGALKNIVAVGAGFCDGLRCGDNTKAAVIRLGLMEMIAF
       ::.::::::::::...::::.::::::.:::::::::::  :::::::::::::::::::
CCDS87 ELMQTPNFRITVVQEVDTVEICGALKNVVAVGAGFCDGLGFGDNTKAAVIRLGLMEMIAF
      180       190       200       210       220       230

              250       260       270       280       290       300
pF1KA0 ARIFCKGQVSTATFLESCGVADLITTCYGGRNRRVAEAFARTGKTIEELEKEMLNGQKLQ
       :..::.: ::.::::::::::::::::::::::.::::::::::.::.::::.:::::::
CCDS87 AKLFCSGPVSSATFLESCGVADLITTCYGGRNRKVAEAFARTGKSIEQLEKELLNGQKLQ
      240       250       260       270       280       290

              310       320       330       340       350
pF1KA0 GPQTSAEVYRILKQKGLLDKFPLFTAVYQICYESRPVQEMLSCLQSHPEHT
       ::.:. :.: ::..:::.:::::: :::..:::..:: :.. :::.::::
CCDS87 GPETARELYSILQHKGLVDKFPLFMAVYKVCYEGQPVGEFIHCLQNHPEHM
      300       310       320       330       340

>>CCDS58229.1 GPD1 gene_id:2819|Hs108|chr12 (326 aa)
 initn: 1292 init1: 1292 opt: 1292 Z-score: 1727.8 bits: 328.1 E(32554): 6.4e-90
Smith-Waterman score: 1581; 68.3% identity (84.4% similar) in 347 aa overlap (4-350:2-325)

               10        20        30        40        50        60
pF1KA0 MAAAPLKVCIVGSGNWGSAVAKIIGNNVKKLQKFASTVKMWVFEETVNGRKLTDIINNDH
          :  :::::::::::::.:::.:.:. .: .:   : :::::: ..:.:::.:::..:
CCDS58   MASKKVCIVGSGNWGSAIAKIVGGNAAQLAQFDPRVTMWVFEEDIGGKKLTEIINTQH
                 10        20        30        40        50

               70        80        90       100       110       120
pF1KA0 ENVKYLPGHKLPENVVAMSNLSEAVQDADLLVFVIPHQFIHRICDEITGRVPKKALGITL
       :::::::::::: ::                       :: .:::.. :..  .: ::.:
CCDS58 ENVKYLPGHKLPPNV-----------------------FIGKICDQLKGHLKANATGISL
       60        70                               80        90

              130       140       150       160       170       180
pF1KA0 IKGIDEGPEGLKLISDIIREKMGIDISVLMGANIANEVAAEKFCETTIGSKVMENGLLFK
       :::.::::.::::::..: :..:: .:::::::::.::: ::::::::: :   .: :.:
CCDS58 IKGVDEGPNGLKLISEVIGERLGIPMSVLMGANIASEVADEKFCETTIGCKDPAQGQLLK
         100       110       120       130       140       150

              190       200       210       220       230       240
pF1KA0 ELLQTPNFRITVVDDADTVELCGALKNIVAVGAGFCDGLRCGDNTKAAVIRLGLMEMIAF
       ::.::::::::::...::::.::::::.:::::::::::  :::::::::::::::::::
CCDS58 ELMQTPNFRITVVQEVDTVEICGALKNVVAVGAGFCDGLGFGDNTKAAVIRLGLMEMIAF
         160       170       180       190       200       210

              250       260       270       280       290       300
pF1KA0 ARIFCKGQVSTATFLESCGVADLITTCYGGRNRRVAEAFARTGKTIEELEKEMLNGQKLQ
       :..::.: ::.::::::::::::::::::::::.::::::::::.::.::::.:::::::
CCDS58 AKLFCSGPVSSATFLESCGVADLITTCYGGRNRKVAEAFARTGKSIEQLEKELLNGQKLQ
         220       230       240       250       260       270

              310       320       330       340       350
pF1KA0 GPQTSAEVYRILKQKGLLDKFPLFTAVYQICYESRPVQEMLSCLQSHPEHT
       ::.:. :.: ::..:::.:::::: :::..:::..:: :.. :::.::::
CCDS58 GPETARELYSILQHKGLVDKFPLFMAVYKVCYEGQPVGEFIHCLQNHPEHM
         280       290       300       310       320

351 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov 2 18:00:18 2016 done: Wed Nov 2 18:00:18 2016
 Total Scan time: 2.520 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]