FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0463, 416 aa
  1>>>pF1KE0463 416 - 416 aa - 416 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3815+/-0.000918; mu= 12.1487+/- 0.056
 mean_var=81.9823+/-16.031, 0's: 0 Z-trim(106.6): 14 B-trim: 3 in 1/51
 Lambda= 0.141649
 statistics sampled from 9057 (9060) to 9057 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.652), E-opt: 0.2 (0.278), width: 16
 Scan time: 2.960

The best scores are:                                      opt  bits E(32554)
CCDS4069.1 FAM172A gene_id:83989|Hs108|chr5        ( 416) 2756 572.9 1.9e-163
CCDS54880.1 FAM172A gene_id:83989|Hs108|chr5       ( 370) 2468 514.1 8.9e-146
CCDS54879.1 FAM172A gene_id:83989|Hs108|chr5       ( 306) 1517 319.7 2.4e-87

>>CCDS4069.1 FAM172A gene_id:83989|Hs108|chr5 (416 aa)
 initn: 2756 init1: 2756 opt: 2756 Z-score: 3047.9 bits: 572.9 E(32554): 1.9e-163
Smith-Waterman score: 2756; 99.5% identity (99.8% similar) in 416 aa overlap (1-416:1-416)

               10        20        30        40        50        60
pF1KE0 MSISLSSLILLPIWINMAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 MSISLSSLILLPIWINMAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 EGFEYAFNEKGQLRHIKTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 EGFEYAFNEKGQLRHIKTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 KVSIPVDATESEPKSFIFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 KVSIPVDATESEPKSFIFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPFIKRAVAEGYGVIVLNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 IPFIKRAVAEGYGVIVLNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 RDFYEKYRNPQREKEMMQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 RDFYEKYRNPQREKEMMQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFV
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 ELMIQREADVKNKVTAVALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ELMIQREADVKNKVTAVALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESM
              310       320       330       340       350       360

              370       380       390       400       410
pF1KE0 LPDCPRVPAGTDRHELTSWKSFPSIFKFFTEASEAKTNSLKPAVTRRSHRIKHEEL
       ::::::: :::::::::::::::::::::::::::::.::::::::::::::::::
CCDS40 LPDCPRVSAGTDRHELTSWKSFPSIFKFFTEASEAKTSSLKPAVTRRSHRIKHEEL
              370       380       390       400       410

>>CCDS54880.1 FAM172A gene_id:83989|Hs108|chr5 (370 aa)
 initn: 2468 init1: 2468 opt: 2468 Z-score: 2730.6 bits: 514.1 E(32554): 8.9e-146
Smith-Waterman score: 2468; 99.5% identity (99.7% similar) in 370 aa overlap (47-416:1-370)

         20        30        40        50        60        70
pF1KE0 MAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                     ::::::::::::::::::::::::::::::
CCDS54                               MKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                             10        20        30

         80        90       100       110       120       130
pF1KE0 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIPVDATESEPKSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIPVDATESEPKSF
               40        50        60        70        80        90

        140       150       160       170       180       190
pF1KE0 IFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQIPFIKRAVAEGYGVIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQIPFIKRAVAEGYGVIV
              100       110       120       130       140       150

        200       210       220       230       240       250
pF1KE0 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
              160       170       180       190       200       210

        260       270       280       290       300       310
pF1KE0 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
              220       230       240       250       260       270

        320       330       340       350       360       370
pF1KE0 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVPAGTDRHEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::
CCDS54 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVSAGTDRHEL
              280       290       300       310       320       330

        380       390       400       410
pF1KE0 TSWKSFPSIFKFFTEASEAKTNSLKPAVTRRSHRIKHEEL
       :::::::::::::::::::::.::::::::::::::::::
CCDS54 TSWKSFPSIFKFFTEASEAKTSSLKPAVTRRSHRIKHEEL
              340       350       360       370

>>CCDS54879.1 FAM172A gene_id:83989|Hs108|chr5 (306 aa)
 initn: 1517 init1: 1517 opt: 1517 Z-score: 1681.6 bits: 319.7 E(32554): 2.4e-87
Smith-Waterman score: 1918; 82.2% identity (82.4% similar) in 370 aa overlap (47-416:1-306)

         20        30        40        50        60        70
pF1KE0 MAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                     ::::::::::::::::::::::::::::::
CCDS54                               MKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                             10        20        30

         80        90       100       110       120       130
pF1KE0 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIPVDATESEPKSF
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIP-----------
               40        50        60        70

        140       150       160       170       180       190
pF1KE0 IFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQIPFIKRAVAEGYGVIV
                                                            :::::::
CCDS54 -----------------------------------------------------EGYGVIV
                                                           80

        200       210       220       230       240       250
pF1KE0 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
         90       100       110       120       130       140

        260       270       280       290       300       310
pF1KE0 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
        150       160       170       180       190       200

        320       330       340       350       360       370
pF1KE0 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVPAGTDRHEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::
CCDS54 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVSAGTDRHEL
        210       220       230       240       250       260

        380       390       400       410
pF1KE0 TSWKSFPSIFKFFTEASEAKTNSLKPAVTRRSHRIKHEEL
       :::::::::::::::::::::.::::::::::::::::::
CCDS54 TSWKSFPSIFKFFTEASEAKTSSLKPAVTRRSHRIKHEEL
        270       280       290       300

416 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 07:06:06 2016 done: Thu Nov 3 07:06:06 2016
 Total Scan time: 2.960 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]