FASTA searches a protein or DNA sequence data bank
36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8925, 273 aa
1>>>pF1KB8925 273 - 273 aa - 273 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.1390+/-0.000687; mu= 8.5172+/- 0.042
mean_var=145.0253+/-30.733, 0's: 0 Z-trim(115.1): 168 B-trim: 853 in 1/50
Lambda= 0.106501
statistics sampled from 15442 (15628) to 15442 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.48), width: 16
Scan time: 2.850

The best scores are:                              opt  bits E(32554)
CCDS13145.1 2 gene_id:4821|Hs108|chr20    ( 273) 1845 294.1 7.1e-80
CCDS41558.1 3 gene_id:159296|Hs108|chr10  ( 364)  481  84.6 1.1e-16
CCDS4387.1 5 gene_id:1482|Hs108|chr5      ( 324)  466  82.3 4.9e-16
CCDS9660.1 8 gene_id:26257|Hs108|chr14    ( 239)  436  77.6 9.5e-15
CCDS9659.1 1 gene_id:7080|Hs108|chr14     ( 371)  368  67.3 1.9e-11
CCDS41945.1 1 gene_id:7080|Hs108|chr14    ( 401)  368  67.3   2e-11
CCDS42855.1 4 gene_id:644524|Hs108|chr20  ( 354)  365  66.8 2.5e-11

>>CCDS13145.1 2 gene_id:4821|Hs108|chr20 (273 aa)
initn: 1845 init1: 1845 opt: 1845 Z-score: 1547.5 bits: 294.1 E(32554): 7.1e-80
Smith-Waterman score: 1845; 100.0% identity (100.0% similar) in 273 aa overlap (1-273:1-273)

10 20 30 40 50 60
pF1KB8 MSLTNTKTGFSVKDILDLPDTNDEEGSVAEGPEEENEGPEPAKRAGPLGQGALDAVQSLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MSLTNTKTGFSVKDILDLPDTNDEEGSVAEGPEEENEGPEPAKRAGPLGQGALDAVQSLP
10 20 30 40 50 60

70 80 90 100 110 120
pF1KB8 LKNPFYDSSDNPYTRWLASTEGLQYSLHGLAAGAPPQDSSSKSPEPSADESPDNDKETPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LKNPFYDSSDNPYTRWLASTEGLQYSLHGLAAGAPPQDSSSKSPEPSADESPDNDKETPG
70 80 90 100 110 120

130 140 150 160 170 180
pF1KB8 GGGDAGKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIRLTPTQVKIWFQNHR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GGGDAGKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIRLTPTQVKIWFQNHR
130 140 150 160 170 180

190 200 210 220 230 240
pF1KB8 YKMKRARAEKGMEVTPLPSPRRVAVPVLVRDGKPCHALKAQDLAAATFQAGIPFSAYSAQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 YKMKRARAEKGMEVTPLPSPRRVAVPVLVRDGKPCHALKAQDLAAATFQAGIPFSAYSAQ
190 200 210 220 230 240

250 260 270
pF1KB8 SLQHMQYNAQYSSASTPQYPTAHPLVQAQQWTW
:::::::::::::::::::::::::::::::::
CCDS13 SLQHMQYNAQYSSASTPQYPTAHPLVQAQQWTW
250 260 270

>>CCDS41558.1 3 gene_id:159296|Hs108|chr10 (364 aa)
initn: 477 init1: 334 opt: 481 Z-score: 413.1 bits: 84.6 E(32554): 1.1e-16
Smith-Waterman score: 485; 40.1% identity (61.3% similar) in 279 aa overlap (6-255:8-275)

10 20 30
pF1KB8 MSLTNTKTGFSVKDILDLPDTN----------DEEG---------SVAEGPEEENEGP
:.: :::::::.: . . : : ..::: . . :
CCDS41 MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGE
10 20 30 40 50 60

40 50 60 70 80 90
pF1KB8 EPAKRAGPLGQGALDAVQSLPLKNPFYDSSDNP--YTRWL---ASTEGLQYSLHGLAAGA
: . : :. ..:: . ::. : :.. . . .: .. . ..
CCDS41 EDEEDEGE----KLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRD
70 80 90 100 110

100 110 120 130 140 150
pF1KB8 PPQDSSS--KSPEPSADESPDNDKETPGGGGDAGKKRKRRVLFSKAQTYELERRFRQQRY
: : . :: : ..: . ...: : ..:: :::::.::..::::::.::::
CCDS41 RSQKSCQLKKSLETAGDCKAAEESERP----KPRSRRKPRVLFSQAQVFELERRFKQQRY
120 130 140 150 160 170

160 170 180 190 200
pF1KB8 LSAPEREHLASLIRLTPTQVKIWFQNHRYKMKRARAEKGMEV---TPLPSPRRVAVPVLV
::::::::::: ..:: :::::::::.::: :: : .:..:. .: : ::::::::::
CCDS41 LSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLV
180 190 200 210 220 230

210 220 230 240 250 260
pF1KB8 RDGKPCHALKAQDLAAATFQAGIPFSAYSAQSLQHMQYNAQYSSASTPQYPTAHPLVQAQ
:::::: . .:: .: ...: :::: .:. . :. . ..:.
CCDS41 RDGKPCVTPSAQAYGAP-YSVGA--SAYSYNSFPAYGYGNSAAAAAAAAAAAAAAAAYSS
240 250 260 270 280

270
pF1KB8 QWTW
CCDS41 SYGCAYPAGGGGGGGGTSAATTAMQPACSAAGGGPFVNVSNLGGFGSGGSAQPLHQGTAA
290 300 310 320 330 340

>>CCDS4387.1 5 gene_id:1482|Hs108|chr5 (324 aa)
initn: 383 init1: 348 opt: 466 Z-score: 401.3 bits: 82.3 E(32554): 4.9e-16
Smith-Waterman score: 466; 37.2% identity (60.3% similar) in 282 aa overlap (6-264:8-275)

10 20 30 40
pF1KB8 MSLTNTKTGFSVKDILDLPDTNDEEGSVAEGPE-----EENEGP--------EPAKRA
: : :::::::.: .... :.: . : : . .: .: :
CCDS43 MFPSPALTPTPFSVKDILNL---EQQQRSLAAAGELSARLEATLAPSSCMLAAFKPEAYA
10 20 30 40 50

50 60 70 80 90 100
pF1KB8 GPLGQGALDAVQSLPLKNPFYDSSDNPYTRWLASTEGLQYSLHGLAAGAPPQDSSSKSPE
:: . :. .:: . .: : . . .. . : .: ... :
CCDS43 GPEA-----AAPGLPELRAELGRAPSPAKCASAFPAAPAFYPRAYSDPDPAKDPRAEKKE
60 70 80 90 100 110

110 120 130 140 150 160
pF1KB8 PSADESPDNDKETPGGGGD---AGKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLA
: .. . ..: . ... : ..:: :::::.::.:::::::.::::::::::..::
CCDS43 LCALQKAVELEKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQRYLSAPERDQLA
120 130 140 150 160 170

170 180 190 200 210
pF1KB8 SLIRLTPTQVKIWFQNHRYKMKRARAEKGMEVTPLPSP-----RRVAVPVLVRDGKPCHA
:...:: :::::::::.::: :: : .. .:.. :: : ::.:::::::::::: .
CCDS43 SVLKLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRIAVPVLVRDGKPCLG
180 190 200 210 220 230

220 230 240 250 260 270
pF1KB8 LKAQDLAAATFQAGIPFSAYSAQSLQHMQYNAQYSSASTPQYP--TAHPLVQAQQWTW
.: : .. .:. .:.: . : . ..: .: : .:.:
CCDS43 DSAP--YAPAYGVGLNPYGYNA----YPAYPGYGGAACSPGYSCTAAYPAGPSPAQPATA
240 250 260 270 280

CCDS43 AANNNFVNFGVGDLNAVQSPGIPQSNSGVSTLHGIRAW
290 300 310 320

>>CCDS9660.1 8 gene_id:26257|Hs108|chr14 (239 aa)
initn: 557 init1: 388 opt: 436 Z-score: 378.2 bits: 77.6 E(32554): 9.5e-15
Smith-Waterman score: 526; 45.8% identity (64.0% similar) in 236 aa overlap (54-273:21-239)

30 40 50 60 70 80
pF1KB8 EEGSVAEGPEEENEGPEPAKRAGPLGQGALDAVQSLPLKNPFYDSSD-NPYTRWLASTEG
:: : :: ..: . . .: . :: : .:
CCDS96 MATSGRLSFTVRSLLDLPEQDA-QHLPRREPEPRAPQPDPCAAWLDSERG
10 20 30 40

90 100 110 120 130 140
pF1KB8 LQYSLHGLAAGAPPQDSSS-KSPEPSADESPDNDKETPGGGGDAGKKRKRRVLFSKAQTY
.: : .: :: .. :.... :. .::. :: :..:::::::::::
CCDS96 -HY---------PSSDESSLETSPPDSSQRPSARPASPGS--DAEKRKKRRVLFSKAQTL
50 60 70 80 90

150 160 170 180 190
pF1KB8 ELERRFRQQRYLSAPEREHLASLIRLTPTQVKIWFQNHRYKMKRARAEKGMEVTPLPS--
::::::::::::::::::.::::.:::::::::::::::::.::::: . : : .
CCDS96 ELERRFRQQRYLSAPEREQLASLLRLTPTQVKIWFQNHRYKLKRARAPGAAESPDLAASA
100 110 120 130 140 150

200 210 220 230 240
pF1KB8 -----P---RRVAVPVLVRDGKPCHALKAQDL--AAATFQAGIPFSAYSAQSLQHMQYNA
: :::.::::::::.:: . . .. ::: . : : .: : : : :
CCDS96 ELHAAPGLLRRVVVPVLVRDGQPCGGGGGGEVGTAAAQEKCGAPPAA--ACPLP--GYPA
160 170 180 190 200 210

250 260 270
pF1KB8 QYSSASTPQYPTAHPLVQAQ--QWTW
... .:. . :.. .:.:
CCDS96 FGPGSALGLFPAYQHLASPALVSWNW
220 230

>>CCDS9659.1 1 gene_id:7080|Hs108|chr14 (371 aa)
initn: 491 init1: 357 opt: 368 Z-score: 319.2 bits: 67.3 E(32554): 1.9e-11
Smith-Waterman score: 456; 38.9% identity (59.1% similar) in 257 aa overlap (26-255:62-303)

10 20 30 40 50
pF1KB8 MSLTNTKTGFSVKDILDLPDTNDEEGSVAEGPEEENEG-PEPAKRA-GPLGQGAL
:.:. . . : :. .. : : .: :
CCDS96 GGLGAPLAAYRQGQAAPPTAAMQQHAVGHHGAVTAAYHMTAAGVPQLSHSAVGGYCNGNL
40 50 60 70 80 90

60 70 80 90 100 110
pF1KB8 DAVQSLPLKNPFYDSSDNPYTR--WLASTEGLQY-SLHGLAAGAPPQDSSSKSPEPSADE
.. :: :. :. : . : ... .. .. . . : .. :. . : .
CCDS96 GNMSELP---PYQDTMRNSASGPGWYGANPDPRFPAISRFMGPASGMNMSGMGGLGSLGD
100 110 120 130 140

120 130 140 150 160 170
pF1KB8 SPDNDKETPGGGGDAGKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIRLTPT
: : .. .::::::::.::.:::::::.::.::::::::::::.:.::::
CCDS96 VSKNMAPLP-----SAPRRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPT
150 160 170 180 190 200

180 190 200
pF1KB8 QVKIWFQNHRYKMKRARAEKGMEV--------------TPLP--------SPRRVAVPVL
::::::::::::::: .:. . : : ::::::::::
CCDS96 QVKIWFQNHRYKMKRQAKDKAAQQQLQQDSGGGGGGGGTGCPQQQQAQQQSPRRVAVPVL
210 220 230 240 250 260

210 220 230 240 250 260
pF1KB8 VRDGKPCHALKAQDLAAATFQAGIPFSAYSAQSLQHMQYNAQYSSASTPQYPTAHPLVQA
:.:::::.: : .::..:. .. :. ::. :: ..:.
CCDS96 VKDGKPCQA-GAPAPGAASLQG------HAQQQAQHQAQAAQAAAAAISVGSGGAGLGAH
270 280 290 300 310

270
pF1KB8 QQWTW
CCDS96 PGHQPGSAGQSPDLAHHAASPAALQGQVSSLSHLNSSGSDYGTMSCSTLLYGRTW
320 330 340 350 360 370

>>CCDS41945.1 1 gene_id:7080|Hs108|chr14 (401 aa)
initn: 491 init1: 357 opt: 368 Z-score: 318.7 bits: 67.3 E(32554): 2e-11
Smith-Waterman score: 456; 38.9% identity (59.1% similar) in 257 aa overlap (26-255:92-333)

10 20 30 40 50
pF1KB8 MSLTNTKTGFSVKDILDLPDTNDEEGSVAEGPEEENEG-PEPAKRA-GPLGQGAL
:.:. . . : :. .. : : .: :
CCDS41 GGLGAPLAAYRQGQAAPPTAAMQQHAVGHHGAVTAAYHMTAAGVPQLSHSAVGGYCNGNL
70 80 90 100 110 120

60 70 80 90 100 110
pF1KB8 DAVQSLPLKNPFYDSSDNPYTR--WLASTEGLQY-SLHGLAAGAPPQDSSSKSPEPSADE
.. :: :. :. : . : ... .. .. . . : .. :. . : .
CCDS41 GNMSELP---PYQDTMRNSASGPGWYGANPDPRFPAISRFMGPASGMNMSGMGGLGSLGD
130 140 150 160 170

120 130 140 150 160 170
pF1KB8 SPDNDKETPGGGGDAGKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIRLTPT
: : .. .::::::::.::.:::::::.::.::::::::::::.:.::::
CCDS41 VSKNMAPLP-----SAPRRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPT
180 190 200 210 220 230

180 190 200
pF1KB8 QVKIWFQNHRYKMKRARAEKGMEV--------------TPLP--------SPRRVAVPVL
::::::::::::::: .:. . : : ::::::::::
CCDS41 QVKIWFQNHRYKMKRQAKDKAAQQQLQQDSGGGGGGGGTGCPQQQQAQQQSPRRVAVPVL
240 250 260 270 280 290

210 220 230 240 250 260
pF1KB8 VRDGKPCHALKAQDLAAATFQAGIPFSAYSAQSLQHMQYNAQYSSASTPQYPTAHPLVQA
:.:::::.: : .::..:. .. :. ::. :: ..:.
CCDS41 VKDGKPCQA-GAPAPGAASLQG------HAQQQAQHQAQAAQAAAAAISVGSGGAGLGAH
300 310 320 330 340

270
pF1KB8 QQWTW
CCDS41 PGHQPGSAGQSPDLAHHAASPAALQGQVSSLSHLNSSGSDYGTMSCSTLLYGRTW
350 360 370 380 390 400

>>CCDS42855.1 4 gene_id:644524|Hs108|chr20 (354 aa)
initn: 536 init1: 357 opt: 365 Z-score: 316.9 bits: 66.8 E(32554): 2.5e-11
Smith-Waterman score: 474; 46.9% identity (63.0% similar) in 192 aa overlap (36-216:117-288)

10 20 30 40 50 60
pF1KB8 TKTGFSVKDILDLPDTNDEEGSVAEGPEEENEGPEPAKRAGPLGQGALDAVQSLPLKNPF
: : :: : : .: . : .:
CCDS42 AAAAATYHMPPGVSQFPHGAMGSYCNGGLGNMGELPAYTDGMRGGAATGWYGANP--DPR
90 100 110 120 130 140

70 80 90 100 110 120
pF1KB8 YDSSDNPYTRWLASTEGLQYSLHGLAAGAPPQDSSSKSPEPSADESPDNDKETPGGGGDA
:.: .:... . :.. . : .: ...:: : .... :
CCDS42 YSS----ISRFMGPSAGVNVAGMGSLTGIA---DAAKSLGP-----------LHAAAAAA
150 160 170 180

130 140 150 160 170 180
pF1KB8 GKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIRLTPTQVKIWFQNHRYKMKR
. .::::::::.::.:::::::.::.::::::::::::.:.:::::::::::::::::::
CCDS42 APRRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKR
190 200 210 220 230 240

190 200 210 220 230
pF1KB8 ARAEK-----------GMEVTPLPSPRRVAVPVLVRDGKPCHALKAQDLAAATFQAGIPF
.: : : ::::::::::::.:::::.
CCDS42 QAKDKAAQQLQQEGGLGPPPPPPPSPRRVAVPVLVKDGKPCQNGASTPTPGQAGPQPPAP
250 260 270 280 290 300

240 250 260 270
pF1KB8 SAYSAQSLQHMQYNAQYSSASTPQYPTAHPLVQAQQWTW
CCDS42 TPAPELEELSPSPPALHGPGGGLAALDAAAGEYSGGVLGANLLYGRTW
310 320 330 340 350

273 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:31:11 2016 done: Fri Nov 4 16:31:11 2016
Total Scan time: 2.850 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]
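The per-hit records in this report (each beginning with `>>`) follow a regular field order, so they can be pulled into a table for downstream filtering. The sketch below is one possible way to do that, not part of the FASTA package: the names `HIT_RE`, `parse_hits`, and `SAMPLE` are hypothetical, and the pattern assumes the field order seen in this particular report (`initn`, `init1`, `opt`, `Z-score`, `bits`, `E()`, then the Smith-Waterman summary). Whitespace is matched loosely so it should not matter whether the report's original line breaks are intact.

```python
import re

# Assumption: every hit record starts with ">>" (the ">>>" query line is
# skipped) and lists its fields in the order seen in this report.
HIT_RE = re.compile(
    r"(?<!>)>>(?!>)(?P<acc>\S+)\s+(?P<desc>.*?)\s+\((?P<length>\d+) aa\)\s+"
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<zscore>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\(\d+\):\s*(?P<evalue>\S+)\s+"
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<ident>[\d.]+)% identity\s+"
    r"\((?P<sim>[\d.]+)% similar\) in (?P<overlap>\d+) aa overlap"
)


def parse_hits(report):
    """Return one dict per hit record found in a FASTA report string."""
    hits = []
    for m in HIT_RE.finditer(report):
        hits.append({
            "acc": m.group("acc"),
            "desc": m.group("desc"),
            "length": int(m.group("length")),
            "opt": int(m.group("opt")),
            "zscore": float(m.group("zscore")),
            "bits": float(m.group("bits")),
            "evalue": float(m.group("evalue")),
            "sw_score": int(m.group("sw")),
            "identity_pct": float(m.group("ident")),
            "similar_pct": float(m.group("sim")),
            "overlap": int(m.group("overlap")),
        })
    return hits


# Two records copied from the report above, preceded by the query line.
SAMPLE = (
    "1>>>pF1KB8925 273 - 273 aa - 273 aa "
    ">>CCDS13145.1 2 gene_id:4821|Hs108|chr20 (273 aa) "
    "initn: 1845 init1: 1845 opt: 1845 Z-score: 1547.5 bits: 294.1 "
    "E(32554): 7.1e-80 Smith-Waterman score: 1845; 100.0% identity "
    "(100.0% similar) in 273 aa overlap (1-273:1-273) "
    ">>CCDS41558.1 3 gene_id:159296|Hs108|chr10 (364 aa) "
    "initn: 477 init1: 334 opt: 481 Z-score: 413.1 bits: 84.6 "
    "E(32554): 1.1e-16 Smith-Waterman score: 485; 40.1% identity "
    "(61.3% similar) in 279 aa overlap (6-255:8-275)"
)

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit["acc"], hit["evalue"], hit["identity_pct"])
```

For production use, an established parser for FASTA output is probably a safer choice than a hand-written pattern, since other FASTA versions and output modes lay these fields out differently.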