FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0507, 259 aa
  1>>>pF1KE0507 259 - 259 aa - 259 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.3379+/-0.000593; mu= 15.5049+/- 0.036
 mean_var=71.9277+/-14.067, 0's: 0 Z-trim(113.1): 29  B-trim: 0 in 0/52
 Lambda= 0.151226
 statistics sampled from 13775 (13796) to 13775 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.424), width:  16
 Scan time:  2.620

The best scores are:                                      opt bits E(32554)
CCDS33655.1 FAM109B gene_id:150368|Hs108|chr22     ( 259) 1799 400.7 5.1e-112
CCDS9152.1 FAM109A gene_id:144717|Hs108|chr12      ( 249)  482 113.4 1.5e-25
CCDS53833.1 FAM109A gene_id:144717|Hs108|chr12     ( 262)  482 113.4 1.6e-25

>>CCDS33655.1 FAM109B gene_id:150368|Hs108|chr22          (259 aa)
 initn: 1799 init1: 1799 opt: 1799  Z-score: 2124.6  bits: 400.7 E(32554): 5.1e-112
Smith-Waterman score: 1799; 100.0% identity (100.0% similar) in 259 aa overlap (1-259:1-259)

               10        20        30        40        50        60
pF1KE0 MKLNERSVAHYALSDSPADHMGFLRTWGGPGTPPTPSGTGRRCWFVLKGNLLFSFESREG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MKLNERSVAHYALSDSPADHMGFLRTWGGPGTPPTPSGTGRRCWFVLKGNLLFSFESREG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 RAPLSLVVLEGCTVELAEAPVPEEFAFAICFDAPGVRPHLLAAEGPAAQEAWVKVLSRAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 RAPLSLVVLEGCTVELAEAPVPEEFAFAICFDAPGVRPHLLAAEGPAAQEAWVKVLSRAS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 FGYMRLVVRELESQLQDARQSLALQRRSSWKSVASRCKPQAPNHRAAGLENGHCLSKDSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 FGYMRLVVRELESQLQDARQSLALQRRSSWKSVASRCKPQAPNHRAAGLENGHCLSKDSS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 PVGLVEEAGSRSAGWGLAEWELQGPASLLLGKGQSPVSPETSCFSTLHDWYGQEIVELRQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PVGLVEEAGSRSAGWGLAEWELQGPASLLLGKGQSPVSPETSCFSTLHDWYGQEIVELRQ
              190       200       210       220       230       240

              250
pF1KE0 CWQKRAQGSHSKCEEQDRP
       :::::::::::::::::::
CCDS33 CWQKRAQGSHSKCEEQDRP
              250

>>CCDS9152.1 FAM109A gene_id:144717|Hs108|chr12           (249 aa)
 initn: 522 init1: 211 opt: 482  Z-score: 572.0  bits: 113.4 E(32554): 1.5e-25
Smith-Waterman score: 484; 40.2% identity (60.5% similar) in 261 aa overlap (1-247:1-248)

               10        20        30        40        50        60
pF1KE0 MKLNERSVAHYALSDSPADHMGFLRTWGGPGTPPTPSGTGRRCWFVLKGNLLFSFESREG
       :::::::.: ::  :.:.:. :::   ::       ..  :: ::::.::.:: ::.  .
CCDS91 MKLNERSLAFYATCDAPVDNAGFLYKKGG-----RHAAYHRR-WFVLRGNMLFYFEDAAS
               10        20             30         40        50

               70        80        90       100       110       120
pF1KE0 RAPLSLVVLEGCTVELAEAPVPEEFAFAICFDAPGVRPHLLAAEGPAAQEAWVKVLSRAS
       : :.....::::::::.::   ::::::. : .  .: ..::::.  :.:.:::.:::::
CCDS91 REPVGVIILEGCTVELVEAA--EEFAFAVRFAGTRARTYVLAAESQDAMEGWVKALSRAS
           60        70          80        90       100       110

              130       140       150       160       170       180
pF1KE0 FGYMRLVVRELESQLQDARQSLALQRRSSWKSVASRCKPQAPNHRAAGLENGHCLSKDSS
       : :.::::::::.::  .: . ..    .  .   .  :  :.  .: :     : .  .
CCDS91 FDYLRLVVRELEQQLAAVRGGGGM----ALPQPQPQSLPLPPSLPSA-LAPVPSLPSAPA
            120       130           140       150        160

              190           200               210        220
pF1KE0 PVGLVEEAGSRSA----GWGLAEWELQG--------PASLLLGKGQSPVSP-ETSCFSTL
       ::  .      ::      : : :  ..        :      ....: .: . . :. :
CCDS91 PVPALPLPRRPSALPPKENGCAVWSTEATFRPGPEPPPPPPRRRASAPHGPLDMAPFARL
       170       180       190       200       210       220

       230       240        250
pF1KE0 HDWYGQEIVELRQCW-QKRAQGSHSKCEEQDRP
       :. :::::  ::  : ..:.:
CCDS91 HECYGQEIRALRGQWLSSRVQP
       230       240

>>CCDS53833.1 FAM109A gene_id:144717|Hs108|chr12          (262 aa)
 initn: 522 init1: 211 opt: 482  Z-score: 571.7  bits: 113.4 E(32554): 1.6e-25
Smith-Waterman score: 484; 40.2% identity (60.5% similar) in 261 aa overlap (1-247:14-261)

                            10        20        30        40
pF1KE0              MKLNERSVAHYALSDSPADHMGFLRTWGGPGTPPTPSGTGRRCWFVL
                    :::::::.: ::  :.:.:. :::   ::       ..  :: ::::
CCDS53 MAPGSPPGPAIATMKLNERSLAFYATCDAPVDNAGFLYKKGG-----RHAAYHRR-WFVL
               10        20        30        40             50

        50        60        70        80        90       100
pF1KE0 KGNLLFSFESREGRAPLSLVVLEGCTVELAEAPVPEEFAFAICFDAPGVRPHLLAAEGPA
       .::.:: ::.  .: :.....::::::::.::   ::::::. : .  .: ..::::.
CCDS53 RGNMLFYFEDAASREPVGVIILEGCTVELVEAA--EEFAFAVRFAGTRARTYVLAAESQD
           60        70        80          90       100       110

       110       120       130       140       150       160
pF1KE0 AQEAWVKVLSRASFGYMRLVVRELESQLQDARQSLALQRRSSWKSVASRCKPQAPNHRAA
       :.:.:::.:::::: :.::::::::.::  .: . ..    .  .   .  :  :.  .:
CCDS53 AMEGWVKALSRASFDYLRLVVRELEQQLAAVRGGGGM----ALPQPQPQSLPLPPSLPSA
            120       130       140           150       160

       170       180       190           200               210
pF1KE0 GLENGHCLSKDSSPVGLVEEAGSRSA----GWGLAEWELQG--------PASLLLGKGQS
        :     : .  .::  .      ::      : : :  ..        :      ....
CCDS53 -LAPVPSLPSAPAPVPALPLPRRPSALPPKENGCAVWSTEATFRPGPEPPPPPPRRRASA
       170       180       190       200       210       220

          220       230       240        250
pF1KE0 PVSP-ETSCFSTLHDWYGQEIVELRQCW-QKRAQGSHSKCEEQDRP
       : .: . . :. ::. :::::  ::  : ..:.:
CCDS53 PHGPLDMAPFARLHECYGQEIRALRGQWLSSRVQP
       230       240       250       260


259 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov  3 03:43:18 2016 done: Thu Nov  3 03:43:18 2016
 Total Scan time:  2.620 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]