FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6927, 207 aa
  1>>>pF1KB6927 207 - 207 aa - 207 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9259+/-0.000795; mu= 10.3657+/- 0.047
 mean_var=94.8799+/-18.814, 0's: 0 Z-trim(109.5): 187 B-trim: 141 in 2/50
 Lambda= 0.131670
 statistics sampled from 10698 (10932) to 10698 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.718), E-opt: 0.2 (0.336), width: 16
 Scan time: 1.740

The best scores are:                                 opt  bits E(32554)
CCDS12757.1 LIN7B gene_id:64130|Hs108|chr19  ( 207) 1319 260.4 5.8e-70
CCDS7864.1 LIN7C gene_id:55327|Hs108|chr11   ( 197) 1109 220.5 5.7e-58
CCDS9021.1 LIN7A gene_id:8825|Hs108|chr12    ( 233) 1038 207.0 7.4e-54
CCDS77328.1 LIN7B gene_id:64130|Hs108|chr19  ( 137)  503 105.2 1.9e-23
CCDS73357.1 DLG2 gene_id:1740|Hs108|chr11    ( 852)  293  65.9 8.2e-11
CCDS41696.1 DLG2 gene_id:1740|Hs108|chr11    ( 870)  293  65.9 8.3e-11
CCDS44691.1 DLG2 gene_id:1740|Hs108|chr11    ( 749)  292  65.7 8.4e-11
CCDS55782.1 DLG2 gene_id:1740|Hs108|chr11    ( 909)  293  65.9 8.6e-11
CCDS44690.1 DLG2 gene_id:1740|Hs108|chr11    ( 975)  293  66.0 9.1e-11

>>CCDS12757.1 LIN7B gene_id:64130|Hs108|chr19 (207 aa)
 initn: 1319 init1: 1319 opt: 1319 Z-score: 1369.5 bits: 260.4 E(32554): 5.8e-70
Smith-Waterman score: 1319; 100.0% identity (100.0% similar) in 207 aa overlap (1-207:1-207)

               10        20        30        40        50        60
pF1KB6 MAALVEPLGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSRFCSAIREVYEQLYDT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MAALVEPLGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSRFCSAIREVYEQLYDT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 LDITGSAEIRAHATAKATVAAFTASEGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LDITGSAEIRAHATAKATVAAFTASEGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYIS
               70        80        90       100       110       120

130 140 150 160 170 180 pF1KB6 RVIPGGVADRHGGLKRGDQLLSVNGVSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RVIPGGVADRHGGLKRGDQLLSVNGVSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEE 130 140 150 160 170 180 190 200 pF1KB6 MEARFEKMRSARRRQQHQSYSSLESRG ::::::::::::::::::::::::::: CCDS12 MEARFEKMRSARRRQQHQSYSSLESRG 190 200 >>CCDS7864.1 LIN7C gene_id:55327|Hs108|chr11 (197 aa) initn: 1109 init1: 1109 opt: 1109 Z-score: 1154.3 bits: 220.5 E(32554): 5.7e-58 Smith-Waterman score: 1109; 84.7% identity (98.0% similar) in 196 aa overlap (1-196:1-196) 10 20 30 40 50 60 pF1KB6 MAALVEPLGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSRFCSAIREVYEQLYDT :::: ::. ::::. ::.::::.::::::.::::::::::::::.::.:.:::::..:.: CCDS78 MAALGEPVRLERDICRAIELLEKLQRSGEVPPQKLQALQRVLQSEFCNAVREVYEHVYET 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 LDITGSAEIRAHATAKATVAAFTASEGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYIS .::..: :.::.::::::::::.:::::.::::::::::.:::::::::::::::::::: CCDS78 VDISSSPEVRANATAKATVAAFAASEGHSHPRVVELPKTEEGLGFNIMGGKEQNSPIYIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 RVIPGGVADRHGGLKRGDQLLSVNGVSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEE :.::::.::::::::::::::::::::::::.:::::::::::::.:::::::::.:::: CCDS78 RIIPGGIADRHGGLKRGDQLLSVNGVSVEGEHHEKAVELLKAAQGKVKLVVRYTPKVLEE 130 140 150 160 170 180 190 200 pF1KB6 MEARFEKMRSARRRQQHQSYSSLESRG ::.::::::::.:::: CCDS78 MESRFEKMRSAKRRQQT 190 >>CCDS9021.1 LIN7A gene_id:8825|Hs108|chr12 (233 aa) initn: 1082 init1: 1028 opt: 1038 Z-score: 1080.3 bits: 207.0 E(32554): 7.4e-54 Smith-Waterman score: 1038; 78.5% identity (95.5% similar) in 200 aa overlap (1-198:14-213) 10 20 30 40 pF1KB6 MAAL--VEPLGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSR ::.: :.:: :.:::.::.::::.::.:::.: .:::.:..::::. CCDS90 MLKPSVTSAPTADMATLTVVQPLTLDRDVARAIELLEKLQESGEVPVHKLQSLKKVLQSE 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB6 FCSAIREVYEQLYDTLDITGSAEIRAHATAKATVAAFTASEGHAHPRVVELPKTDEGLGF ::.::::::. ...:. 
..: :.::.::::::::::.:::::.:::::::::::::::: CCDS90 FCTAIREVYQYMHETITVNGCPEFRARATAKATVAAFAASEGHSHPRVVELPKTDEGLGF 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB6 NIMGGKEQNSPIYISRVIPGGVADRHGGLKRGDQLLSVNGVSVEGEQHEKAVELLKAAQG :.::::::::::::::.::::::.::::::::::::::::::::::.:::::::::::. CCDS90 NVMGGKEQNSPIYISRIIPGGVAERHGGLKRGDQLLSVNGVSVEGEHHEKAVELLKAAKD 130 140 150 160 170 180 170 180 190 200 pF1KB6 SVKLVVRYTPRVLEEMEARFEKMRSARRRQQHQSYSSLESRG ::::::::::.:::::::::::.:.::::::.: CCDS90 SVKLVVRYTPKVLEEMEARFEKLRTARRRQQQQLLIQQQQQQQQQQTQQNHMS 190 200 210 220 230 >>CCDS77328.1 LIN7B gene_id:64130|Hs108|chr19 (137 aa) initn: 513 init1: 476 opt: 503 Z-score: 534.3 bits: 105.2 E(32554): 1.9e-23 Smith-Waterman score: 706; 66.2% identity (66.2% similar) in 207 aa overlap (1-207:1-137) 10 20 30 40 50 60 pF1KB6 MAALVEPLGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSRFCSAIREVYEQLYDT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MAALVEPLGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSRFCSAIREVYEQLYDT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 LDITGSAEIRAHATAKATVAAFTASEGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYIS :::::::::::::::: CCDS77 LDITGSAEIRAHATAK-------------------------------------------- 70 130 140 150 160 170 180 pF1KB6 RVIPGGVADRHGGLKRGDQLLSVNGVSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEE :::::::::::::::::::::::::::::::::: CCDS77 --------------------------SVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEE 80 90 100 110 190 200 pF1KB6 MEARFEKMRSARRRQQHQSYSSLESRG ::::::::::::::::::::::::::: CCDS77 MEARFEKMRSARRRQQHQSYSSLESRG 120 130 >>CCDS73357.1 DLG2 gene_id:1740|Hs108|chr11 (852 aa) initn: 369 init1: 187 opt: 293 Z-score: 307.6 bits: 65.9 E(32554): 8.2e-11 Smith-Waterman score: 293; 38.5% identity (68.9% similar) in 148 aa overlap (61-202:384-526) 40 50 60 70 80 pF1KB6 PPQKLQALQRVLQSRFCSAIREVYEQLYDTLDITGSAEIRAH-----ATAKATVAAFTAS : . ..:. .: :: . ... 
: CCDS73 LCDKPASPRHYSPVECDKSFLLSAPYSHYHLGLLPDSEMTSHSQHSTATRQPSMTLQRAV 360 370 380 390 400 410 90 100 110 120 130 140 pF1KB6 EGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYISRVIPGGVADRHGGLKRGDQLLSVNG ...:: : : : . ::::::.:: :.. :..: .. :: :: : :.::::.::::: CCDS73 SLEGEPRKVVLHKGSTGLGFNIVGG-EDGEGIFVSFILAGGPADLSGELQRGDQILSVNG 420 430 440 450 460 470 150 160 170 180 190 200 pF1KB6 VSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEEMEARFE-KMRSARRRQQHQSYSSLE ....: .::.:. ::.: .: ....: : : :::: :... :......:.:: CCDS73 IDLRGASHEQAAAALKGAGQTVTIIAQYQP----EDYARFEAKIHDLREQMMNHSMSSGS 480 490 500 510 520 pF1KB6 SRG CCDS73 GSLRTNQKRSLYVRAMFDYDKSKDSGLPSQGLSFKYGDILHVINASDDEWWQARRVMLEG 530 540 550 560 570 580 >>CCDS41696.1 DLG2 gene_id:1740|Hs108|chr11 (870 aa) initn: 369 init1: 187 opt: 293 Z-score: 307.5 bits: 65.9 E(32554): 8.3e-11 Smith-Waterman score: 293; 38.5% identity (68.9% similar) in 148 aa overlap (61-202:384-526) 40 50 60 70 80 pF1KB6 PPQKLQALQRVLQSRFCSAIREVYEQLYDTLDITGSAEIRAH-----ATAKATVAAFTAS : . ..:. .: :: . ... : CCDS41 LCDKPASPRHYSPVECDKSFLLSAPYSHYHLGLLPDSEMTSHSQHSTATRQPSMTLQRAV 360 370 380 390 400 410 90 100 110 120 130 140 pF1KB6 EGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYISRVIPGGVADRHGGLKRGDQLLSVNG ...:: : : : . ::::::.:: :.. :..: .. :: :: : :.::::.::::: CCDS41 SLEGEPRKVVLHKGSTGLGFNIVGG-EDGEGIFVSFILAGGPADLSGELQRGDQILSVNG 420 430 440 450 460 470 150 160 170 180 190 200 pF1KB6 VSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEEMEARFE-KMRSARRRQQHQSYSSLE ....: .::.:. ::.: .: ....: : : :::: :... :......:.:: CCDS41 IDLRGASHEQAAAALKGAGQTVTIIAQYQP----EDYARFEAKIHDLREQMMNHSMSSGS 480 490 500 510 520 pF1KB6 SRG CCDS41 GSLRTNQKRSLYVRAMFDYDKSKDSGLPSQGLSFKYGDILHVINASDDEWWQARRVMLEG 530 540 550 560 570 580 >>CCDS44691.1 DLG2 gene_id:1740|Hs108|chr11 (749 aa) initn: 369 init1: 187 opt: 292 Z-score: 307.4 bits: 65.7 E(32554): 8.4e-11 Smith-Waterman score: 292; 39.4% identity (71.1% similar) in 142 aa overlap (62-202:288-423) 40 50 60 70 80 90 pF1KB6 PQKLQALQRVLQSRFCSAIREVYEQLYDTLDITGSAEIRAHATAKATVAAFTASEGHAHP : :. .. .. :: . ... 
: ...: CCDS44 NNGTLEYKTSLPPISPGRYSPIPKHMLVDDDYTSHSQ-HSTATRQPSMTLQRAVSLEGEP 260 270 280 290 300 310 100 110 120 130 140 150 pF1KB6 RVVELPKTDEGLGFNIMGGKEQNSPIYISRVIPGGVADRHGGLKRGDQLLSVNGVSVEGE : : : : . ::::::.:: :.. :..: .. :: :: : :.::::.:::::....: CCDS44 RKVVLHKGSTGLGFNIVGG-EDGEGIFVSFILAGGPADLSGELQRGDQILSVNGIDLRGA 320 330 340 350 360 370 160 170 180 190 200 pF1KB6 QHEKAVELLKAAQGSVKLVVRYTPRVLEEMEARFE-KMRSARRRQQHQSYSSLESRG .::.:. ::.: .: ....: : : :::: :... :......:.:: CCDS44 SHEQAAAALKGAGQTVTIIAQYQP----EDYARFEAKIHDLREQMMNHSMSSGSGSLRTN 380 390 400 410 420 430 CCDS44 QKRSLYVRAMFDYDKSKDSGLPSQGLSFKYGDILHVINASDDEWWQARRVMLEGDSEEMG 440 450 460 470 480 490 >>CCDS55782.1 DLG2 gene_id:1740|Hs108|chr11 (909 aa) initn: 369 init1: 187 opt: 293 Z-score: 307.2 bits: 65.9 E(32554): 8.6e-11 Smith-Waterman score: 293; 38.5% identity (68.9% similar) in 148 aa overlap (61-202:423-565) 40 50 60 70 80 pF1KB6 PPQKLQALQRVLQSRFCSAIREVYEQLYDTLDITGSAEIRAH-----ATAKATVAAFTAS : . ..:. .: :: . ... : CCDS55 LCDKPASPRHYSPVECDKSFLLSAPYSHYHLGLLPDSEMTSHSQHSTATRQPSMTLQRAV 400 410 420 430 440 450 90 100 110 120 130 140 pF1KB6 EGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYISRVIPGGVADRHGGLKRGDQLLSVNG ...:: : : : . ::::::.:: :.. :..: .. :: :: : :.::::.::::: CCDS55 SLEGEPRKVVLHKGSTGLGFNIVGG-EDGEGIFVSFILAGGPADLSGELQRGDQILSVNG 460 470 480 490 500 510 150 160 170 180 190 200 pF1KB6 VSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEEMEARFE-KMRSARRRQQHQSYSSLE ....: .::.:. ::.: .: ....: : : :::: :... :......:.:: CCDS55 IDLRGASHEQAAAALKGAGQTVTIIAQYQP----EDYARFEAKIHDLREQMMNHSMSSGS 520 530 540 550 560 pF1KB6 SRG CCDS55 GSLRTNQKRSLYVRAMFDYDKSKDSGLPSQGLSFKYGDILHVINASDDEWWQARRVMLEG 570 580 590 600 610 620 >>CCDS44690.1 DLG2 gene_id:1740|Hs108|chr11 (975 aa) initn: 395 init1: 187 opt: 293 Z-score: 306.8 bits: 66.0 E(32554): 9.1e-11 Smith-Waterman score: 293; 38.5% identity (68.9% similar) in 148 aa overlap (61-202:489-631) 40 50 60 70 80 pF1KB6 PPQKLQALQRVLQSRFCSAIREVYEQLYDTLDITGSAEIRAH-----ATAKATVAAFTAS : . ..:. .: :: . ... 
: CCDS44 LCDKPASPRHYSPVECDKSFLLSAPYSHYHLGLLPDSEMTSHSQHSTATRQPSMTLQRAV 460 470 480 490 500 510 90 100 110 120 130 140 pF1KB6 EGHAHPRVVELPKTDEGLGFNIMGGKEQNSPIYISRVIPGGVADRHGGLKRGDQLLSVNG ...:: : : : . ::::::.:: :.. :..: .. :: :: : :.::::.::::: CCDS44 SLEGEPRKVVLHKGSTGLGFNIVGG-EDGEGIFVSFILAGGPADLSGELQRGDQILSVNG 520 530 540 550 560 570 150 160 170 180 190 200 pF1KB6 VSVEGEQHEKAVELLKAAQGSVKLVVRYTPRVLEEMEARFE-KMRSARRRQQHQSYSSLE ....: .::.:. ::.: .: ....: : : :::: :... :......:.:: CCDS44 IDLRGASHEQAAAALKGAGQTVTIIAQYQP----EDYARFEAKIHDLREQMMNHSMSSGS 580 590 600 610 620 630 pF1KB6 SRG CCDS44 GSLRTNQKRSLYVRAMFDYDKSKDSGLPSQGLSFKYGDILHVINASDDEWWQARRVMLEG 640 650 660 670 680 690 207 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 12:07:17 2016 done: Fri Nov 4 12:07:17 2016 Total Scan time: 1.740 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]