FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2200, 517 aa 1>>>pF1KE2200 517 - 517 aa - 517 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.8372+/-0.00109; mu= -8.5273+/- 0.065 mean_var=475.2933+/-100.171, 0's: 0 Z-trim(116.6): 787 B-trim: 462 in 1/54 Lambda= 0.058829 statistics sampled from 16356 (17232) to 16356 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.529), width: 16 Scan time: 3.890 The best scores are: opt bits E(32554) CCDS7878.2 WT1 gene_id:7490|Hs108|chr11 ( 517) 3711 329.2 7e-90 CCDS44562.1 WT1 gene_id:7490|Hs108|chr11 ( 514) 3679 326.5 4.6e-89 CCDS44561.1 WT1 gene_id:7490|Hs108|chr11 ( 497) 2309 210.2 4.5e-54 CCDS55751.1 WT1 gene_id:7490|Hs108|chr11 ( 302) 2116 193.6 2.7e-49 CCDS55750.1 WT1 gene_id:7490|Hs108|chr11 ( 288) 1310 125.2 1e-28 >>CCDS7878.2 WT1 gene_id:7490|Hs108|chr11 (517 aa) initn: 3711 init1: 3711 opt: 3711 Z-score: 1727.1 bits: 329.2 E(32554): 7e-90 Smith-Waterman score: 3711; 100.0% identity (100.0% similar) in 517 aa overlap (1-517:1-517) 10 20 30 40 50 60 pF1KE2 MQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 MQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 ASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 ASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 GPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 GPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 
GPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 GPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 NHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 NHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 LECMTWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 LECMTWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 VFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 VFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 YQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 YQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEK 430 440 450 460 470 480 490 500 510 pF1KE2 PFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL ::::::::::::::::::::::::::::::::::::: CCDS78 PFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL 490 500 510 >>CCDS44562.1 WT1 gene_id:7490|Hs108|chr11 (514 aa) initn: 3501 init1: 3413 opt: 3679 Z-score: 1712.5 bits: 326.5 E(32554): 4.6e-89 Smith-Waterman score: 3679; 99.4% identity (99.4% similar) in 517 aa overlap (1-517:1-514) 10 20 30 40 50 60 pF1KE2 MQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 ASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 
ASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 GPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 GPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 GPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 GPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 NHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 NHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 LECMTWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 LECMTWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 VFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 YQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEK ::::::::::::::::::::::::::::::::::::::::::::::::::::::: :: CCDS44 YQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTG---EK 430 440 450 460 470 490 500 510 pF1KE2 PFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL ::::::::::::::::::::::::::::::::::::: CCDS44 PFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL 480 490 500 510 >>CCDS44561.1 WT1 gene_id:7490|Hs108|chr11 (497 aa) initn: 3387 init1: 2301 opt: 2309 Z-score: 1084.3 bits: 210.2 E(32554): 4.5e-54 Smith-Waterman score: 3528; 96.1% identity (96.1% similar) in 517 aa overlap (1-517:1-497) 10 20 30 40 50 60 pF1KE2 
MQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 ASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 GPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 GPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 GPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 GPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 NHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 NHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 LECMTWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHG ::::::::::::::::: :::::::::::::::::::::::::: CCDS44 LECMTWNQMNLGATLKG-----------------HSTGYESDNHTTPILCGAQYRIHTHG 310 320 330 340 370 380 390 400 410 420 pF1KE2 VFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKP 350 360 370 380 390 400 430 440 450 460 470 480 pF1KE2 YQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEK ::::::::::::::::::::::::::::::::::::::::::::::::::::::: :: CCDS44 YQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTG---EK 410 420 430 440 450 460 490 500 510 pF1KE2 PFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL ::::::::::::::::::::::::::::::::::::: CCDS44 
PFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL 470 480 490 >>CCDS55751.1 WT1 gene_id:7490|Hs108|chr11 (302 aa) initn: 2073 init1: 1848 opt: 2116 Z-score: 998.5 bits: 193.6 E(32554): 2.7e-49 Smith-Waterman score: 2116; 98.7% identity (99.0% similar) in 303 aa overlap (215-517:3-302) 190 200 210 220 230 240 pF1KE2 PSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFPNHSF .::::::::::::::::::::::::::::: CCDS55 MEKGYSTVTFDGTPSYGHTPSHHAAQFPNHSF 10 20 30 250 260 270 280 290 300 pF1KE2 KHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQLECM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 KHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQLECM 40 50 60 70 80 90 310 320 330 340 350 360 pF1KE2 TWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHGVFRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 TWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHGVFRG 100 110 120 130 140 150 370 380 390 400 410 420 pF1KE2 IQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCD 160 170 180 190 200 210 430 440 450 460 470 480 pF1KE2 FKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEKPFSC ::::::::::::::::::::::::::::::::::::::::::::::::::: :::::: CCDS55 FKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTG---EKPFSC 220 230 240 250 260 490 500 510 pF1KE2 RWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL ::::::::::::::::::::::::::::::::: CCDS55 RWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL 270 280 290 300 >>CCDS55750.1 WT1 gene_id:7490|Hs108|chr11 (288 aa) initn: 1310 init1: 1310 opt: 1310 Z-score: 629.0 bits: 125.2 E(32554): 1e-28 Smith-Waterman score: 1997; 94.1% identity (94.4% similar) in 303 aa overlap (215-517:3-288) 190 200 210 220 230 240 pF1KE2 PSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFPNHSF .::::::::::::::::::::::::::::: CCDS55 MEKGYSTVTFDGTPSYGHTPSHHAAQFPNHSF 10 20 30 250 260 270 280 290 300 pF1KE2 
KHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQLECM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 KHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQLECM 40 50 60 70 80 90 310 320 330 340 350 360 pF1KE2 TWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHGVFRG ::::::::::::: :::::::::::::::::::::::::::::: CCDS55 TWNQMNLGATLKG-----------------HSTGYESDNHTTPILCGAQYRIHTHGVFRG 100 110 120 130 370 380 390 400 410 420 pF1KE2 IQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCD 140 150 160 170 180 190 430 440 450 460 470 480 pF1KE2 FKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEKPFSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 FKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEKPFSC 200 210 220 230 240 250 490 500 510 pF1KE2 RWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL ::::::::::::::::::::::::::::::::: CCDS55 RWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL 260 270 280 517 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 15:11:48 2016 done: Sun Nov 6 15:11:48 2016 Total Scan time: 3.890 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]
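The "best scores" summary in the report above lists, for each hit, the library accession, a description, the library sequence length in parentheses, and the opt, bits and E() values. A minimal sketch of pulling those fields out of the report text follows. It assumes accessions of the form `CCDS<digits>.<version>`, as in this run, and tolerates collapsed whitespace. The names `Hit`, `HIT_RE` and `parse_best_scores` are hypothetical helpers introduced here and are not part of the FASTA package.

```python
import re
from dataclasses import dataclass


@dataclass
class Hit:
    accession: str    # e.g. "CCDS7878.2"
    description: str  # free text between the accession and "( len)"
    length: int       # library sequence length in residues
    opt: int          # optimized score
    bits: float       # bit score
    evalue: float     # expectation value, E(library size)


# Assumption: summary entries look like
#   CCDS7878.2 WT1 gene_id:7490|Hs108|chr11 ( 517) 3711 329.2 7e-90
# Alignment headers such as "(517 aa)" do not match, because the
# pattern requires ")" directly after the length digits.
HIT_RE = re.compile(
    r"(CCDS\d+\.\d+)\s+"                      # accession
    r"([^()]+?)\s+"                           # description (no parentheses)
    r"\(\s*(\d+)\)\s+"                        # ( length)
    r"(\d+)\s+"                               # opt
    r"(\d+\.\d+)\s+"                          # bits
    r"(\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)"       # E-value
)


def parse_best_scores(text: str) -> list[Hit]:
    """Return one Hit per summary entry found in the report text."""
    hits = []
    for m in HIT_RE.finditer(text):
        acc, desc, length, opt, bits, evalue = m.groups()
        hits.append(
            Hit(acc, desc, int(length), int(opt), float(bits), float(evalue))
        )
    return hits


# Excerpt copied from the report above, followed by one alignment header
# to show that headers are skipped.
sample = (
    "The best scores are: opt bits E(32554) "
    "CCDS7878.2 WT1 gene_id:7490|Hs108|chr11 ( 517) 3711 329.2 7e-90 "
    "CCDS44562.1 WT1 gene_id:7490|Hs108|chr11 ( 514) 3679 326.5 4.6e-89 "
    ">>CCDS7878.2 WT1 gene_id:7490|Hs108|chr11 (517 aa) initn: 3711"
)

hits = parse_best_scores(sample)
for h in hits:
    print(h.accession, h.length, h.opt, h.bits, h.evalue)
```

The pattern is deliberately narrow: a library with different accession styles would need the first group adjusted, and a tabular FASTA output format, where available, is likely a more robust input than this human-readable report.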