FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0042, 511 aa
  1>>>pF1KE0042 511 - 511 aa - 511 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.4057+/-0.000741; mu= 14.1701+/- 0.045
 mean_var=96.5402+/-19.207, 0's: 0 Z-trim(110.9): 31 B-trim: 9 in 2/51
 Lambda= 0.130533
 statistics sampled from 11945 (11975) to 11945 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.732), E-opt: 0.2 (0.368), width: 16
 Scan time: 2.940

The best scores are:                                 opt  bits E(32554)
CCDS58130.1 PRDM11 gene_id:56981|Hs108|chr11 ( 477) 3264 624.7 6.8e-179
CCDS73277.1 PRDM11 gene_id:56981|Hs108|chr11 (1177) 3116 597.1 3.6e-170
CCDS43307.1 PRDM9 gene_id:56979|Hs108|chr5   ( 894)  535 111.0 5.9e-24
CCDS45557.1 PRDM7 gene_id:11105|Hs108|chr16  ( 492)  531 110.1 6e-24
CCDS42932.1 PRDM15 gene_id:63977|Hs108|chr21 (1178)  349  76.0 2.6e-13
CCDS63370.1 PRDM15 gene_id:63977|Hs108|chr21 (1198)  349  76.0 2.6e-13
CCDS5054.2 PRDM1 gene_id:639|Hs108|chr6      ( 825)  317  69.9 1.3e-11
CCDS9115.1 PRDM4 gene_id:11108|Hs108|chr12   ( 801)  303  67.2 7.6e-11

>>CCDS58130.1 PRDM11 gene_id:56981|Hs108|chr11 (477 aa)
 initn: 3264 init1: 3264 opt: 3264 Z-score: 3325.1 bits: 624.7 E(32554): 6.8e-179
Smith-Waterman score: 3264; 100.0% identity (100.0% similar) in 477 aa overlap (35-511:1-477)

       10 20 30 40 50 60
pF1KE0 AEPIASLMIVECRACLRCSPLFLYQREKDRMTENMKECLAQTNAAVGDMVTVVKTEVCSP
       ::::::::::::::::::::::::::::::
CCDS58 MTENMKECLAQTNAAVGDMVTVVKTEVCSP
       10 20 30

       70 80 90 100 110 120
pF1KE0 LRDQEYGQPCSRRPDSSAMEVEPKKLKGKRDLIVPKSFQQVDFWFCESCQEYFVDECPNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LRDQEYGQPCSRRPDSSAMEVEPKKLKGKRDLIVPKSFQQVDFWFCESCQEYFVDECPNH
       40 50 60 70 80 90

       130 140 150 160 170 180
pF1KE0 GPPVFVSDTPVPVGIPDRAALTIPQGMEVVKDTSGESDVRCVNEVIPKGHIFGPYEGQIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 GPPVFVSDTPVPVGIPDRAALTIPQGMEVVKDTSGESDVRCVNEVIPKGHIFGPYEGQIS
       100 110 120 130 140 150

       190 200 210 220 230 240
pF1KE0 TQDKSAGFFSWLIVDKNNRYKSIDGSDETKANWMRYVVISREEREQNLLAFQHSERIYFR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 TQDKSAGFFSWLIVDKNNRYKSIDGSDETKANWMRYVVISREEREQNLLAFQHSERIYFR
       160 170 180 190 200 210

       250 260 270 280 290 300
pF1KE0 ACRDIRPGEWLRVWYSEDYMKRLHSMSQETIHRNLARGEKRLQREKSEQVLDNPEDLRGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ACRDIRPGEWLRVWYSEDYMKRLHSMSQETIHRNLARGEKRLQREKSEQVLDNPEDLRGP
       220 230 240 250 260 270

       310 320 330 340 350 360
pF1KE0 IHLSVLRQGKSPYKRGFDEGDVHPQAKKKKIDLIFKDVLEASLESAKVEAHQLALSTSLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 IHLSVLRQGKSPYKRGFDEGDVHPQAKKKKIDLIFKDVLEASLESAKVEAHQLALSTSLV
       280 290 300 310 320 330

       370 380 390 400 410 420
pF1KE0 IRKVPKYQDDAYSQCATTMTHGVQNIGQTQGEGDWKVPQGVSKEPGQLEDEEEEPSSFKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 IRKVPKYQDDAYSQCATTMTHGVQNIGQTQGEGDWKVPQGVSKEPGQLEDEEEEPSSFKA
       340 350 360 370 380 390

       430 440 450 460 470 480
pF1KE0 DSPAEASLASDPHELPTTSFCPNCIRLKKKVRELQAELDMLKSGKLPEPPVLPPQVLELP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 DSPAEASLASDPHELPTTSFCPNCIRLKKKVRELQAELDMLKSGKLPEPPVLPPQVLELP
       400 410 420 430 440 450

       490 500 510
pF1KE0 EFSDPAGKLVWMRLLSEGRVRSGLCGG
       :::::::::::::::::::::::::::
CCDS58 EFSDPAGKLVWMRLLSEGRVRSGLCGG
       460 470

>>CCDS73277.1 PRDM11 gene_id:56981|Hs108|chr11 (1177 aa)
 initn: 3116 init1: 3116 opt: 3116 Z-score: 3168.6 bits: 597.1 E(32554): 3.6e-170
Smith-Waterman score: 3116; 100.0% identity (100.0% similar) in 456 aa overlap (35-490:1-456)

       10 20 30 40 50 60
pF1KE0 AEPIASLMIVECRACLRCSPLFLYQREKDRMTENMKECLAQTNAAVGDMVTVVKTEVCSP
       ::::::::::::::::::::::::::::::
CCDS73 MTENMKECLAQTNAAVGDMVTVVKTEVCSP
       10 20 30

       70 80 90 100 110 120
pF1KE0 LRDQEYGQPCSRRPDSSAMEVEPKKLKGKRDLIVPKSFQQVDFWFCESCQEYFVDECPNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 LRDQEYGQPCSRRPDSSAMEVEPKKLKGKRDLIVPKSFQQVDFWFCESCQEYFVDECPNH
       40 50 60 70 80 90

       130 140 150 160 170 180
pF1KE0 GPPVFVSDTPVPVGIPDRAALTIPQGMEVVKDTSGESDVRCVNEVIPKGHIFGPYEGQIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 GPPVFVSDTPVPVGIPDRAALTIPQGMEVVKDTSGESDVRCVNEVIPKGHIFGPYEGQIS
       100 110 120 130 140 150

       190 200 210 220 230 240
pF1KE0 TQDKSAGFFSWLIVDKNNRYKSIDGSDETKANWMRYVVISREEREQNLLAFQHSERIYFR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 TQDKSAGFFSWLIVDKNNRYKSIDGSDETKANWMRYVVISREEREQNLLAFQHSERIYFR
       160 170 180 190 200 210

       250 260 270 280 290 300
pF1KE0 ACRDIRPGEWLRVWYSEDYMKRLHSMSQETIHRNLARGEKRLQREKSEQVLDNPEDLRGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 ACRDIRPGEWLRVWYSEDYMKRLHSMSQETIHRNLARGEKRLQREKSEQVLDNPEDLRGP
       220 230 240 250 260 270

       310 320 330 340 350 360
pF1KE0 IHLSVLRQGKSPYKRGFDEGDVHPQAKKKKIDLIFKDVLEASLESAKVEAHQLALSTSLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 IHLSVLRQGKSPYKRGFDEGDVHPQAKKKKIDLIFKDVLEASLESAKVEAHQLALSTSLV
       280 290 300 310 320 330

       370 380 390 400 410 420
pF1KE0 IRKVPKYQDDAYSQCATTMTHGVQNIGQTQGEGDWKVPQGVSKEPGQLEDEEEEPSSFKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 IRKVPKYQDDAYSQCATTMTHGVQNIGQTQGEGDWKVPQGVSKEPGQLEDEEEEPSSFKA
       340 350 360 370 380 390

       430 440 450 460 470 480
pF1KE0 DSPAEASLASDPHELPTTSFCPNCIRLKKKVRELQAELDMLKSGKLPEPPVLPPQVLELP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 DSPAEASLASDPHELPTTSFCPNCIRLKKKVRELQAELDMLKSGKLPEPPVLPPQVLELP
       400 410 420 430 440 450

       490 500 510
pF1KE0 EFSDPAGKLVWMRLLSEGRVRSGLCGG
       ::::::
CCDS73 EFSDPAASESMVSGPAIMEDDDQEVDSADESVSNDMMTATDEPSKMSSATGRRIRRFKQE
       460 470 480 490 500 510

>>CCDS43307.1 PRDM9 gene_id:56979|Hs108|chr5 (894 aa)
 initn: 530 init1: 283 opt: 535 Z-score: 543.6 bits: 111.0 E(32554): 5.9e-24
Smith-Waterman score: 535; 44.0% identity (75.0% similar) in 168 aa overlap (103-267:198-365)

       80 90 100 110 120 130
pF1KE0 PCSRRPDSSAMEVEPKKLKGKRDLIVPKSFQQVDFWFCESCQEYFVDECPNHGPPVFVSD
       :. :. .:: ::..:.: : ::::.::.:
CCDS43 KLELRKKETERKMYSLRERKGHAYKEVSEPQDDDYLYCEMCQNFFIDSCAAHGPPTFVKD
       170 180 190 200 210 220

       140 150 160 170 180 190
pF1KE0 TPVPVGIPDRAALTIPQGMEVVKDTSGESDVRCVNEV--IPKGHIFGPYEGQISTQDKSA
       . : : :.:.::..: :... . .. . ::. .: : ::::::.:. ....:
CCDS43 SAVDKGHPNRSALSLPPGLRIGPSGIPQAGLGVWNEASDLPLGLHFGPYEGRITEDEEAA
       230 240 250 260 270 280

       200 210 220 230 240
pF1KE0 GF-FSWLIVDKNNRYKSIDGSDETKANWMRYVVISREEREQNLLAFQHSERIYFRACRDI
       . .::::. : :. .::.:.. ::::::: .:...::::.:::. ..:..:.:: :
CCDS43 NNGYSWLITKGRNCYEYVDGKDKSWANWMRYVNCARDDEEQNLVAFQYHRQIFYRTCRVI
       290 300 310 320 330 340

       250 260 270 280 290 300
pF1KE0 RPGEWLRVWYSEDYMKRLHSMSQETIHRNLARGEKRLQREKSEQVLDNPEDLRGPIHLSV
       ::: : :::...: ..:
CCDS43 RPGCELLVWYGDEYGQELGIKWGSKWKKELMAGREPKPEIHPCPSCCLAFSSQKFLSQHV
       350 360 370 380 390 400

>>CCDS45557.1 PRDM7 gene_id:11105|Hs108|chr16 (492 aa)
 initn: 519 init1: 272 opt: 531 Z-score: 543.4 bits: 110.1 E(32554): 6e-24
Smith-Waterman score: 531; 39.6% identity (71.1% similar) in 197 aa overlap (76-267:172-365)

       50 60 70 80 90 100
pF1KE0 TNAAVGDMVTVVKTEVCSPLRDQEYGQPCSRRPDSSAMEVEPKKLKGK--RDLIVPKSFQ
       :: .. . .. ::. ... : :
CCDS45 TSDSEQAQKPVSPPGEASTSGQHSRLKLELRRKETEGKMYSLRERKGHAYKEISEP---Q
       150 160 170 180 190

       110 120 130 140 150 160
pF1KE0 QVDFWFCESCQEYFVDECPNHGPPVFVSDTPVPVGIPDRAALTIPQGMEVVKDTSGESDV
       . :. .:: ::..:.: : ::::.::.:. : : :.:.::..: :... . .. .
CCDS45 DDDYLYCEMCQNFFIDSCAAHGPPTFVKDSAVDKGHPNRSALSLPPGLRIGPSGIPQAGL
       200 210 220 230 240 250

       170 180 190 200 210 220
pF1KE0 RCVNEV--IPKGHIFGPYEGQISTQDKSAGF-FSWLIVDKNNRYKSIDGSDETKANWMRY
       ::. .: : ::::::.:. ....:. .::::. : :. .::.:...::::::
CCDS45 GVWNEASDLPLGLHFGPYEGRITEDEEAANSGYSWLITKGRNCYEYVDGKDKSSANWMRY
       260 270 280 290 300 310

       230 240 250 260 270 280
pF1KE0 VVISREEREQNLLAFQHSERIYFRACRDIRPGEWLRVWYSEDYMKRLHSMSQETIHRNLA
       : .:...::::.:::. ..:..:.:: :::: : :: ...: ..:
CCDS45 VNCARDDEEQNLVAFQYHRQIFYRTCRVIRPGCELLVWSGDEYGQELGIRSSIEPAESLG
       320 330 340 350 360 370

       290 300 310 320 330 340
pF1KE0 RGEKRLQREKSEQVLDNPEDLRGPIHLSVLRQGKSPYKRGFDEGDVHPQAKKKKIDLIFK
CCDS45 QAVNCWSGMGMSMARNWASSGAASGRKSSWQGENQSQRSIHVPHAVWPFQVKNFSVNMWN
       380 390 400 410 420 430

>>CCDS42932.1 PRDM15 gene_id:63977|Hs108|chr21 (1178 aa)
 initn: 305 init1: 220 opt: 349 Z-score: 352.5 bits: 76.0 E(32554): 2.6e-13
Smith-Waterman score: 349; 31.2% identity (59.5% similar) in 205 aa overlap (76-267:4-203)

       50 60 70 80 90 100
pF1KE0 TNAAVGDMVTVVKTEVCSPLRDQEYGQPCSRRPDSSAMEVEPKKLKGKRDLIVPK-SFQ-
       ::: .:. :... . .: .::
CCDS42 MPRRRPPASGAAQFPERIATRSPDPIPLCTFQR
       10 20 30

       110 120 130 140 150
pF1KE0 QVD-----------FWFCESCQEYFVDECPNHGPPVFVSDTPVPVGIPDRAALTIPQGME
       ::. : .::.:..: .:::. :: :.:.:. : .:: ..: ..:
CCDS42 QVSEMAEDGSEEIMFIWCEDCSQYHDSECPELGPVVMVKDSFVL----SRARSSLPPNLE
       40 50 60 70 80

       160 170 180 190 200 210
pF1KE0 VVKDTSGESDVRCVNEVIPKGHIFGPYEGQISTQDKSAGFFSWLIVDKNNRYKSIDGSDE
       . . .: : ..... . . :::.:.. .. .. . : . .:... .: :.:
CCDS42 IRRLEDGAEGVFAITQLVKRTQ-FGPFESRRVAKWEKESAFPLKVFQKDGHPVCFDTSNE
       90 100 110 120 130 140

       220 230 240 250 260 270
pF1KE0 TKANWMRYVVISREEREQNLLAFQHSERIYFRACRDIRPGEWLRVWYSEDYMKRLHSMSQ
       ::: : . : ..::: :.::. .:: . ::: :: :::::. : :..
CCDS42 DDCNWMMLVRPAAEAEHQNLTAYQHGSDVYFTTSRDIPPGTELRVWYAAFYAKKMDKPML
       150 160 170 180 190 200

       280 290 300 310 320 330
pF1KE0 ETIHRNLARGEKRLQREKSEQVLDNPEDLRGPIHLSVLRQGKSPYKRGFDEGDVHPQAKK
CCDS42 KQAGSGVHAAGTPENSAPVESEPSQWACKVCSATFLELQLLNEHLLGHLEQAKSLPPGSQ
       210 220 230 240 250 260

>>CCDS63370.1 PRDM15 gene_id:63977|Hs108|chr21 (1198 aa)
 initn: 305 init1: 220 opt: 349 Z-score: 352.4 bits: 76.0 E(32554): 2.6e-13
Smith-Waterman score: 349; 31.2% identity (59.5% similar) in 205 aa overlap (76-267:4-203)

       50 60 70 80 90 100
pF1KE0 TNAAVGDMVTVVKTEVCSPLRDQEYGQPCSRRPDSSAMEVEPKKLKGKRDLIVPK-SFQ-
       ::: .:. :... . .: .::
CCDS63 MPRRRPPASGAAQFPERIATRSPDPIPLCTFQR
       10 20 30

       110 120 130 140 150
pF1KE0 QVD-----------FWFCESCQEYFVDECPNHGPPVFVSDTPVPVGIPDRAALTIPQGME
       ::. : .::.:..: .:::. :: :.:.:. : .:: ..: ..:
CCDS63 QVSEMAEDGSEEIMFIWCEDCSQYHDSECPELGPVVMVKDSFVL----SRARSSLPPNLE
       40 50 60 70 80

       160 170 180 190 200 210
pF1KE0 VVKDTSGESDVRCVNEVIPKGHIFGPYEGQISTQDKSAGFFSWLIVDKNNRYKSIDGSDE
       . . .: : ..... . . :::.:.. .. .. . : . .:... .: :.:
CCDS63 IRRLEDGAEGVFAITQLVKRTQ-FGPFESRRVAKWEKESAFPLKVFQKDGHPVCFDTSNE
       90 100 110 120 130 140

       220 230 240 250 260 270
pF1KE0 TKANWMRYVVISREEREQNLLAFQHSERIYFRACRDIRPGEWLRVWYSEDYMKRLHSMSQ
       ::: : . : ..::: :.::. .:: . ::: :: :::::. : :..
CCDS63 DDCNWMMLVRPAAEAEHQNLTAYQHGSDVYFTTSRDIPPGTELRVWYAAFYAKKMDKPML
       150 160 170 180 190 200

       280 290 300 310 320 330
pF1KE0 ETIHRNLARGEKRLQREKSEQVLDNPEDLRGPIHLSVLRQGKSPYKRGFDEGDVHPQAKK
CCDS63 KQAGSGVHAAGTPENSAPVESEPSQWACKVCSATFLELQLLNEHLLGHLEQAKSLPPGSQ
       210 220 230 240 250 260

>>CCDS5054.2 PRDM1 gene_id:639|Hs108|chr6 (825 aa)
 initn: 295 init1: 211 opt: 317 Z-score: 322.2 bits: 69.9 E(32554): 1.3e-11
Smith-Waterman score: 317; 30.3% identity (58.8% similar) in 238 aa overlap (109-336:54-281)

       80 90 100 110 120 130
pF1KE0 DSSAMEVEPKKLKGKRDLIVPKSFQQVDFWFCESCQEYFVDECPNHGPPVFVSDTPVPVG
       : :.: :.:.. : :. . :
CCDS50 VRFQGLAEGTKGTMKMDMEDADMTLWTEAEFEEKCT-YIVNDHPW--------DSGADGG
       30 40 50 60 70

       140 150 160 170 180 190
pF1KE0 IPDRAALTIPQGMEVVKDTSGESDVRCVN-EVIPKGHIFGPYEGQISTQD---KSAG--F
       .: ..:... :..: . .. : :::: ::: :.: :.: :.:. .
CCDS50 TSVQAEASLPRNLLFKYATNSEEVIGVMSKEYIPKGTRFGPLIGEIYTNDTVPKNANRKY
       80 90 100 110 120 130

       200 210 220 230 240 250
pF1KE0 FSWLIVDKNNRYKSIDGSDETKANWMRYVVISREEREQNLLAFQHSERIYFRACRDIRPG
       : : : .... .. ::: .: :.:::::: .. ::::: : :.. ::: . . : .
CCDS50 F-WRIYSRGELHHFIDGFNEEKSNWMRYVNPAHSPREQNLAACQNGMNIYFYTIKPIPAN
       140 150 160 170 180 190

       260 270 280 290 300
pF1KE0 EWLRVWYSEDYMKRLH-SMSQETIHRNLARGEKRLQREKSEQVLDNPEDL--RGPIHLSV
       . : ::: .:. .::: . : ::.. .. :.. ..:. :... : .
CCDS50 QELLVWYCRDFAERLHYPYPGELTMMNLTQTQSSLKQPSTEKNELCPKNVPKREYSVKEI
       200 210 220 230 240 250

       310 320 330 340 350 360
pF1KE0 LRQGKSPYK-RGFDEGDVHPQAKKKKIDLIFKDVLEASLESAKVEAHQLALSTSLVIRKV
       :. ..: : . . .... : ...: .:
CCDS50 LKLDSNPSKGKDLYRSNISPLTSEKDLDDFRRRGSPEMPFYPRVVYPIRAPLPEDFLKAS
       260 270 280 290 300 310

>>CCDS9115.1 PRDM4 gene_id:11108|Hs108|chr12 (801 aa)
 initn: 261 init1: 159 opt: 303 Z-score: 308.2 bits: 67.2 E(32554): 7.6e-11
Smith-Waterman score: 303; 32.4% identity (59.7% similar) in 176 aa overlap (108-276:376-545)

       80 90 100 110 120 130
pF1KE0 PDSSAMEVEPKKLKGKRDLIVPKSFQQVDFWFCESCQEYFVDECPNHGPPVFVSDTPVPV
       : : :.. . ..::.::: .:: :::
CCDS91 SSDSLSFVSPSLQMEDSNSNKENMATLFTIW-CTLCDRAYPSDCPEHGPVTFVPDTP---
       350 360 370 380 390 400

       140 150 160 170 180 190
pF1KE0 GIPDRAALTIPQGMEVVKDTSGESDVRCVNEVIPKGHIFGPYEGQISTQ-------DKSA
       : .:: :..:. . . .. : ..:.:: ::: :: : . ::..
CCDS91 -IESRARLSLPKQLVLRQSIVGAEVGVWTGETIPVRTCFGPLIGQQSHSMEVAEWTDKAV
       410 420 430 440 450 460

       200 210 220 230 240 250
pF1KE0 GFFSWLIVDKNNRYKSIDGSDETKANWMRYVVISREEREQNLLAFQHSERIYFRACRDIR
       . . : : .. : .::.. ::: .: .:...::::.:. :. .:.: . .::
CCDS91 NHI-WKIYHNGVLEFCIITTDENECNWMMFVRKARNREEQNLVAYPHDGKIFFCTSQDIP
       470 480 490 500 510

       260 270 280 290 300 310
pF1KE0 PGEWLRVWYSEDYMKRLHSMSQETIHRNLARGEKRLQREKSEQVLDNPEDLRGPIHLSVL
       : . : .::.:: ... . .:
CCDS91 PENELLFYYSRDYAQQIGVPEHPDVHLCNCGKECNSYTEFKAHLTSHIHNHLPTQGHSGS
       520 530 540 550 560 570

511 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 06:24:50 2016 done: Fri Nov 4 06:24:50 2016
 Total Scan time: 2.940 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]