FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7675, 341 aa
  1>>>pF1KB7675 341 - 341 aa - 341 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.8354+/-0.000927; mu= 0.5890+/- 0.056
 mean_var=181.6648+/-36.849, 0's: 0 Z-trim(111.9): 36 B-trim: 0 in 0/53
 Lambda= 0.095157
 statistics sampled from 12736 (12771) to 12736 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.739), E-opt: 0.2 (0.392), width: 16
 Scan time: 3.030

The best scores are:                              opt  bits E(32554)
CCDS9662.1 PAX9 gene_id:5083|Hs108|chr14  ( 341) 2317 329.9 1.8e-90
CCDS13146.2 PAX1 gene_id:5075|Hs108|chr20 ( 534) 1378 201.1 1.7e-51
CCDS74709.1 PAX1 gene_id:5075|Hs108|chr20 ( 457) 1364 199.2 5.6e-51
CCDS41561.1 PAX2 gene_id:5076|Hs108|chr10 ( 394)  685 105.9 5.7e-23
CCDS7499.1 PAX2 gene_id:5076|Hs108|chr10  ( 396)  685 105.9 5.7e-23
CCDS46399.1 PAX8 gene_id:7849|Hs108|chr2  ( 398)  664 103.0 4.2e-22
CCDS46398.1 PAX8 gene_id:7849|Hs108|chr2  ( 450)  665 103.2 4.3e-22
CCDS42735.1 PAX8 gene_id:7849|Hs108|chr2  ( 287)  661 102.6 4.3e-22
CCDS42736.1 PAX8 gene_id:7849|Hs108|chr2  ( 321)  658 102.2 6.2e-22
CCDS46522.1 PAX3 gene_id:5077|Hs108|chr2  ( 483)  660 102.5 7.3e-22
CCDS44075.1 PAX7 gene_id:5081|Hs108|chr1  ( 518)  656 102.0 1.1e-21
CCDS2451.1 PAX3 gene_id:5077|Hs108|chr2   ( 206)  643 100.0 1.8e-21
CCDS46523.1 PAX3 gene_id:5077|Hs108|chr2  ( 215)  643 100.0 1.9e-21
CCDS2450.1 PAX3 gene_id:5077|Hs108|chr2   ( 403)  648 100.9 2e-21
CCDS2449.1 PAX3 gene_id:5077|Hs108|chr2   ( 407)  648 100.9 2e-21
CCDS42826.1 PAX3 gene_id:5077|Hs108|chr2  ( 479)  648 100.9 2.3e-21
CCDS42825.1 PAX3 gene_id:5077|Hs108|chr2  ( 484)  648 100.9 2.3e-21
CCDS2448.1 PAX3 gene_id:5077|Hs108|chr2   ( 505)  648 100.9 2.4e-21
CCDS44074.1 PAX7 gene_id:5081|Hs108|chr1  ( 505)  647 100.8 2.6e-21
CCDS186.1 PAX7 gene_id:5081|Hs108|chr1    ( 520)  647 100.8 2.7e-21
CCDS65042.1 PAX5 gene_id:5079|Hs108|chr9  ( 319)  641  99.8 3.1e-21
CCDS65043.1 PAX5 gene_id:5079|Hs108|chr9  ( 348)  641  99.9 3.4e-21
CCDS65044.1 PAX5 gene_id:5079|Hs108|chr9  ( 295)  638  99.4 3.9e-21
CCDS65045.1 PAX5 gene_id:5079|Hs108|chr9  ( 324)  638  99.4 4.2e-21
CCDS65046.1 PAX5 gene_id:5079|Hs108|chr9  ( 328)  638  99.4 4.3e-21
CCDS65047.1 PAX5 gene_id:5079|Hs108|chr9  ( 357)  638  99.5 4.6e-21
CCDS65048.1 PAX5 gene_id:5079|Hs108|chr9  ( 362)  638  99.5 4.6e-21
CCDS6607.1 PAX5 gene_id:5079|Hs108|chr9   ( 391)  638  99.5 4.9e-21
CCDS31451.1 PAX6 gene_id:5080|Hs108|chr11 ( 422)  604  94.8 1.3e-19
CCDS5797.1 PAX4 gene_id:5078|Hs108|chr7   ( 343)  483  78.2 1.1e-14
CCDS31452.1 PAX6 gene_id:5080|Hs108|chr11 ( 436)  396  66.3 5.4e-11

>>CCDS9662.1 PAX9 gene_id:5083|Hs108|chr14 (341 aa)
 initn: 2317 init1: 2317 opt: 2317 Z-score: 1737.7 bits: 329.9 E(32554): 1.8e-90
Smith-Waterman score: 2317; 100.0% identity (100.0% similar) in 341 aa overlap (1-341:1-341)

               10        20        30        40        50        60
pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSKILARY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSKILARY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 NETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLADGVCDKYNVPSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 NETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLADGVCDKYNVPSV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 SSISRILRNKIGNLAQQGHYDSYKQHQPTPQPALPYNHIYSYPSPITAAAAKVPTPPGVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 SSISRILRNKIGNLAQQGHYDSYKQHQPTPQPALPYNHIYSYPSPITAAAAKVPTPPGVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 AIPGSVAMPRTWPSSHSVTDILGIRSITDQVSDSSPYHSPKVEEWSSLGRNNFPAAAPHA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 AIPGSVAMPRTWPSSHSVTDILGIRSITDQVSDSSPYHSPKVEEWSSLGRNNFPAAAPHA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 
VNGLEKGALEQEAKYGQAPNGLPAVGSFVSASSMAPYPTPAQVSPYMTYSAAPSGYVAGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 VNGLEKGALEQEAKYGQAPNGLPAVGSFVSASSMAPYPTPAQVSPYMTYSAAPSGYVAGH 250 260 270 280 290 300 310 320 330 340 pF1KB7 GWQHAGGTSLSPHNCDIPASLAFKGMQAAREGSHSVTASAL ::::::::::::::::::::::::::::::::::::::::: CCDS96 GWQHAGGTSLSPHNCDIPASLAFKGMQAAREGSHSVTASAL 310 320 330 340 >>CCDS13146.2 PAX1 gene_id:5075|Hs108|chr20 (534 aa) initn: 1192 init1: 1095 opt: 1378 Z-score: 1038.1 bits: 201.1 E(32554): 1.7e-51 Smith-Waterman score: 1378; 63.3% identity (79.7% similar) in 354 aa overlap (1-339:95-435) 10 20 30 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIV :: ..::::::::::::::::::::::::: CCDS13 GGAQALPDCAGPSPGHPGHPGARQLAGPLAMEQTYGEVNQLGGVFVNGRPLPNAIRLRIV 70 80 90 100 110 120 40 50 60 70 80 90 pF1KB7 ELAQLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTY :::::::::::::::::::::::::::::::::::::::::::::::::::.:::::: : CCDS13 ELAQLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPNVVKHIRDY 130 140 150 160 170 180 100 110 120 130 140 150 pF1KB7 KQRDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTP :: :::::::::::::::::::::::::::::::::::::::.::: : :.. :: : CCDS13 KQGDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGSLAQPGPYEASKQ--PPS 190 200 210 220 230 240 160 170 180 190 200 pF1KB7 QPALPYNHIYSYP--SPITAAAAKVPTPPGVPAIPGSVAMPRTWPSSHSVTDILGIRSIT ::.:::::::.:: ::.. ..::. . ::::. : :..::.:::.:::..:::::.. CCDS13 QPTLPYNHIYQYPYPSPVSPTGAKMGSHPGVPGTAGHVSIPRSWPSAHSVSNILGIRTFM 250 260 270 280 290 300 210 220 230 240 250 260 pF1KB7 DQV-----SDSSPYHSPKVEEWSSLGRNNFPAAAPHAVNGLEKGALEQEAKYGQAPNGLP .:. :... : :::.:.:....:. :::. : ::::::: ::: . :: :. . : CCDS13 EQTGALAGSEGTAY-SPKMEDWAGVNRTAFPAT-P-AVNGLEKPALEADIKYTQSASTLS 310 320 330 340 350 270 280 290 300 310 pF1KB7 AVGSFVSASSMAPYPTPAQVSPYMTYSAAPSGYVA-GHGWQHAGGTSLSP-------HNC :::.:. : . ::. : . .::: .::.: : : : : :.: :. 
CCDS13 AVGGFLPACA---YPASNQ---HGVYSAPGGGYLAPGPPWPPAQGPPLAPPGAGVAVHGG 360 370 380 390 400 410 320 330 340 pF1KB7 DIPASLAFKGMQAAREGSHSVTASAL .. :...:: . .:::: . :. CCDS13 ELAAAMTFK--HPSREGSLPAPAARPRTPSVAYTDCPSRPRPPRGSSPRTRARRERQADP 420 430 440 450 460 470 >>CCDS74709.1 PAX1 gene_id:5075|Hs108|chr20 (457 aa) initn: 1192 init1: 1095 opt: 1364 Z-score: 1028.7 bits: 199.2 E(32554): 5.6e-51 Smith-Waterman score: 1364; 64.6% identity (80.5% similar) in 339 aa overlap (1-324:95-422) 10 20 30 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIV :: ..::::::::::::::::::::::::: CCDS74 GGAQALPDCAGPSPGHPGHPGARQLAGPLAMEQTYGEVNQLGGVFVNGRPLPNAIRLRIV 70 80 90 100 110 120 40 50 60 70 80 90 pF1KB7 ELAQLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTY :::::::::::::::::::::::::::::::::::::::::::::::::::.:::::: : CCDS74 ELAQLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPNVVKHIRDY 130 140 150 160 170 180 100 110 120 130 140 150 pF1KB7 KQRDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTP :: :::::::::::::::::::::::::::::::::::::::.::: : :.. : :: CCDS74 KQGDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGSLAQPGPYEASK--QPPS 190 200 210 220 230 240 160 170 180 190 200 pF1KB7 QPALPYNHIYSYP--SPITAAAAKVPTPPGVPAIPGSVAMPRTWPSSHSVTDILGIRSIT ::.:::::::.:: ::.. ..::. . ::::. : :..::.:::.:::..:::::.. CCDS74 QPTLPYNHIYQYPYPSPVSPTGAKMGSHPGVPGTAGHVSIPRSWPSAHSVSNILGIRTFM 250 260 270 280 290 300 210 220 230 240 250 260 pF1KB7 DQV-----SDSSPYHSPKVEEWSSLGRNNFPAAAPHAVNGLEKGALEQEAKYGQAPNGLP .:. :... : :::.:.:....:. :::. : ::::::: ::: . :: :. . : CCDS74 EQTGALAGSEGTAY-SPKMEDWAGVNRTAFPAT-P-AVNGLEKPALEADIKYTQSASTLS 310 320 330 340 350 270 280 290 300 310 pF1KB7 AVGSFVSASSMAPYPTPAQVSPYMTYSAAPSGYVA-GHGWQHAGGTSLSP-------HNC :::.:. : . ::. : . .::: .::.: : : : : :.: :. CCDS74 AVGGFLPACA---YPASNQ---HGVYSAPGGGYLAPGPPWPPAQGPPLAPPGAGVAVHGG 360 370 380 390 400 410 320 330 340 pF1KB7 DIPASLAFKGMQAAREGSHSVTASAL .. 
:...:: CCDS74 ELAAAMTFKHPSREVADRKPPSSGSKAPDALSSLHGLPIPASTS 420 430 440 450 >>CCDS41561.1 PAX2 gene_id:5076|Hs108|chr10 (394 aa) initn: 729 init1: 662 opt: 685 Z-score: 525.9 bits: 105.9 E(32554): 5.7e-23 Smith-Waterman score: 685; 57.1% identity (78.5% similar) in 191 aa overlap (1-191:13-196) 10 20 30 40 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRV :.:. : :::::::::::::::...: ::::::. :.::::::::::: CCDS41 MDMHCKADPFSAMHPGHGGVNQLGGVFVNGRPLPDVVRQRIVELAHQGVRPCDISRQLRV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB7 SHGCVSKILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLA :::::::::.:: ::::: ::.::::::.:.:: :: .: ::...: .::::::::::: CCDS41 SHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVDKIAEYKRQNPTMFAWEIRDRLLA 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB7 DGVCDKYNVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPALPYNHIYSYPSPITA .:.::. .:::::::.::.:.:. :: . . . : : .. . : :: .. CCDS41 EGICDNDTVPSVSSINRIIRTKV----QQPFHPT-PDGAGTGVTAPGHTIVPSTASPPVS 130 140 150 160 170 170 180 190 200 210 220 pF1KB7 AAAKVPTPPGVPAIPGSVAMPRTWPSSHSVTDILGIRSITDQVSDSSPYHSPKVEEWSSL .:.. :. : .: : ...::. CCDS41 SASNDPV--GSYSINGILGIPRSNGEKRKRDEDVSEGSVPNGDSQSGVDSLRKHLRADTF 180 190 200 210 220 230 >>CCDS7499.1 PAX2 gene_id:5076|Hs108|chr10 (396 aa) initn: 690 init1: 662 opt: 685 Z-score: 525.9 bits: 105.9 E(32554): 5.7e-23 Smith-Waterman score: 685; 57.1% identity (78.5% similar) in 191 aa overlap (1-191:13-196) 10 20 30 40 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRV :.:. : :::::::::::::::...: ::::::. :.::::::::::: CCDS74 MDMHCKADPFSAMHPGHGGVNQLGGVFVNGRPLPDVVRQRIVELAHQGVRPCDISRQLRV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB7 SHGCVSKILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLA :::::::::.:: ::::: ::.::::::.:.:: :: .: ::...: .::::::::::: CCDS74 SHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVDKIAEYKRQNPTMFAWEIRDRLLA 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB7 DGVCDKYNVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPALPYNHIYSYPSPITA .:.::. .:::::::.::.:.:. :: . . . : : .. . : :: .. 
CCDS74 EGICDNDTVPSVSSINRIIRTKV----QQPFHPT-PDGAGTGVTAPGHTIVPSTASPPVS 130 140 150 160 170 170 180 190 200 210 220 pF1KB7 AAAKVPTPPGVPAIPGSVAMPRTWPSSHSVTDILGIRSITDQVSDSSPYHSPKVEEWSSL .:.. :. : .: : ...::. CCDS74 SASNDPV--GSYSINGILGIPRSNGEKRKRDEDVSEGSVPNGDSQSGVDSLRKHLRADTF 180 190 200 210 220 230 >>CCDS46399.1 PAX8 gene_id:7849|Hs108|chr2 (398 aa) initn: 688 init1: 627 opt: 664 Z-score: 510.3 bits: 103.0 E(32554): 4.2e-22 Smith-Waterman score: 664; 42.2% identity (68.1% similar) in 301 aa overlap (6-297:11-305) 10 20 30 40 50 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSK : .:::::.::::::::...: :::.::. :.:::::::::::::::::: CCDS46 MPHNSIRSGHGGLNQLGGAFVNGRPLPEVVRQRIVDLAHQGVRPCDISRQLRVSHGCVSK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLADGVCDKY ::.:: ::::: ::.::::::.:.:: ::..: ::...: .:::::::::::.::::. CCDS46 ILGRYYETGSIRPGVIGGSKPKVATPKVVEKIGDYKRQNPTMFAWEIRDRLLAEGVCDND 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 NVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPAL-PYNHIYSYPSPITAAAAKVP .:::::::.::.:.:. . . . .. .: .: : . . :: . . ... CCDS46 TVPSVSSINRIIRTKVQQPFNLPMDSCVATKSLSPGHTLIPSSAVTPPESPQSDSLGSTY 130 140 150 160 170 180 180 190 200 210 220 230 pF1KB7 TPPGVPAIPGSVAMPRTWPSSHSVTDILGIR-SITDQVSDSSPYHSPKVEEWSSLGRNNF . :. .: :.: . . . .: . : :: .: :.:.: . ... .:. .. CCDS46 SINGLLGI----AQPGSDKRKMDDSDQDSCRLSIDSQSSSSGPRKHLRTDAFSQ--HHLE 190 200 210 220 230 240 250 260 270 280 pF1KB7 PAAAPHAVNGL-EKGALEQEAKYGQAPNGLPAVGSFVS--ASSMAPYPTPA--QVSPYMT : : . : : ...: :. :: ..: .. ....: :: ..: ..: CCDS46 PLECPFERQHYPEAYASPSHTKGEQGLYPLPLLNSTLDDGKATLTPSNTPLGRNLSTHQT 240 250 260 270 280 290 290 300 310 320 330 340 pF1KB7 YS--AAPSGYVAGHGWQHAGGTSLSPHNCDIPASLAFKGMQAAREGSHSVTASAL : ::: .. 
CCDS46 YPVVAAPPFWICSKSAPGSRPSMPFPMLPPCTGSSRARPSSQGERWWGPRCPDTHPTSPP 300 310 320 330 340 350 >>CCDS46398.1 PAX8 gene_id:7849|Hs108|chr2 (450 aa) initn: 674 init1: 627 opt: 665 Z-score: 510.2 bits: 103.2 E(32554): 4.3e-22 Smith-Waterman score: 665; 39.8% identity (66.2% similar) in 337 aa overlap (6-332:11-338) 10 20 30 40 50 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSK : .:::::.::::::::...: :::.::. :.:::::::::::::::::: CCDS46 MPHNSIRSGHGGLNQLGGAFVNGRPLPEVVRQRIVDLAHQGVRPCDISRQLRVSHGCVSK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLADGVCDKY ::.:: ::::: ::.::::::.:.:: ::..: ::...: .:::::::::::.::::. CCDS46 ILGRYYETGSIRPGVIGGSKPKVATPKVVEKIGDYKRQNPTMFAWEIRDRLLAEGVCDND 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 NVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPAL-PYNHIYSYPSPITAAAAKVP .:::::::.::.:.:. . . . .. .: .: : . . :: . . ... CCDS46 TVPSVSSINRIIRTKVQQPFNLPMDSCVATKSLSPGHTLIPSSAVTPPESPQSDSLGSTY 130 140 150 160 170 180 180 190 200 210 220 230 pF1KB7 TPPGVPAIPGSVAMPRTWPSSHSVTDILGIR-SITDQVSDSSPYHSPKVEEWSSLGRNNF . :. .: :.: . . . .: . : :: .: :.:.: . ... .:. .. CCDS46 SINGLLGI----AQPGSDKRKMDDSDQDSCRLSIDSQSSSSGPRKHLRTDAFSQ--HHLE 190 200 210 220 230 240 250 260 270 280 pF1KB7 PAAAPHAVNGL-EKGALEQEAKYGQAPNGLPAVGSFVS--ASSMAPYPTPA--QVSPYMT : : . : : ...: :. :: ..: .. ....: :: ..: ..: CCDS46 PLECPFERQHYPEAYASPSHTKGEQGLYPLPLLNSTLDDGKATLTPSNTPLGRNLSTHQT 240 250 260 270 280 290 290 300 310 320 330 340 pF1KB7 YS--AAP-SGYVAGHGWQHAGGTSLSPHNCDIPASLAFKGMQAAREGSHSVTASAL : : : : .. . .....: .: . .: :: .: . 
: CCDS46 YPVVADPHSPFAIKQETPEVSSSSSTPSSL---SSSAFLDLQQVGSGVPPFNAFPHAASV 300 310 320 330 340 350 CCDS46 YGQFTGQALLSGREMVGPTLPGYPPHIPTSGQGSYASSAIAGMVAGSEYSGNAYGHTPYS 360 370 380 390 400 410 >>CCDS42735.1 PAX8 gene_id:7849|Hs108|chr2 (287 aa) initn: 692 init1: 633 opt: 661 Z-score: 510.2 bits: 102.6 E(32554): 4.3e-22 Smith-Waterman score: 663; 43.8% identity (62.7% similar) in 306 aa overlap (6-292:11-286) 10 20 30 40 50 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSK : .:::::.::::::::...: :::.::. :.:::::::::::::::::: CCDS42 MPHNSIRSGHGGLNQLGGAFVNGRPLPEVVRQRIVDLAHQGVRPCDISRQLRVSHGCVSK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLADGVCDKY ::.:: ::::: ::.::::::.:.:: ::..: ::...: .:::::::::::.::::. CCDS42 ILGRYYETGSIRPGVIGGSKPKVATPKVVEKIGDYKRQNPTMFAWEIRDRLLAEGVCDND 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 NVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPALPYNHIYSYPSPITAAAAKVPT .:::::::.::.:.:. :: :.: :. . .: CCDS42 TVPSVSSINRIIRTKV----QQ-----------------PFNL------PMDSCVATKSL 130 140 150 180 190 200 210 220 pF1KB7 PPGVPAIPGSVAMPRTWP------SSHSVTDILGI-------RSITDQVSDSSPYHSPKV :: ::.:.. : : :..:.. .::: :.. :. .:: : CCDS42 SPGHTLIPSSAVTPPESPQSDSLGSTYSINGLLGIAQPGSDKRKMDDSDQDSCRL-SIDS 160 170 180 190 200 210 230 240 250 260 270 pF1KB7 EEWSSLGRNNF--PAAAPHAVNGLEKGALEQEAKYGQA-PN---GLPAVGSFVSASSMAP . :: :... : . : .. :: .:. . : :. : :... : :: CCDS42 QSSSSGPRKHLRTDAFSQHHLEPLECPFERQHYPEAYASPSHTKGEQEVNTL--AMPMAT 220 230 240 250 260 270 280 290 300 310 320 330 pF1KB7 YPTPAQVSPYMTYSAAPSGYVAGHGWQHAGGTSLSPHNCDIPASLAFKGMQAAREGSHSV ::: . : . . : CCDS42 PPTPPTARPGASPTPAC 280 >>CCDS42736.1 PAX8 gene_id:7849|Hs108|chr2 (321 aa) initn: 694 init1: 633 opt: 658 Z-score: 507.2 bits: 102.2 E(32554): 6.2e-22 Smith-Waterman score: 660; 42.8% identity (68.3% similar) in 290 aa overlap (6-281:11-290) 10 20 30 40 50 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSK : .:::::.::::::::...: :::.::. 
:.:::::::::::::::::: CCDS42 MPHNSIRSGHGGLNQLGGAFVNGRPLPEVVRQRIVDLAHQGVRPCDISRQLRVSHGCVSK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLADGVCDKY ::.:: ::::: ::.::::::.:.:: ::..: ::...: .:::::::::::.::::. CCDS42 ILGRYYETGSIRPGVIGGSKPKVATPKVVEKIGDYKRQNPTMFAWEIRDRLLAEGVCDND 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 NVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPAL-PYNHIYSYPSPITAAAAKVP .:::::::.::.:.:. . . . .. .: .: : . . :: . . ... CCDS42 TVPSVSSINRIIRTKVQQPFNLPMDSCVATKSLSPGHTLIPSSAVTPPESPQSDSLGSTY 130 140 150 160 170 180 180 190 200 210 220 pF1KB7 TPPGVPAIPGSVAMPRTWPSSHSVTDILGIR-SITDQVSDSSPYHSPKVEEWSS------ . :. .: :.: . . . .: . : :: .: :.:.: . ... .:. CCDS42 SINGLLGI----AQPGSDKRKMDDSDQDSCRLSIDSQSSSSGPRKHLRTDAFSQHHLEPL 190 200 210 220 230 230 240 250 260 270 280 pF1KB7 ---LGRNNFPAA--APHAVNGLEKGALEQEAKYG-QAPNGLPAVGSFVSASSMAPYPTPA . :...: : .: ..: :.: : .: . :. :. .. ..: : :. : CCDS42 ECPFERQHYPEAYASPSHTKG-EQG----ERWWGPRCPDTHPTSPP-ADRAAMPPLPSQA 240 250 260 270 280 290 290 300 310 320 330 340 pF1KB7 QVSPYMTYSAAPSGYVAGHGWQHAGGTSLSPHNCDIPASLAFKGMQAAREGSHSVTASAL CCDS42 WWQEVNTLAMPMATPPTPPTARPGASPTPAC 300 310 320 >>CCDS46522.1 PAX3 gene_id:5077|Hs108|chr2 (483 aa) initn: 680 init1: 655 opt: 660 Z-score: 506.0 bits: 102.5 E(32554): 7.3e-22 Smith-Waterman score: 671; 39.3% identity (61.9% similar) in 349 aa overlap (6-339:36-368) 10 20 30 pF1KB7 MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQL :.::::::::.::::::: :: .:::.:. CCDS46 GAVPRMMRPGPGQNYPRSGFPLEVSTPLGQGRVNQLGGVFINGRPLPNHIRHKIVEMAHH 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB7 GIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDP ::::: :::::::::::::::: ::.::::: :::::::::.:::: : :.:. ::...: CCDS46 GIRPCVISRQLRVSHGCVSKILCRYQETGSIRPGAIGGSKPKVTTPDVEKKIEEYKRENP 70 80 90 100 110 120 100 110 120 130 140 150 pF1KB7 GIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPAL- :.:.:::::.:: :.:::. .::::::::::::.:.:. .. :. . . . 
: CCDS46 GMFSWEIRDKLLKDAVCDRNTVPSVSSISRILRSKFGKGEEEEADLERKEAEESEKKAKH 130 140 150 160 170 180 160 170 180 190 200 pF1KB7 PYNHIYSY--PSPITAAAAKVPTPPGVPAIPGSVAMPRTWPSSHSVTDILGIRSIT---- . : : .: . .. . . : .: . . :: ..... .. : CCDS46 SIDGILSERASAPQSDEGSDIDSEPDLP-LKRKQRRSRTTFTAEQLEELERAFERTHYPD 190 200 210 220 230 240 210 220 230 240 250 260 pF1KB7 ----DQVSDSSPYHSPKVEEWSSLGRNNFPAAAPHAVNGLEKGALEQEAKYGQAPNGLPA ..... . .:. : : : . : ..: : :... : :...:. CCDS46 IYTREELAQRAKLTEARVQVWFSNRRARWRKQA--GANQLM--AFNHLIPGGFPPTAMPT 250 260 270 280 290 300 270 280 290 300 310 320 pF1KB7 VGSF-VSASSMAPYPTPAQVSPYMTYSAAPSGYVAGHGWQHAGGTSLSPHNCDIPA---S . .. .: .:. : : :: ::. : : : ... :. ::. : CCDS46 LPTYQLSETSYQPTSIPQAVSD-------PSSTV--HRPQPLPPSTV--HQSTIPSNPDS 310 320 330 340 330 340 pF1KB7 LAFKGMQAAREGSHSVTASAL . . ..:.: : : : CCDS46 SSAYCLPSTRHGFSSYTDSFVPPSGPSNPMNPTIGNGLSPQVMGLLTNHGGVPHQPQTDY 350 360 370 380 390 400 341 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 21:31:32 2016 done: Fri Nov 4 21:31:32 2016 Total Scan time: 3.030 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]