FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5822, 543 aa
  1>>>pF1KB5822 543 - 543 aa - 543 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.6923+/-0.000905; mu= 1.2715+/- 0.055
 mean_var=272.9383+/-54.850, 0's: 0 Z-trim(116.2): 27 B-trim: 2 in 1/52
 Lambda= 0.077632
 statistics sampled from 16751 (16775) to 16751 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.805), E-opt: 0.2 (0.515), width: 16
 Scan time: 4.310

The best scores are:                              opt  bits E(32554)
CCDS64425.1 TFEB gene_id:7942|Hs108|chr6  ( 490) 3244 376.3 4.7e-104
CCDS4858.1 TFEB gene_id:7942|Hs108|chr6   ( 476) 3198 371.1 1.6e-102
CCDS64424.1 TFEB gene_id:7942|Hs108|chr6  ( 391) 2138 252.3 7.7e-67
CCDS14315.3 TFE3 gene_id:7030|Hs108|chrX  ( 575) 1129 139.4 1.1e-32
CCDS54607.1 MITF gene_id:4286|Hs108|chr3  ( 468)  770  99.2 1.2e-20
CCDS46865.1 MITF gene_id:4286|Hs108|chr3  ( 504)  770  99.2 1.2e-20
CCDS43106.1 MITF gene_id:4286|Hs108|chr3  ( 520)  770  99.2 1.3e-20
CCDS46866.2 MITF gene_id:4286|Hs108|chr3  ( 357)  755  97.4 3e-20
CCDS43107.1 MITF gene_id:4286|Hs108|chr3  ( 413)  755  97.4 3.4e-20
CCDS5762.1 TFEC gene_id:22797|Hs108|chr7  ( 347)  728  94.4 2.4e-19
CCDS2913.1 MITF gene_id:4286|Hs108|chr3   ( 419)  706  92.0 1.5e-18
CCDS59076.1 TFEC gene_id:22797|Hs108|chr7 ( 280)  669  87.7 2e-17
CCDS34738.1 TFEC gene_id:22797|Hs108|chr7 ( 318)  669  87.7 2.2e-17

>>CCDS64425.1 TFEB gene_id:7942|Hs108|chr6 (490 aa)
 initn: 3244 init1: 3244 opt: 3244 Z-score: 1981.6 bits: 376.3 E(32554): 4.7e-104
Smith-Waterman score: 3244; 100.0% identity (100.0% similar) in 483 aa overlap (61-543:8-490)

               40        50        60        70        80        90
pF1KB5 VPVILASPCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRE
                                     ::::::::::::::::::::::::::::::
CCDS64                        MTASSGWEPAPAATMASRIGLRMQLMREQAQQEEQRE
                                      10        20        30

              100       110       120       130       140       150
pF1KB5 
RMQQQAVMHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 RMQQQAVMHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYH 40 50 60 70 80 90 160 170 180 190 200 210 pF1KB5 LQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 LQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSP 100 110 120 130 140 150 220 230 240 250 260 270 pF1KB5 MAMLHIGSNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 MAMLHIGSNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTAS 160 170 180 190 200 210 280 290 300 310 320 330 pF1KB5 LVGVTSSSCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 LVGVTSSSCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIP 220 230 240 250 260 270 340 350 360 370 380 390 pF1KB5 KANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 KANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQA 280 290 300 310 320 330 400 410 420 430 440 450 pF1KB5 RVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 RVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPL 340 350 360 370 380 390 460 470 480 490 500 510 pF1KB5 PTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 PTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLL 400 410 420 430 440 450 520 530 540 pF1KB5 PLASDPLLSTMSPEASKASSRRSSFSMEEGDVL ::::::::::::::::::::::::::::::::: CCDS64 PLASDPLLSTMSPEASKASSRRSSFSMEEGDVL 460 470 480 490 >>CCDS4858.1 TFEB gene_id:7942|Hs108|chr6 (476 aa) initn: 3198 init1: 3198 opt: 3198 Z-score: 1954.0 bits: 371.1 E(32554): 1.6e-102 Smith-Waterman score: 3198; 100.0% identity 
(100.0% similar) in 476 aa overlap (68-543:1-476) 40 50 60 70 80 90 pF1KB5 PCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAV :::::::::::::::::::::::::::::: CCDS48 MASRIGLRMQLMREQAQQEEQRERMQQQAV 10 20 30 100 110 120 130 140 150 pF1KB5 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQ 40 50 60 70 80 90 160 170 180 190 200 210 pF1KB5 KVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 KVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIG 100 110 120 130 140 150 220 230 240 250 260 270 pF1KB5 SNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 SNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS 160 170 180 190 200 210 280 290 300 310 320 330 pF1KB5 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV 220 230 240 250 260 270 340 350 360 370 380 390 pF1KB5 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT 280 290 300 310 320 330 400 410 420 430 440 450 pF1KB5 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP 340 350 360 370 380 390 460 470 480 490 500 510 pF1KB5 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL 400 410 420 430 440 450 520 530 540 pF1KB5 LSTMSPEASKASSRRSSFSMEEGDVL :::::::::::::::::::::::::: 
CCDS48 LSTMSPEASKASSRRSSFSMEEGDVL 460 470 >>CCDS64424.1 TFEB gene_id:7942|Hs108|chr6 (391 aa) initn: 2226 init1: 2135 opt: 2138 Z-score: 1313.5 bits: 252.3 E(32554): 7.7e-67 Smith-Waterman score: 2445; 82.1% identity (82.1% similar) in 476 aa overlap (68-543:1-391) 40 50 60 70 80 90 pF1KB5 PCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAV :::::::::::::::::::::::::::::: CCDS64 MASRIGLRMQLMREQAQQEEQRERMQQQAV 10 20 30 100 110 120 130 140 150 pF1KB5 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQ ::::::::::::::::::::::::::::::::::::::::: CCDS64 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLK------------------- 40 50 60 70 160 170 180 190 200 210 pF1KB5 KVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIG CCDS64 ------------------------------------------------------------ 220 230 240 250 260 270 pF1KB5 SNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 ------LDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS 80 90 100 110 120 280 290 300 310 320 330 pF1KB5 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV 130 140 150 160 170 180 340 350 360 370 380 390 pF1KB5 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT 190 200 210 220 230 240 400 410 420 430 440 450 pF1KB5 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP 250 260 270 280 290 300 460 470 480 490 500 510 pF1KB5 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL 310 
320 330 340 350 360 520 530 540 pF1KB5 LSTMSPEASKASSRRSSFSMEEGDVL :::::::::::::::::::::::::: CCDS64 LSTMSPEASKASSRRSSFSMEEGDVL 370 380 390 >>CCDS14315.3 TFE3 gene_id:7030|Hs108|chrX (575 aa) initn: 1059 init1: 553 opt: 1129 Z-score: 700.5 bits: 139.4 E(32554): 1.1e-32 Smith-Waterman score: 1223; 46.8% identity (69.1% similar) in 511 aa overlap (60-539:102-573) 30 40 50 60 70 80 pF1KB5 AVPVILASPCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQR : :: ... .::. ::.:::: :::..:.: CCDS14 LPLRSSLPISLQATPATPATLSASSSAGGSRTPAMSSSSSSRVLLRQQLMRAQAQEQERR 80 90 100 110 120 130 90 100 110 120 130 140 pF1KB5 ERMQQQAVMHYMQQQQQQQQQQLGGPPTPAINT-----PVHFQS-PPP--VPGEVLKVQS :: .: :. . . .: .:::.. : : ::: :: ::::::. CCDS14 ERREQAAAAPFPSP----------APASPAISVVGVSAGGHTLSRPPPAQVPREVLKVQT 140 150 160 170 180 150 160 170 180 190 200 pF1KB5 YLENPTSYHLQQSQHQKVREYLSETYGNKFAAH-ISPAQGSPKPPPAASPGVRAGHVLSS .::::: :::::...:.:..::: : : :.:.. ..: : . : .: .:.: .. CCDS14 HLENPTRYHLQQARRQQVKQYLSTTLGPKLASQALTPPPGPASAQPLPAP--EAAH--TT 190 200 210 220 230 210 220 230 240 250 pF1KB5 SAGNSAPNSPMAMLHIGSNPERELDDVIDNIMRL-----DDVLGYI---NPEMQMPNTLP . .::::::::.: :::. :.:.:::::.:. : :..:.:. . .:.:.::: CCDS14 GPTGSAPNSPMALLTIGSSSEKEIDDVIDEIISLESSYNDEMLSYLPGGTTGLQLPSTLP 240 250 260 270 280 290 260 270 280 290 300 310 pF1KB5 LSSSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKERQKKDNHNLIE .:.. :.:::: : .:. . ..:.::::.: . :::....:..:: ::::::::::::: CCDS14 VSGNLLDVYSS--QGVATPAITVSNSCPAELPNIKREISETEAKALLKERQKKDNHNLIE 300 310 320 330 340 350 320 330 340 350 360 370 pF1KB5 RRRRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRR ::::::::::::::: ::::..: ..::::::::::::::::..::. :.:..::...: CCDS14 RRRRFNINDRIKELGTLIPKSSDPEMRWNKGTILKASVDYIRKLQKEQQRSKDLESRQRS 360 370 380 390 400 410 380 390 400 410 420 pF1KB5 LEMTNKQLWLRIQELEMQARVHGLPTT-SPSGMNMAEL-AQQVVKQELPS--EEG-PGEA ::..:..: :::::::.::..::::. .:. ...: :.. .: : . 
::: :: : CCDS14 LEQANRSLQLRIQELELQAQIHGLPVPPTPGLLSLATTSASDSLKPEQLDIEEEGRPGAA 420 430 440 450 460 470 430 440 450 460 470 pF1KB5 LML---GAEVPDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFG-----GREDEGPP . : :. : ::. : : . :: : :.. . .: .:.:: CCDS14 TFHVGGGPAQNAPHQQPPAPPSDAL-LDLHFPSD-HLGDLGDPFHLGLEDILMEEEEGVV 480 490 500 510 520 530 480 490 500 510 520 530 pF1KB5 GYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPLLSTMSPEASKASSRRSSFSME : :. : :: : :::::::..:: .::::::::::::: CCDS14 G---GLSGGALSP------------------LRAASDPLLSSVSPAVSKASSRRSSFSME 540 550 560 570 540 pF1KB5 EGDVL : CCDS14 EES >>CCDS54607.1 MITF gene_id:4286|Hs108|chr3 (468 aa) initn: 1101 init1: 612 opt: 770 Z-score: 484.4 bits: 99.2 E(32554): 1.2e-20 Smith-Waterman score: 1209; 45.8% identity (67.0% similar) in 509 aa overlap (68-539:1-463) 40 50 60 70 80 90 pF1KB5 PCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAV :.::: ::.:::::: :..:.::..:. . CCDS54 MTSRILLRQQLMREQMQEQERREQQQKLQA 10 20 30 100 110 120 130 140 150 pF1KB5 MHYMQQQQQQQQQQLGGPPTPAINT--PVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQ ..:::. .: :::::. :. . : :: ::::::..:::::.::.::.: CCDS54 AQFMQQRVPVSQ-------TPAINVSVPTTLPSATQVPMEVLKVQTHLENPTKYHIQQAQ 40 50 60 70 80 160 170 180 190 200 210 pF1KB5 HQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLH .:.:..::: : .:: : . . . : : .:: ::. :.:::::::::: CCDS54 RQQVKQYLSTTLANK---HANQVLSLPCP---NQPG---DHVMPPVPGSSAPNSPMAMLT 90 100 110 120 130 220 230 240 pF1KB5 IGSNPERE----------------------------LDDVIDNIMRLD-----DVLGYIN ..:: :.: .:::::.:. :. ..:: .. CCDS54 LNSNCEKEGFYKFEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMD 140 150 160 170 180 190 250 260 270 280 290 300 pF1KB5 PEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKER : .:: ::::.:.. ...:... .: . :.::::.: . :::::..:.::::::: CCDS54 PALQMANTLPVSGNLIDLYGNQGLPPPGL--TISNSCPANLPNIKRELTESEARALAKER 200 210 220 230 240 250 310 320 330 340 350 360 pF1KB5 QKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQK ::::::::::::::::::::::::: ::::.:: :.::::::::::::::::..:.. :. 
CCDS54 QKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQR 260 270 280 290 300 310 370 380 390 400 410 420 pF1KB5 SRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEE ..::::....:: .:..: :::::::::::.::: .:. .:.....::: : : CCDS54 AKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQE-PVLE 320 330 340 350 360 370 430 440 450 460 470 480 pF1KB5 GPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYP . .. : : .: : : . :...:. : :. .: CCDS54 NCSQDL--------------LQHHADLTCTTTLDLTDGTITFNNNLGTG---TEANQAYS 380 390 400 410 490 500 510 520 530 540 pF1KB5 EPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLA-SDPLLSTMSPEASKASSRRSSFSMEEG : : :: :. .:.::.: :.. .:::::..:: :::.::::::.:::: CCDS54 VPTKMG--------SK--LEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEET 420 430 440 450 460 pF1KB5 DVL CCDS54 EHTC >>CCDS46865.1 MITF gene_id:4286|Hs108|chr3 (504 aa) initn: 1101 init1: 612 opt: 770 Z-score: 484.0 bits: 99.2 E(32554): 1.2e-20 Smith-Waterman score: 1225; 44.0% identity (65.2% similar) in 552 aa overlap (28-539:1-499) 10 20 30 40 50 pF1KB5 MSQLSPACSVTLGKSLPLSGLGVFSSKMDAVPVILASPCQPLCFEEDTCLIYLLPLL--- :.:. : . ::. :: .:: CCDS46 MEALRVQMFMPCS---FES----LYLSSAEHPG 10 20 60 70 80 90 100 110 pF1KB5 IHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAVMHYMQQQQQQQQQQLGGPPT . : ...:.::: ::.:::::: :..:.::..:. . ..:::. .: : CCDS46 ASKPPISSSSMTSRILLRQQLMREQMQEQERREQQQKLQAAQFMQQRVPVSQ-------T 30 40 50 60 70 120 130 140 150 160 170 pF1KB5 PAINT--PVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQKVREYLSETYGNKFAAHI ::::. :. . : :: ::::::..:::::.::.::.:.:.:..::: : .:: : CCDS46 PAINVSVPTTLPSATQVPMEVLKVQTHLENPTKYHIQQAQRQQVKQYLSTTLANK---HA 80 90 100 110 120 130 180 190 200 210 220 pF1KB5 SPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIGSNPERE------------ . . . : : .:: ::. :.:::::::::: ..:: :.: CCDS46 NQVLSLPCP---NQPG---DHVMPPVPGSSAPNSPMAMLTLNSNCEKEGFYKFEEQNRAE 140 150 160 170 180 190 230 240 250 260 pF1KB5 ----------------LDDVIDNIMRLD-----DVLGYINPEMQMPNTLPLSSSHLNVYS .:::::.:. :. ..:: ..: .:: ::::.:.. ...:. 
CCDS46 SECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMDPALQMANTLPVSGNLIDLYG 200 210 220 230 240 250 270 280 290 300 310 320 pF1KB5 SDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKERQKKDNHNLIERRRRFNINDR .. .: . :.::::.: . :::::..:.::::::::::::::::::::::::::: CCDS46 NQGLPPPGL--TISNSCPANLPNIKRELTESEARALAKERQKKDNHNLIERRRRFNINDR 260 270 280 290 300 330 340 350 360 370 380 pF1KB5 IKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWL ::::: ::::.:: :.::::::::::::::::..:.. :...::::....:: .:..: : CCDS46 IKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQRAKELENRQKKLEHANRHLLL 310 320 330 340 350 360 390 400 410 420 430 440 pF1KB5 RIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPA ::::::::::.::: .:. .:.....::: : :. .. : CCDS46 RIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQE-PVLENCSQDL-------------- 370 380 390 400 410 450 460 470 480 490 500 pF1KB5 LPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLD : .: : : . :...:. : :. .: : : :: :. CCDS46 LQHHADLTCTTTLDLTDGTITFNNNLGTG---TEANQAYSVPTKMG--------SK--LE 420 430 440 450 460 510 520 530 540 pF1KB5 LMLLDDSLLPLA-SDPLLSTMSPEASKASSRRSSFSMEEGDVL .:.::.: :.. .:::::..:: :::.::::::.:::: CCDS46 DILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEETEHTC 470 480 490 500 >>CCDS43106.1 MITF gene_id:4286|Hs108|chr3 (520 aa) initn: 1101 init1: 612 opt: 770 Z-score: 483.8 bits: 99.2 E(32554): 1.3e-20 Smith-Waterman score: 1223; 45.3% identity (66.9% similar) in 517 aa overlap (60-539:45-515) 30 40 50 60 70 80 pF1KB5 AVPVILASPCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQR . : ...:.::: ::.:::::: :..:.: CCDS43 EEFHEEPKTYYELKSQPLKSSSSAEHPGASKPPISSSSMTSRILLRQQLMREQMQEQERR 20 30 40 50 60 70 90 100 110 120 130 140 pF1KB5 ERMQQQAVMHYMQQQQQQQQQQLGGPPTPAINT--PVHFQSPPPVPGEVLKVQSYLENPT :..:. . ..:::. .: :::::. :. . : :: ::::::..::::: CCDS43 EQQQKLQAAQFMQQRVPVSQ-------TPAINVSVPTTLPSATQVPMEVLKVQTHLENPT 80 90 100 110 120 150 160 170 180 190 200 pF1KB5 SYHLQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAP .::.::.:.:.:..::: : .:: : . . . : : .:: ::. 
:.::: CCDS43 KYHIQQAQRQQVKQYLSTTLANK---HANQVLSLPCP---NQPG---DHVMPPVPGSSAP 130 140 150 160 170 210 220 230 pF1KB5 NSPMAMLHIGSNPERE----------------------------LDDVIDNIMRLD---- ::::::: ..:: :.: .:::::.:. :. CCDS43 NSPMAMLTLNSNCEKEGFYKFEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYN 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB5 -DVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAE ..:: ..: .:: ::::.:.. ...:... .:. :.::::.: . :::::..: CCDS43 EEILGLMDPALQMANTLPVSGNLIDLYGNQGLPPPGLT--ISNSCPANLPNIKRELTESE 240 250 260 270 280 290 300 310 320 330 340 350 pF1KB5 SRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIR .:::::::::::::::::::::::::::::::: ::::.:: :.:::::::::::::::: CCDS43 ARALAKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIR 300 310 320 330 340 350 360 370 380 390 400 410 pF1KB5 RMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVV ..:.. :...::::....:: .:..: :::::::::::.::: .:. .:..... CCDS43 KLQREQQRAKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRII 360 370 380 390 400 410 420 430 440 450 460 470 pF1KB5 KQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGRE ::: : :. .. : : .: : : . :...:. : CCDS43 KQE-PVLENCSQDL--------------LQHHADLTCTTTLDLTDGTITFNNNLGTG--- 420 430 440 450 480 490 500 510 520 530 pF1KB5 DEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLA-SDPLLSTMSPEASKASSRR :. .: : : :: :. .:.::.: :.. .:::::..:: :::.:::: CCDS43 TEANQAYSVPTKMG--------SK--LEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRR 460 470 480 490 500 540 pF1KB5 SSFSMEEGDVL ::.:::: CCDS43 SSMSMEETEHTC 510 520 >>CCDS46866.2 MITF gene_id:4286|Hs108|chr3 (357 aa) initn: 925 init1: 612 opt: 755 Z-score: 476.9 bits: 97.4 E(32554): 3e-20 Smith-Waterman score: 877; 43.0% identity (64.5% similar) in 409 aa overlap (138-539:11-352) 110 120 130 140 150 160 pF1KB5 QQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQKVREYLSETY .::..:::::.::.::.:.:. . 
: CCDS46 MLEMLEYNHYQVQTHLENPTKYHIQQAQRQQGFYKFEEQ- 10 20 30 170 180 190 200 210 220 pF1KB5 GNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIGSNPERELDDV :. . ::. . : .: ..::: CCDS46 -NR--------------AESECPGMNT-HSRASCM--------------------QMDDV 40 50 60 230 240 250 260 270 280 pF1KB5 IDNIMRLD-----DVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSSSCPAD ::.:. :. ..:: ..: .:: ::::.:.. ...:... .:. :.::::. CCDS46 IDDIISLESSYNEEILGLMDPALQMANTLPVSGNLIDLYGNQGLPPPGLT--ISNSCPAN 70 80 90 100 110 120 290 300 310 320 330 340 pF1KB5 LTQ-KRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDVRWNK : . :::::..:.:::::::::::::::::::::::::::::::: ::::.:: :.:::: CCDS46 LPNIKRELTESEARALAKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNK 130 140 150 160 170 180 350 360 370 380 390 400 pF1KB5 GTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPTTSPS ::::::::::::..:.. :...::::....:: .:..: :::::::::::.::: . CCDS46 GTILKASVDYIRKLQREQQRAKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPST 190 200 210 220 230 240 410 420 430 440 450 460 pF1KB5 GMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSPFHHL :. .:.....::: : :. .. : : .: : : . CCDS46 GLCSPDLVNRIIKQE-PVLENCSQDL--------------LQHHADLTCTTTLDLTDGTI 250 260 270 280 470 480 490 500 510 520 pF1KB5 DFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLA-SDPLLST :...:. : :. .: : : :: :. .:.::.: :.. .:::::. 
CCDS46 TFNNNLGTG---TEANQAYSVPTKMG--------SK--LEDILMDDTLSPVGVTDPLLSS 290 300 310 320 330 530 540 pF1KB5 MSPEASKASSRRSSFSMEEGDVL .:: :::.::::::.:::: CCDS46 VSPGASKTSSRRSSMSMEETEHTC 340 350 >>CCDS43107.1 MITF gene_id:4286|Hs108|chr3 (413 aa) initn: 1079 init1: 612 opt: 755 Z-score: 476.1 bits: 97.4 E(32554): 3.4e-20 Smith-Waterman score: 1039; 45.5% identity (66.8% similar) in 437 aa overlap (138-539:11-408) 110 120 130 140 150 160 pF1KB5 QQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQKVREYLSETY .::..:::::.::.::.:.:.:..::: : CCDS43 MLEMLEYNHYQVQTHLENPTKYHIQQAQRQQVKQYLSTTL 10 20 30 40 170 180 190 200 210 220 pF1KB5 GNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIGSNPERE---- .:: : . . . : : .:: ::. :.:::::::::: ..:: :.: CCDS43 ANK---HANQVLSLPCP---NQPG---DHVMPPVPGSSAPNSPMAMLTLNSNCEKEGFYK 50 60 70 80 90 230 240 250 pF1KB5 ------------------------LDDVIDNIMRLD-----DVLGYINPEMQMPNTLPLS .:::::.:. :. ..:: ..: .:: ::::.: CCDS43 FEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMDPALQMANTLPVS 100 110 120 130 140 150 260 270 280 290 300 310 pF1KB5 SSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKERQKKDNHNLIERR .. ...:... .:. :.::::.: . :::::..:.::::::::::::::::::: CCDS43 GNLIDLYGNQGLPPPGLT--ISNSCPANLPNIKRELTESEARALAKERQKKDNHNLIERR 160 170 180 190 200 320 330 340 350 360 370 pF1KB5 RRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLE ::::::::::::: ::::.:: :.::::::::::::::::..:.. :...::::....:: CCDS43 RRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQRAKELENRQKKLE 210 220 230 240 250 260 380 390 400 410 420 430 pF1KB5 MTNKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEV .:..: :::::::::::.::: .:. .:.....::: : :. .. : CCDS43 HANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQE-PVLENCSQDL------ 270 280 290 300 310 320 440 450 460 470 480 490 pF1KB5 PDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFP : .: : : . :...:. : :. 
.: : : CCDS43 --------LQHHADLTCTTTLDLTDGTITFNNNLGTG---TEANQAYSVPTKMG------ 330 340 350 360 500 510 520 530 540 pF1KB5 SLSKKDLDLMLLDDSLLPLA-SDPLLSTMSPEASKASSRRSSFSMEEGDVL :: :. .:.::.: :.. .:::::..:: :::.::::::.:::: CCDS43 --SK--LEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEETEHTC 370 380 390 400 410 >>CCDS5762.1 TFEC gene_id:22797|Hs108|chr7 (347 aa) initn: 826 init1: 661 opt: 728 Z-score: 460.7 bits: 94.4 E(32554): 2.4e-19 Smith-Waterman score: 840; 43.9% identity (70.3% similar) in 380 aa overlap (175-543:8-347) 150 160 170 180 190 200 pF1KB5 NPTSYHLQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPP-PAASPGVRAGHV-LSSSA :.:. .: :...: :. .:. :.:.: CCDS57 MTLDHQIINPTLKWSQPAVPSGGPLVQHAHTTLDSDA 10 20 30 210 220 230 240 250 pF1KB5 GNSAPNSPMA-MLHIGS---NPERELDDVIDNIMRLDDVL---GYINPEMQMPNTLPLSS : . ..:.. .: ::. : . ...:::..:. ... . : .: . : :: :. CCDS57 GLT--ENPLTKLLAIGKEDDNAQWHMEDVIEDIIGMESSFKEEGADSP-LLMQRTL--SG 40 50 60 70 80 90 260 270 280 290 300 310 pF1KB5 SHLNVYSSDPQVTASLVGVTSSSCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRR : :.:::.. .. .:.::.:::..: .:::.:....::::::::::::::::::::: CCDS57 SILDVYSGEQGISPINMGLTSASCPSSLPMKREITETDTRALAKERQKKDNHNLIERRRR 100 110 120 130 140 150 320 330 340 350 360 370 pF1KB5 FNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMT .::: :::::: ::::.:: :.::::::::::::.::. .::. :..::::.....::.. CCDS57 YNINYRIKELGTLIPKSNDPDMRWNKGTILKASVEYIKWLQKEQQRARELEHRQKKLEQA 160 170 180 190 200 210 380 390 400 410 420 430 pF1KB5 NKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGE--ALMLGAEV :..: :::::::.:::.::::: . : ..:. .:.::. :.. . . .. CCDS57 NRRLLLRIQELEIQARTHGLPTLASLG--TVDLGAHVTKQQSHPEQNSVDYCQQLTVSQG 220 230 240 250 260 270 440 450 460 470 480 490 pF1KB5 PDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFP :.:: : :: . . ..: : : :.:: .: .:.. CCDS57 PSPE----LCDQA-IAF-SDPLSYFTDLSFSAAL----KEEQ------------------ 280 290 300 500 510 520 530 540 pF1KB5 SLSKKDLDLMLLDDSLLPLASDPLLSTMSPEASKASSRRSSFSMEEGDVL :: :::::.. :...:::::. 
:: .:: :::::::: ..:: : CCDS57 -----RLDGMLLDDTISPFGTDPLLSATSPAVSKESSRRSSFSSDDGDEL 310 320 330 340

543 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 10:26:16 2016 done: Sat Nov 5 10:26:16 2016
 Total Scan time: 4.310 Total Display time: 0.090

Function used was FASTA [36.3.4 Apr, 2011]