FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7282, 705 aa 1>>>pF1KB7282 705 - 705 aa - 705 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.9679+/-0.00103; mu= 3.9399+/- 0.062 mean_var=195.6169+/-39.113, 0's: 0 Z-trim(110.5): 26 B-trim: 9 in 1/52 Lambda= 0.091700 statistics sampled from 11648 (11666) to 11648 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.7), E-opt: 0.2 (0.358), width: 16 Scan time: 4.110 The best scores are: opt bits E(32554) CCDS9108.1 RFX4 gene_id:5992|Hs108|chr12 ( 641) 3348 455.8 8.9e-128 CCDS9106.1 RFX4 gene_id:5992|Hs108|chr12 ( 735) 3348 455.9 9.9e-128 CCDS55880.1 RFX4 gene_id:5992|Hs108|chr12 ( 744) 3348 455.9 1e-127 CCDS5113.1 RFX6 gene_id:222546|Hs108|chr6 ( 928) 623 95.4 4e-19 >>CCDS9108.1 RFX4 gene_id:5992|Hs108|chr12 (641 aa) initn: 3339 init1: 3339 opt: 3348 Z-score: 2407.5 bits: 455.8 E(32554): 8.9e-128 Smith-Waterman score: 3839; 91.4% identity (93.3% similar) in 641 aa overlap (98-705:1-641) 70 80 90 100 110 120 pF1KB7 NYEIAEGVCIPRSALYMHYLDFCEKNDTQPVNAASFG--KIIRQQFPQLTTRRLGTRGQS .: :.:: ... . :. .: .:. . CCDS91 MNWAAFGGSEFFIPEGIQIDSRCPLSRNIT 10 20 30 130 140 150 160 170 180 pF1KB7 K-YHYYGIAVKESSQYYDVMYSKKGAAWVSETGKKEVSKQTVAYSPRSKLGTLLPEFPNV . :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 EWYHYYGIAVKESSQYYDVMYSKKGAAWVSETGKKEVSKQTVAYSPRSKLGTLLPEFPNV 40 50 60 70 80 90 190 200 210 pF1KB7 KDLNLPASLPEEKVSTFIMMYRTHCQRIL------------------------------G ::::::::::::::::::::::::::::: : CCDS91 KDLNLPASLPEEKVSTFIMMYRTHCQRILDTVIRANFDEVQSFLLHFWQGMPPHMLPVLG 100 110 120 130 140 150 220 230 240 250 260 270 pF1KB7 SSTVVNIVGVCDSILYKAISGVLMPTVLQALPDSLTQVIRKFAKQLDEWLKVALHDLPEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 SSTVVNIVGVCDSILYKAISGVLMPTVLQALPDSLTQVIRKFAKQLDEWLKVALHDLPEN 160 170 180 190 200 210 280 290 300 310 320 330 pF1KB7 LRNIKFELSRRFSQILRRQTSLNHLCQASRTVIHSADITFQMLEDWRNVDLNSITKQTLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 LRNIKFELSRRFSQILRRQTSLNHLCQASRTVIHSADITFQMLEDWRNVDLNSITKQTLY 220 230 240 250 260 270 340 350 360 370 380 390 pF1KB7 TMEDSRDEHRKLITQLYQEFDHLLEEQSPIESYIEWLDTMVDRCVVKVAAKRQGSLKKVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 TMEDSRDEHRKLITQLYQEFDHLLEEQSPIESYIEWLDTMVDRCVVKVAAKRQGSLKKVA 280 290 300 310 320 330 400 410 420 430 440 450 pF1KB7 QQFLLMWSCFGTRVIRDMTLHSAPSFGSFHLIHLMFDDYVLYLLESLHCQERANELMRAM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 QQFLLMWSCFGTRVIRDMTLHSAPSFGSFHLIHLMFDDYVLYLLESLHCQERANELMRAM 340 350 360 370 380 390 460 470 480 490 500 510 pF1KB7 KGEGSTAEVREEIILTEAAAPTPSPVPSFSPAKSATSVEVPPPSSPVSNPSPEYTGLSTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 KGEGSTAEVREEIILTEAAAPTPSPVPSFSPAKSATSVEVPPPSSPVSNPSPEYTGLSTT 400 410 420 430 440 450 520 530 540 550 560 570 pF1KB7 GAMQSYTWSLTYTVTTAAGSPAENSQQLPCMRNTHVPSSSVTHRIPVYPHREEHGYTGSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 GAMQSYTWSLTYTVTTAAGSPAENSQQLPCMRNTHVPSSSVTHRIPVYPHREEHGYTGSY 460 470 480 490 500 510 580 590 600 610 620 630 pF1KB7 NYGSYGNQHPHPMQSQYPALPHDTAISGPLHYAPYHRSSAQYPFNSPTSRMEPCLMSSTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 NYGSYGNQHPHPMQSQYPALPHDTAISGPLHYAPYHRSSAQYPFNSPTSRMEPCLMSSTP 520 530 540 550 560 570 640 650 660 670 680 690 pF1KB7 RLHPTPVTPRWPEVPSANTCYTSPSVHSARYGNSSDMYTPLTTRRNSEYEHMQHFPGFAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 RLHPTPVTPRWPEVPSANTCYTSPSVHSARYGNSSDMYTPLTTRRNSEYEHMQHFPGFAY 580 590 600 610 620 630 700 pF1KB7 INGEASTGWAK ::::::::::: CCDS91 INGEASTGWAK 640 >>CCDS9106.1 RFX4 gene_id:5992|Hs108|chr12 (735 aa) initn: 3339 init1: 3339 opt: 3348 Z-score: 2406.6 bits: 455.9 E(32554): 9.9e-128 Smith-Waterman score: 4694; 95.9% identity (95.9% similar) in 735 aa overlap (1-705:1-735) 10 20 30 40 50 60 pF1KB7 MHCGLLEEPDMDSTESWIERCLNESENKRYSSHTSLGNVSNDENEEKENNRASKPHSTPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 MHCGLLEEPDMDSTESWIERCLNESENKRYSSHTSLGNVSNDENEEKENNRASKPHSTPA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 TLQWLEENYEIAEGVCIPRSALYMHYLDFCEKNDTQPVNAASFGKIIRQQFPQLTTRRLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 TLQWLEENYEIAEGVCIPRSALYMHYLDFCEKNDTQPVNAASFGKIIRQQFPQLTTRRLG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 TRGQSKYHYYGIAVKESSQYYDVMYSKKGAAWVSETGKKEVSKQTVAYSPRSKLGTLLPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 TRGQSKYHYYGIAVKESSQYYDVMYSKKGAAWVSETGKKEVSKQTVAYSPRSKLGTLLPE 130 140 150 160 170 180 190 200 210 pF1KB7 FPNVKDLNLPASLPEEKVSTFIMMYRTHCQRIL--------------------------- ::::::::::::::::::::::::::::::::: CCDS91 FPNVKDLNLPASLPEEKVSTFIMMYRTHCQRILDTVIRANFDEVQSFLLHFWQGMPPHML 190 200 210 220 230 240 220 230 240 250 260 270 pF1KB7 ---GSSTVVNIVGVCDSILYKAISGVLMPTVLQALPDSLTQVIRKFAKQLDEWLKVALHD ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 PVLGSSTVVNIVGVCDSILYKAISGVLMPTVLQALPDSLTQVIRKFAKQLDEWLKVALHD 250 260 270 280 290 300 280 290 300 310 320 330 pF1KB7 LPENLRNIKFELSRRFSQILRRQTSLNHLCQASRTVIHSADITFQMLEDWRNVDLNSITK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 LPENLRNIKFELSRRFSQILRRQTSLNHLCQASRTVIHSADITFQMLEDWRNVDLNSITK 310 320 330 340 350 360 340 350 360 370 380 390 pF1KB7 QTLYTMEDSRDEHRKLITQLYQEFDHLLEEQSPIESYIEWLDTMVDRCVVKVAAKRQGSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 QTLYTMEDSRDEHRKLITQLYQEFDHLLEEQSPIESYIEWLDTMVDRCVVKVAAKRQGSL 370 380 390 400 410 420 400 410 420 430 440 450 pF1KB7 KKVAQQFLLMWSCFGTRVIRDMTLHSAPSFGSFHLIHLMFDDYVLYLLESLHCQERANEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 KKVAQQFLLMWSCFGTRVIRDMTLHSAPSFGSFHLIHLMFDDYVLYLLESLHCQERANEL 430 440 450 460 470 480 460 470 480 490 500 510 pF1KB7 MRAMKGEGSTAEVREEIILTEAAAPTPSPVPSFSPAKSATSVEVPPPSSPVSNPSPEYTG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 MRAMKGEGSTAEVREEIILTEAAAPTPSPVPSFSPAKSATSVEVPPPSSPVSNPSPEYTG 490 500 510 520 530 540 520 530 540 550 560 570 pF1KB7 LSTTGAMQSYTWSLTYTVTTAAGSPAENSQQLPCMRNTHVPSSSVTHRIPVYPHREEHGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 LSTTGAMQSYTWSLTYTVTTAAGSPAENSQQLPCMRNTHVPSSSVTHRIPVYPHREEHGY 550 560 570 580 590 600 580 590 600 610 620 630 pF1KB7 TGSYNYGSYGNQHPHPMQSQYPALPHDTAISGPLHYAPYHRSSAQYPFNSPTSRMEPCLM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 TGSYNYGSYGNQHPHPMQSQYPALPHDTAISGPLHYAPYHRSSAQYPFNSPTSRMEPCLM 610 620 630 640 650 660 640 650 660 670 680 690 pF1KB7 SSTPRLHPTPVTPRWPEVPSANTCYTSPSVHSARYGNSSDMYTPLTTRRNSEYEHMQHFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS91 SSTPRLHPTPVTPRWPEVPSANTCYTSPSVHSARYGNSSDMYTPLTTRRNSEYEHMQHFP 670 680 690 700 710 720 700 pF1KB7 GFAYINGEASTGWAK ::::::::::::::: CCDS91 GFAYINGEASTGWAK 730 >>CCDS55880.1 RFX4 gene_id:5992|Hs108|chr12 (744 aa) initn: 3339 init1: 3339 opt: 3348 Z-score: 2406.6 bits: 455.9 E(32554): 1e-127 Smith-Waterman score: 4601; 95.9% identity (95.9% similar) in 723 aa overlap (13-705:22-744) 10 20 30 40 50 pF1KB7 MHCGLLEEPDMDSTESWIERCLNESENKRYSSHTSLGNVSNDENEEKENNR ::::::::::::::::::::::::::::::::::::::: CCDS55 MIKRRAHPGAGGDRTRPRRRRSTESWIERCLNESENKRYSSHTSLGNVSNDENEEKENNR 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ASKPHSTPATLQWLEENYEIAEGVCIPRSALYMHYLDFCEKNDTQPVNAASFGKIIRQQF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ASKPHSTPATLQWLEENYEIAEGVCIPRSALYMHYLDFCEKNDTQPVNAASFGKIIRQQF 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 PQLTTRRLGTRGQSKYHYYGIAVKESSQYYDVMYSKKGAAWVSETGKKEVSKQTVAYSPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 PQLTTRRLGTRGQSKYHYYGIAVKESSQYYDVMYSKKGAAWVSETGKKEVSKQTVAYSPR 130 140 150 160 170 180 180 190 200 210 pF1KB7 SKLGTLLPEFPNVKDLNLPASLPEEKVSTFIMMYRTHCQRIL------------------ :::::::::::::::::::::::::::::::::::::::::: CCDS55 SKLGTLLPEFPNVKDLNLPASLPEEKVSTFIMMYRTHCQRILDTVIRANFDEVQSFLLHF 190 200 210 220 230 240 220 230 240 250 260 pF1KB7 ------------GSSTVVNIVGVCDSILYKAISGVLMPTVLQALPDSLTQVIRKFAKQLD :::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 WQGMPPHMLPVLGSSTVVNIVGVCDSILYKAISGVLMPTVLQALPDSLTQVIRKFAKQLD 250 260 270 280 290 300 270 280 290 300 310 320 pF1KB7 EWLKVALHDLPENLRNIKFELSRRFSQILRRQTSLNHLCQASRTVIHSADITFQMLEDWR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EWLKVALHDLPENLRNIKFELSRRFSQILRRQTSLNHLCQASRTVIHSADITFQMLEDWR 310 320 330 340 350 360 330 340 350 360 370 380 pF1KB7 NVDLNSITKQTLYTMEDSRDEHRKLITQLYQEFDHLLEEQSPIESYIEWLDTMVDRCVVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 NVDLNSITKQTLYTMEDSRDEHRKLITQLYQEFDHLLEEQSPIESYIEWLDTMVDRCVVK 370 380 390 400 410 420 390 400 410 420 430 440 pF1KB7 VAAKRQGSLKKVAQQFLLMWSCFGTRVIRDMTLHSAPSFGSFHLIHLMFDDYVLYLLESL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 VAAKRQGSLKKVAQQFLLMWSCFGTRVIRDMTLHSAPSFGSFHLIHLMFDDYVLYLLESL 430 440 450 460 470 480 450 460 470 480 490 500 pF1KB7 HCQERANELMRAMKGEGSTAEVREEIILTEAAAPTPSPVPSFSPAKSATSVEVPPPSSPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 HCQERANELMRAMKGEGSTAEVREEIILTEAAAPTPSPVPSFSPAKSATSVEVPPPSSPV 490 500 510 520 530 540 510 520 530 540 550 560 pF1KB7 SNPSPEYTGLSTTGAMQSYTWSLTYTVTTAAGSPAENSQQLPCMRNTHVPSSSVTHRIPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 SNPSPEYTGLSTTGAMQSYTWSLTYTVTTAAGSPAENSQQLPCMRNTHVPSSSVTHRIPV 550 560 570 580 590 600 570 580 590 600 610 620 pF1KB7 YPHREEHGYTGSYNYGSYGNQHPHPMQSQYPALPHDTAISGPLHYAPYHRSSAQYPFNSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 YPHREEHGYTGSYNYGSYGNQHPHPMQSQYPALPHDTAISGPLHYAPYHRSSAQYPFNSP 610 620 630 640 650 660 630 640 650 660 670 680 pF1KB7 TSRMEPCLMSSTPRLHPTPVTPRWPEVPSANTCYTSPSVHSARYGNSSDMYTPLTTRRNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 TSRMEPCLMSSTPRLHPTPVTPRWPEVPSANTCYTSPSVHSARYGNSSDMYTPLTTRRNS 670 680 690 700 710 720 690 700 pF1KB7 EYEHMQHFPGFAYINGEASTGWAK :::::::::::::::::::::::: CCDS55 EYEHMQHFPGFAYINGEASTGWAK 730 740 >>CCDS5113.1 RFX6 gene_id:222546|Hs108|chr6 (928 aa) initn: 983 init1: 605 opt: 623 Z-score: 456.8 bits: 95.4 E(32554): 4e-19 Smith-Waterman score: 1261; 36.8% identity (61.1% similar) in 679 aa overlap (25-631:85-763) 10 20 30 40 50 pF1KB7 MHCGLLEEPDMDSTESWIERCLNESENKRYSSHTSLGNVSNDENEEKENNRA-- ::.. ..: : ..... .:.. CCDS51 GEQGGGEKGEDPELPGAVKSEMHLNNGNFSSEEEDADNHDSKTKAADQYLSQKKTITQIV 60 70 80 90 100 110 60 70 80 90 100 110 pF1KB7 -SKPHSTPATLQWLEENYEIAEGVCIPRSALYMHYLDFCEKNDTQPVNAASFGKIIRQQF .: ..: ::::::::: . ::::.:: :: ::::::.:. .:. ::.::: :::.: CCDS51 KDKKKQTQLTLQWLEENYIVCEGVCLPRCILYAHYLDFCRKEKLEPACAATFGKTIRQKF 120 130 140 150 160 170 120 130 140 150 160 170 pF1KB7 PQLTTRRLGTRGQSKYHYYGIAVKESSQYYDVMYSKKGAAWVSETGKKEVSKQTVAYSPR : ::::::::::.::::::::..:::: :: .:: :: . : . :. . : :: CCDS51 PLLTTRRLGTRGHSKYHYYGIGIKESSAYYHSVYSGKGLTRFSGSKLKNEGGFTRKYSLS 180 190 200 210 220 230 180 190 200 210 pF1KB7 SKLGTLLPEFPNVKDLNLPASLPEEKVSTFIMMYRTHCQRILGSS--------------- :: ::::::::... : . . ..::.:.::::.:::: :: .. CCDS51 SKTGTLLPEFPSAQHLVYQGCISKDKVDTLIMMYKTHCQCILDNAINGNFEEIQHFLLHF 240 250 260 270 280 290 220 230 240 250 260 pF1KB7 ---------------TVVNIVGVCDSILYKAISGVLMPTVLQALPDSLTQVIRKFAKQLD ....: ::::::::... ::.:...: .:.:: ::.:::. . CCDS51 WQGMPDHLLPLLENPVIIDIFCVCDSILYKVLTDVLIPATMQEMPESLLADIRNFAKNWE 300 310 320 330 340 350 270 280 290 300 310 320 pF1KB7 EWLKVALHDLPENLRNIKFELSRRFSQILRRQTSLNHLCQASRTVIHSADITFQMLEDWR .:. .:..::: : . :. . ::: . :.::::. :: : .: .. . .. .:. : . CCDS51 QWVVSSLENLPEALTDKKIPIVRRFVSSLKRQTSFLHLAQIARPALFDQHVVNSMVSDIE 360 370 380 390 400 410 330 340 350 360 370 pF1KB7 NVDLNSITKQTLYTMEDSRDEHRKLITQ-----LYQEFDHLLEEQSPIESYIEWLDTMVD :::::: .:.: :. : : . . :. ..::. ::.... .:..::::::.:. CCDS51 RVDLNSIGSQALLTISGSTDTESGIYTEHDSITVFQELKDLLKKNATVEAFIEWLDTVVE 420 430 440 450 460 470 380 390 400 410 420 430 pF1KB7 RCVVKVAAKRQGSLKKVAQQFLLMWSCFGTRVIRDMTLHSAPSFGSFHLIHLMFDDYVLY . :.:.. . :::: ::.::: :: ::.::....::..: ::::::::....:.:.: CCDS51 QRVIKTSKQNGRSLKKRAQDFLLKWSFFGARVMHNLTLNNASSFGSFHLIRMLLDEYILL 480 490 500 510 520 530 440 450 460 470 480 490 pF1KB7 LLESLHCQERANELM----RAMKG-EGSTAEVREEIILTEAAAPTPSPVPSFSPAKSATS .:. ... .::. . ::. ..: : : . . . : . .:. . CCDS51 AMETQFNNDKEQELQNLLDKYMKNSDASKAAFTASPSSCFLANRNKGSMVSSDAVKNESH 540 550 560 570 580 590 500 510 520 530 pF1KB7 VE---VPPPSS------PVSN--PSPEYTGLSTTGAMQSYTWS---LTYTVTTAAGSPAE :: .: ::: :. . :. . .. :: :. . .: .. : .: . CCDS51 VETTYLPLPSSQPGGLGPALHQFPAGNTDNMPLTGQMELSQIAGHLMTPPISPAMASRGS 600 610 620 630 640 650 540 550 560 570 580 pF1KB7 NSQQLPCM-RNTHV------PSSSVTHRIPVYPH--REEHG-YTGSYNYGSYGNQHPHPM .: : : : :: :. :.:: . .: :. : :: . .:: CCDS51 VINQGPMAGRPPSVGPVLSAPSHCSTYPEPIYPTLPQANHDFYSTSSNYQTVFRAQPHST 660 670 680 690 700 710 590 600 610 620 630 640 pF1KB7 QSQYPA-LPHDTAIS---GPLHYAPYHRSSAQYPFNS-PTSRMEPCLMSSTPRLHPTPVT .. :: : .. : . : : :.:: : : . : :.. CCDS51 SGLYPHHTEHGRCMAWTEQQLSRDFFSGSCAGSPYNSRPPSSYGPSLQAQDSHNMQFLNT 720 730 740 750 760 770 650 660 670 680 690 700 pF1KB7 PRWPEVPSANTCYTSPSVHSARYGNSSDMYTPLTTRRNSEYEHMQHFPGFAYINGEASTG CCDS51 GSFNFLSNTGAASCQGATLPPNSPNGYYGSNINYPESHRLGSMVNQHVSVISSIRSLPPY 780 790 800 810 820 830 705 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 06:34:05 2016 done: Fri Nov 4 06:34:06 2016 Total Scan time: 4.110 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]