FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2752, 1162 aa
1>>>pF1KE2752 1162 - 1162 aa - 1162 aa
Library: human.CCDS.faa
18921897 residues in 33420 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.7034+/-0.000947; mu= 7.2608+/- 0.057
mean_var=168.8034+/-34.217, 0's: 0 Z-trim(110.7): 67 B-trim: 4 in 1/51
Lambda= 0.098715
statistics sampled from 11925 (11992) to 11925 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.359), width: 16
Scan time: 2.710
The best scores are: opt bits E(33420)
CCDS892.1 TTF2 gene_id:8458|Hs109|chr1 (1162) 7700 1109.5 0
CCDS82856.1 HLTF gene_id:6596|Hs109|chr3 (1008) 430 74.1 1.9e-12
CCDS33875.1 HLTF gene_id:6596|Hs109|chr3 (1009) 430 74.1 1.9e-12
>>CCDS892.1 TTF2 gene_id:8458|Hs109|chr1 (1162 aa)
initn: 7700 init1: 7700 opt: 7700 Z-score: 5931.6 bits: 1109.5 E(33420): 0
Smith-Waterman score: 7700; 100.0% identity (100.0% similar) in 1162 aa overlap (1-1162:1-1162)
10 20 30 40 50 60
pF1KE2 MEEVRCPEHGTFCFLKTGVRDGPNKGKSFYVCRADTCSFVRATDIPVSHCLLHEDFVVEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 MEEVRCPEHGTFCFLKTGVRDGPNKGKSFYVCRADTCSFVRATDIPVSHCLLHEDFVVEL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 QGLLLPQDKKEYRLFFRCIRSKAEGKRWCGSIPWQDPDSKEHSVSNKSQHASETFHHSSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 QGLLLPQDKKEYRLFFRCIRSKAEGKRWCGSIPWQDPDSKEHSVSNKSQHASETFHHSSN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 WLRNPFKVLDKNQEPALWKQLIKGEGEEKKADKKQREKGDQLFDQKKEQKPEMMEKDLSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 WLRNPFKVLDKNQEPALWKQLIKGEGEEKKADKKQREKGDQLFDQKKEQKPEMMEKDLSS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 GLVPKKKQSVVQEKKQEEGAEIQCEAETGGTHKRDFSEIKSQQCQGNELTRPSASSQEKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 GLVPKKKQSVVQEKKQEEGAEIQCEAETGGTHKRDFSEIKSQQCQGNELTRPSASSQEKS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 SGKSQDVQRESEPLREKVTQLLPQNVHSHNSISKPQKGGPLNKEYTNWEAKETKAKDGPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 SGKSQDVQRESEPLREKVTQLLPQNVHSHNSISKPQKGGPLNKEYTNWEAKETKAKDGPS
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 IQATQKSLPQGHFQERPETHSVPAPGGPAAQAAPAAPGLSLGEGREAATSSDDEEEDDVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 IQATQKSLPQGHFQERPETHSVPAPGGPAAQAAPAAPGLSLGEGREAATSSDDEEEDDVV
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 FVSSKPGSPLLFDSTLDLETKENLQFPDRSVQRKVSPASGVSKKVEPSDPVARRVYLTTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 FVSSKPGSPLLFDSTLDLETKENLQFPDRSVQRKVSPASGVSKKVEPSDPVARRVYLTTQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 LKQKKSTLASVNIQALPDKGQKLIKQIQELEEVLSGLTLSPEQGTNEKSNSQVPQQSHFT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 LKQKKSTLASVNIQALPDKGQKLIKQIQELEEVLSGLTLSPEQGTNEKSNSQVPQQSHFT
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE2 KTTTGPPHLVPPQPLPRRGTQPVGSLELKSACQVTAGGSSQCYRGHTNQDHVHAVWKITS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 KTTTGPPHLVPPQPLPRRGTQPVGSLELKSACQVTAGGSSQCYRGHTNQDHVHAVWKITS
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE2 EAIGQLHRSLESCPGETVVAEDPAGLKVPLLLHQKQALAWLLWRESQKPQGGILADDMGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 EAIGQLHRSLESCPGETVVAEDPAGLKVPLLLHQKQALAWLLWRESQKPQGGILADDMGL
550 560 570 580 590 600
610 620 630 640 650 660
pF1KE2 GKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSCDFTSHGTLIICPASLIHHWKNEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 GKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSCDFTSHGTLIICPASLIHHWKNEV
610 620 630 640 650 660
670 680 690 700 710 720
pF1KE2 EKRVNSNKLRVYLYHGPNRDSRARVLSTYDIVITTYSLVAKEIPTNKQEAEIPGANLNVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 EKRVNSNKLRVYLYHGPNRDSRARVLSTYDIVITTYSLVAKEIPTNKQEAEIPGANLNVE
670 680 690 700 710 720
730 740 750 760 770 780
pF1KE2 GTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKLQACARWAVTGTPIQNNLLDMYSLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 GTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKLQACARWAVTGTPIQNNLLDMYSLL
730 740 750 760 770 780
790 800 810 820 830 840
pF1KE2 KFLRCSPFDEFNLWRSQVDNGSKKGGERLSILTKSLLLRRTKDQLDSTGRPLVILPQRKF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 KFLRCSPFDEFNLWRSQVDNGSKKGGERLSILTKSLLLRRTKDQLDSTGRPLVILPQRKF
790 800 810 820 830 840
850 860 870 880 890 900
pF1KE2 QLHHLKLSEDEETVYNVFFARSRSALQSYLKRHESRGNQSGRSPNNPFSRVALEFGSEEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 QLHHLKLSEDEETVYNVFFARSRSALQSYLKRHESRGNQSGRSPNNPFSRVALEFGSEEP
850 860 870 880 890 900
910 920 930 940 950 960
pF1KE2 RHSEAADSPRSSTVHILSQLLRLRQCCCHLSLLKSALDPMELKGEGLVLSLEEQLSALTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 RHSEAADSPRSSTVHILSQLLRLRQCCCHLSLLKSALDPMELKGEGLVLSLEEQLSALTL
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KE2 SELRDSEPSSTVSLNGTFFKMELFEGMRESTKISSLLAELEAIQRNSASQKSVIVSQWTN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 SELRDSEPSSTVSLNGTFFKMELFEGMRESTKISSLLAELEAIQRNSASQKSVIVSQWTN
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KE2 MLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSRGPQVMLISLLAGGVGLNLTGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 MLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSRGPQVMLISLLAGGVGLNLTGG
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KE2 NHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFVCEGTVEEKILQLQEKKKDLAKQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS89 NHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFVCEGTVEEKILQLQEKKKDLAKQV
1090 1100 1110 1120 1130 1140
1150 1160
pF1KE2 LSGSGESVTKLTLADLRVLFGI
::::::::::::::::::::::
CCDS89 LSGSGESVTKLTLADLRVLFGI
1150 1160
>>CCDS82856.1 HLTF gene_id:6596|Hs109|chr3 (1008 aa)
initn: 1061 init1: 291 opt: 430 Z-score: 337.0 bits: 74.1 E(33420): 1.9e-12
Smith-Waterman score: 852; 32.0% identity (60.3% similar) in 590 aa overlap (608-1137:442-980)
580 590 600 610 620 630
pF1KE2 LAWLLWRESQKPQGGILADDMGLGKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSC
:: . . .:: :. . :. .: :
CCDS82 MKGKLKNVQSETKGRAKGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTD-V
420 430 440 450 460 470
640 650 660 670 680 690
pF1KE2 DFTSHGTLIICPASLIHHWKNEVEKRVNSN-KLRVYLYHGPNRDSRARVLSTYDIVITTY
. . :::::: :.. .: .. ....:. .: :.:.::.: . .:: :::.:::
CCDS82 EERPRTTLIICPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTY
480 490 500 510 520 530
700 710 720 730 740 750
pF1KE2 SLVAKEIPTNKQEAEIPGANLNVEGTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKL
...... : .: : :: : : :.::::.: ..:: .: . :: :
CCDS82 NILTHDYGT--------------KGDS-PLHSIRWLRVILDEGHAIRNPNAQQTKAVLDL
540 550 560 570
760 770 780 790 800 810
pF1KE2 QACARWAVTGTPIQNNLLDMYSLLKFLRCSPFDEFNLWRSQVDN----GSKKGGERLSIL
.. ::..:::::::.: :..:::.::. .:: . . :. .. :.. : .::. :
CCDS82 ESERRWVLTGTPIQNSLKDLWSLLSFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSL
580 590 600 610 620 630
820 830 840 850 860 870
pF1KE2 TKSLLLRRTKDQLDSTGRPLVILPQRKFQLHHLKLSEDEETVYNVFFARSRSALQSYLKR
:.. ::::: . :.:.. ::.:: ..:. ::..:. .: :: .
CCDS82 IKNITLRRTKTS-KIKGKPVLELPERKVFIQHITLSDEERKIY-----------QSV--K
640 650 660 670 680
880 890 900 910 920 930
pF1KE2 HESRGNQSGRSPNNPFSRVALEFGSEEPRHSEAADSPRSSTVHILSQLLRLRQCCCHLSL
.:.:.. :: :. :. .. :: .:. :::::: ::: :
CCDS82 NEGRATI-GRYFNE---------GTVLAHY---AD--------VLGLLLRLRQICCHTYL
690 700 710 720
940 950 960
pF1KE2 LKSALD---------PMELKGE-----GLVLSL--EEQ----LSALTL------------
: .:.. : ::. . :.:: .:. :..::.
CCDS82 LTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCK
730 740 750 760 770 780
970 980 990
pF1KE2 ----SELRDSEPSSTVSL-NGTFFKMELFEGMRE----------------STKISSLLAE
. ... .: . : . . . .:.: : :.::..:.
CCDS82 PCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHA
790 800 810 820 830 840
1000 1010 1020 1030 1040 1050
pF1KE2 LEAIQRNSASQKSVIVSQWTNMLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSR
: ..... . ::..:::.:..:... . :: :.... .:::. :.:.. .. :....
CCDS82 LTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTE
850 860 870 880 890 900
1060 1070 1080 1090 1100 1110
pF1KE2 G--PQVMLISLLAGGVGLNLTGGNHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFV
. : .::.:: ::::::::......::.: :::. ::: :: .:.::...:.: .:.
CCDS82 AGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFI
910 920 930 940 950 960
1120 1130 1140 1150 1160
pF1KE2 CEGTVEEKILQLQEKKKDLAKQVLSGSGESVTKLTLADLRVLFGI
. .:::..:..:.::..::
CCDS82 VKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL
970 980 990 1000
>>CCDS33875.1 HLTF gene_id:6596|Hs109|chr3 (1009 aa)
initn: 1061 init1: 291 opt: 430 Z-score: 337.0 bits: 74.1 E(33420): 1.9e-12
Smith-Waterman score: 852; 32.0% identity (60.3% similar) in 590 aa overlap (608-1137:443-981)
580 590 600 610 620 630
pF1KE2 LAWLLWRESQKPQGGILADDMGLGKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSC
:: . . .:: :. . :. .: :
CCDS33 KGKLKNVQSETKGRAKAGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTD-V
420 430 440 450 460 470
640 650 660 670 680 690
pF1KE2 DFTSHGTLIICPASLIHHWKNEVEKRVNSN-KLRVYLYHGPNRDSRARVLSTYDIVITTY
. . :::::: :.. .: .. ....:. .: :.:.::.: . .:: :::.:::
CCDS33 EERPRTTLIICPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTY
480 490 500 510 520 530
700 710 720 730 740 750
pF1KE2 SLVAKEIPTNKQEAEIPGANLNVEGTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKL
...... : .: : :: : : :.::::.: ..:: .: . :: :
CCDS33 NILTHDYGT--------------KGDS-PLHSIRWLRVILDEGHAIRNPNAQQTKAVLDL
540 550 560 570
760 770 780 790 800 810
pF1KE2 QACARWAVTGTPIQNNLLDMYSLLKFLRCSPFDEFNLWRSQVDN----GSKKGGERLSIL
.. ::..:::::::.: :..:::.::. .:: . . :. .. :.. : .::. :
CCDS33 ESERRWVLTGTPIQNSLKDLWSLLSFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSL
580 590 600 610 620 630
820 830 840 850 860 870
pF1KE2 TKSLLLRRTKDQLDSTGRPLVILPQRKFQLHHLKLSEDEETVYNVFFARSRSALQSYLKR
:.. ::::: . :.:.. ::.:: ..:. ::..:. .: :: .
CCDS33 IKNITLRRTKTS-KIKGKPVLELPERKVFIQHITLSDEERKIY-----------QSV--K
640 650 660 670 680
880 890 900 910 920 930
pF1KE2 HESRGNQSGRSPNNPFSRVALEFGSEEPRHSEAADSPRSSTVHILSQLLRLRQCCCHLSL
.:.:.. :: :. :. .. :: .:. :::::: ::: :
CCDS33 NEGRATI-GRYFNE---------GTVLAHY---AD--------VLGLLLRLRQICCHTYL
690 700 710 720
940 950 960
pF1KE2 LKSALD---------PMELKGE-----GLVLSL--EEQ----LSALTL------------
: .:.. : ::. . :.:: .:. :..::.
CCDS33 LTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCK
730 740 750 760 770 780
970 980 990
pF1KE2 ----SELRDSEPSSTVSL-NGTFFKMELFEGMRE----------------STKISSLLAE
. ... .: . : . . . .:.: : :.::..:.
CCDS33 PCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHA
790 800 810 820 830 840
1000 1010 1020 1030 1040 1050
pF1KE2 LEAIQRNSASQKSVIVSQWTNMLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSR
: ..... . ::..:::.:..:... . :: :.... .:::. :.:.. .. :....
CCDS33 LTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTE
850 860 870 880 890 900
1060 1070 1080 1090 1100 1110
pF1KE2 G--PQVMLISLLAGGVGLNLTGGNHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFV
. : .::.:: ::::::::......::.: :::. ::: :: .:.::...:.: .:.
CCDS33 AGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFI
910 920 930 940 950 960
1120 1130 1140 1150 1160
pF1KE2 CEGTVEEKILQLQEKKKDLAKQVLSGSGESVTKLTLADLRVLFGI
. .:::..:..:.::..::
CCDS33 VKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL
970 980 990 1000
1162 residues in 1 query sequences
18921897 residues in 33420 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Aug 1 17:02:56 2019 done: Thu Aug 1 17:02:57 2019
Total Scan time: 2.710 Total Display time: 0.080
Function used was FASTA [36.3.4 Apr, 2011]