FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2752, 1162 aa 1>>>pF1KE2752 1162 - 1162 aa - 1162 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.7034+/-0.000947; mu= 7.2608+/- 0.057 mean_var=168.8034+/-34.217, 0's: 0 Z-trim(110.7): 67 B-trim: 4 in 1/51 Lambda= 0.098715 statistics sampled from 11925 (11992) to 11925 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.359), width: 16 Scan time: 2.710 The best scores are: opt bits E(33420) CCDS892.1 TTF2 gene_id:8458|Hs109|chr1 (1162) 7700 1109.5 0 CCDS82856.1 HLTF gene_id:6596|Hs109|chr3 (1008) 430 74.1 1.9e-12 CCDS33875.1 HLTF gene_id:6596|Hs109|chr3 (1009) 430 74.1 1.9e-12 >>CCDS892.1 TTF2 gene_id:8458|Hs109|chr1 (1162 aa) initn: 7700 init1: 7700 opt: 7700 Z-score: 5931.6 bits: 1109.5 E(33420): 0 Smith-Waterman score: 7700; 100.0% identity (100.0% similar) in 1162 aa overlap (1-1162:1-1162) 10 20 30 40 50 60 pF1KE2 MEEVRCPEHGTFCFLKTGVRDGPNKGKSFYVCRADTCSFVRATDIPVSHCLLHEDFVVEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 MEEVRCPEHGTFCFLKTGVRDGPNKGKSFYVCRADTCSFVRATDIPVSHCLLHEDFVVEL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 QGLLLPQDKKEYRLFFRCIRSKAEGKRWCGSIPWQDPDSKEHSVSNKSQHASETFHHSSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 QGLLLPQDKKEYRLFFRCIRSKAEGKRWCGSIPWQDPDSKEHSVSNKSQHASETFHHSSN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 WLRNPFKVLDKNQEPALWKQLIKGEGEEKKADKKQREKGDQLFDQKKEQKPEMMEKDLSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 WLRNPFKVLDKNQEPALWKQLIKGEGEEKKADKKQREKGDQLFDQKKEQKPEMMEKDLSS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 GLVPKKKQSVVQEKKQEEGAEIQCEAETGGTHKRDFSEIKSQQCQGNELTRPSASSQEKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 GLVPKKKQSVVQEKKQEEGAEIQCEAETGGTHKRDFSEIKSQQCQGNELTRPSASSQEKS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 SGKSQDVQRESEPLREKVTQLLPQNVHSHNSISKPQKGGPLNKEYTNWEAKETKAKDGPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 SGKSQDVQRESEPLREKVTQLLPQNVHSHNSISKPQKGGPLNKEYTNWEAKETKAKDGPS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 IQATQKSLPQGHFQERPETHSVPAPGGPAAQAAPAAPGLSLGEGREAATSSDDEEEDDVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 IQATQKSLPQGHFQERPETHSVPAPGGPAAQAAPAAPGLSLGEGREAATSSDDEEEDDVV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 FVSSKPGSPLLFDSTLDLETKENLQFPDRSVQRKVSPASGVSKKVEPSDPVARRVYLTTQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 FVSSKPGSPLLFDSTLDLETKENLQFPDRSVQRKVSPASGVSKKVEPSDPVARRVYLTTQ 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 LKQKKSTLASVNIQALPDKGQKLIKQIQELEEVLSGLTLSPEQGTNEKSNSQVPQQSHFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 LKQKKSTLASVNIQALPDKGQKLIKQIQELEEVLSGLTLSPEQGTNEKSNSQVPQQSHFT 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE2 KTTTGPPHLVPPQPLPRRGTQPVGSLELKSACQVTAGGSSQCYRGHTNQDHVHAVWKITS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 KTTTGPPHLVPPQPLPRRGTQPVGSLELKSACQVTAGGSSQCYRGHTNQDHVHAVWKITS 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE2 EAIGQLHRSLESCPGETVVAEDPAGLKVPLLLHQKQALAWLLWRESQKPQGGILADDMGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 EAIGQLHRSLESCPGETVVAEDPAGLKVPLLLHQKQALAWLLWRESQKPQGGILADDMGL 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE2 GKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSCDFTSHGTLIICPASLIHHWKNEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 GKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSCDFTSHGTLIICPASLIHHWKNEV 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE2 EKRVNSNKLRVYLYHGPNRDSRARVLSTYDIVITTYSLVAKEIPTNKQEAEIPGANLNVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 EKRVNSNKLRVYLYHGPNRDSRARVLSTYDIVITTYSLVAKEIPTNKQEAEIPGANLNVE 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE2 GTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKLQACARWAVTGTPIQNNLLDMYSLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 GTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKLQACARWAVTGTPIQNNLLDMYSLL 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE2 KFLRCSPFDEFNLWRSQVDNGSKKGGERLSILTKSLLLRRTKDQLDSTGRPLVILPQRKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 KFLRCSPFDEFNLWRSQVDNGSKKGGERLSILTKSLLLRRTKDQLDSTGRPLVILPQRKF 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE2 QLHHLKLSEDEETVYNVFFARSRSALQSYLKRHESRGNQSGRSPNNPFSRVALEFGSEEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 QLHHLKLSEDEETVYNVFFARSRSALQSYLKRHESRGNQSGRSPNNPFSRVALEFGSEEP 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE2 RHSEAADSPRSSTVHILSQLLRLRQCCCHLSLLKSALDPMELKGEGLVLSLEEQLSALTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 RHSEAADSPRSSTVHILSQLLRLRQCCCHLSLLKSALDPMELKGEGLVLSLEEQLSALTL 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE2 SELRDSEPSSTVSLNGTFFKMELFEGMRESTKISSLLAELEAIQRNSASQKSVIVSQWTN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 SELRDSEPSSTVSLNGTFFKMELFEGMRESTKISSLLAELEAIQRNSASQKSVIVSQWTN 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE2 MLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSRGPQVMLISLLAGGVGLNLTGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 MLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSRGPQVMLISLLAGGVGLNLTGG 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE2 NHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFVCEGTVEEKILQLQEKKKDLAKQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS89 NHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFVCEGTVEEKILQLQEKKKDLAKQV 1090 1100 1110 1120 1130 1140 1150 1160 pF1KE2 LSGSGESVTKLTLADLRVLFGI :::::::::::::::::::::: CCDS89 LSGSGESVTKLTLADLRVLFGI 1150 1160 >>CCDS82856.1 HLTF gene_id:6596|Hs109|chr3 (1008 aa) initn: 1061 init1: 291 opt: 430 Z-score: 337.0 bits: 74.1 E(33420): 1.9e-12 Smith-Waterman score: 852; 32.0% identity (60.3% similar) in 590 aa overlap (608-1137:442-980) 580 590 600 610 620 630 pF1KE2 LAWLLWRESQKPQGGILADDMGLGKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSC :: . . .:: :. . :. .: : CCDS82 MKGKLKNVQSETKGRAKGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTD-V 420 430 440 450 460 470 640 650 660 670 680 690 pF1KE2 DFTSHGTLIICPASLIHHWKNEVEKRVNSN-KLRVYLYHGPNRDSRARVLSTYDIVITTY . . :::::: :.. .: .. ....:. .: :.:.::.: . .:: :::.::: CCDS82 EERPRTTLIICPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTY 480 490 500 510 520 530 700 710 720 730 740 750 pF1KE2 SLVAKEIPTNKQEAEIPGANLNVEGTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKL ...... : .: : :: : : :.::::.: ..:: .: . :: : CCDS82 NILTHDYGT--------------KGDS-PLHSIRWLRVILDEGHAIRNPNAQQTKAVLDL 540 550 560 570 760 770 780 790 800 810 pF1KE2 QACARWAVTGTPIQNNLLDMYSLLKFLRCSPFDEFNLWRSQVDN----GSKKGGERLSIL .. ::..:::::::.: :..:::.::. .:: . . :. .. :.. : .::. : CCDS82 ESERRWVLTGTPIQNSLKDLWSLLSFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSL 580 590 600 610 620 630 820 830 840 850 860 870 pF1KE2 TKSLLLRRTKDQLDSTGRPLVILPQRKFQLHHLKLSEDEETVYNVFFARSRSALQSYLKR :.. ::::: . :.:.. ::.:: ..:. ::..:. .: :: . CCDS82 IKNITLRRTKTS-KIKGKPVLELPERKVFIQHITLSDEERKIY-----------QSV--K 640 650 660 670 680 880 890 900 910 920 930 pF1KE2 HESRGNQSGRSPNNPFSRVALEFGSEEPRHSEAADSPRSSTVHILSQLLRLRQCCCHLSL .:.:.. :: :. :. .. :: .:. :::::: ::: : CCDS82 NEGRATI-GRYFNE---------GTVLAHY---AD--------VLGLLLRLRQICCHTYL 690 700 710 720 940 950 960 pF1KE2 LKSALD---------PMELKGE-----GLVLSL--EEQ----LSALTL------------ : .:.. : ::. . :.:: .:. :..::. CCDS82 LTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCK 730 740 750 760 770 780 970 980 990 pF1KE2 ----SELRDSEPSSTVSL-NGTFFKMELFEGMRE----------------STKISSLLAE . ... .: . : . . . .:.: : :.::..:. CCDS82 PCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHA 790 800 810 820 830 840 1000 1010 1020 1030 1040 1050 pF1KE2 LEAIQRNSASQKSVIVSQWTNMLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSR : ..... . ::..:::.:..:... . :: :.... .:::. :.:.. .. :.... CCDS82 LTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTE 850 860 870 880 890 900 1060 1070 1080 1090 1100 1110 pF1KE2 G--PQVMLISLLAGGVGLNLTGGNHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFV . : .::.:: ::::::::......::.: :::. ::: :: .:.::...:.: .:. CCDS82 AGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFI 910 920 930 940 950 960 1120 1130 1140 1150 1160 pF1KE2 CEGTVEEKILQLQEKKKDLAKQVLSGSGESVTKLTLADLRVLFGI . .:::..:..:.::..:: CCDS82 VKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL 970 980 990 1000 >>CCDS33875.1 HLTF gene_id:6596|Hs109|chr3 (1009 aa) initn: 1061 init1: 291 opt: 430 Z-score: 337.0 bits: 74.1 E(33420): 1.9e-12 Smith-Waterman score: 852; 32.0% identity (60.3% similar) in 590 aa overlap (608-1137:443-981) 580 590 600 610 620 630 pF1KE2 LAWLLWRESQKPQGGILADDMGLGKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSC :: . . .:: :. . :. .: : CCDS33 KGKLKNVQSETKGRAKAGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTD-V 420 430 440 450 460 470 640 650 660 670 680 690 pF1KE2 DFTSHGTLIICPASLIHHWKNEVEKRVNSN-KLRVYLYHGPNRDSRARVLSTYDIVITTY . . :::::: :.. .: .. ....:. .: :.:.::.: . .:: :::.::: CCDS33 EERPRTTLIICPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTY 480 490 500 510 520 530 700 710 720 730 740 750 pF1KE2 SLVAKEIPTNKQEAEIPGANLNVEGTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKL ...... : .: : :: : : :.::::.: ..:: .: . :: : CCDS33 NILTHDYGT--------------KGDS-PLHSIRWLRVILDEGHAIRNPNAQQTKAVLDL 540 550 560 570 760 770 780 790 800 810 pF1KE2 QACARWAVTGTPIQNNLLDMYSLLKFLRCSPFDEFNLWRSQVDN----GSKKGGERLSIL .. ::..:::::::.: :..:::.::. .:: . . :. .. :.. : .::. : CCDS33 ESERRWVLTGTPIQNSLKDLWSLLSFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSL 580 590 600 610 620 630 820 830 840 850 860 870 pF1KE2 TKSLLLRRTKDQLDSTGRPLVILPQRKFQLHHLKLSEDEETVYNVFFARSRSALQSYLKR :.. ::::: . :.:.. ::.:: ..:. ::..:. .: :: . CCDS33 IKNITLRRTKTS-KIKGKPVLELPERKVFIQHITLSDEERKIY-----------QSV--K 640 650 660 670 680 880 890 900 910 920 930 pF1KE2 HESRGNQSGRSPNNPFSRVALEFGSEEPRHSEAADSPRSSTVHILSQLLRLRQCCCHLSL .:.:.. :: :. :. .. :: .:. :::::: ::: : CCDS33 NEGRATI-GRYFNE---------GTVLAHY---AD--------VLGLLLRLRQICCHTYL 690 700 710 720 940 950 960 pF1KE2 LKSALD---------PMELKGE-----GLVLSL--EEQ----LSALTL------------ : .:.. : ::. . :.:: .:. :..::. CCDS33 LTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCK 730 740 750 760 770 780 970 980 990 pF1KE2 ----SELRDSEPSSTVSL-NGTFFKMELFEGMRE----------------STKISSLLAE . ... .: . : . . . .:.: : :.::..:. CCDS33 PCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHA 790 800 810 820 830 840 1000 1010 1020 1030 1040 1050 pF1KE2 LEAIQRNSASQKSVIVSQWTNMLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSR : ..... . ::..:::.:..:... . :: :.... .:::. :.:.. .. :.... CCDS33 LTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTE 850 860 870 880 890 900 1060 1070 1080 1090 1100 1110 pF1KE2 G--PQVMLISLLAGGVGLNLTGGNHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFV . : .::.:: ::::::::......::.: :::. ::: :: .:.::...:.: .:. CCDS33 AGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFI 910 920 930 940 950 960 1120 1130 1140 1150 1160 pF1KE2 CEGTVEEKILQLQEKKKDLAKQVLSGSGESVTKLTLADLRVLFGI . .:::..:..:.::..:: CCDS33 VKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL 970 980 990 1000 1162 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Aug 1 17:02:56 2019 done: Thu Aug 1 17:02:57 2019 Total Scan time: 2.710 Total Display time: 0.080 Function used was FASTA [36.3.4 Apr, 2011]