FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4899, 469 aa
1>>>pF1KB4899 469 - 469 aa - 469 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0131+/-0.00038; mu= 14.7175+/- 0.024
mean_var=92.1771+/-19.942, 0's: 0 Z-trim(114.5): 156 B-trim: 2443 in 2/51
Lambda= 0.133587
statistics sampled from 24228 (24438) to 24228 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.666), E-opt: 0.2 (0.287), width: 16
Scan time: 8.010
The best scores are: opt bits E(85289)
NP_005230 (OMIM: 164740) protein C-ets-2 isoform 1 ( 469) 3220 631.0 2.1e-180
XP_016883779 (OMIM: 164740) PREDICTED: protein C-e ( 469) 3220 631.0 2.1e-180
XP_005260992 (OMIM: 164740) PREDICTED: protein C-e ( 469) 3220 631.0 2.1e-180
NP_001243224 (OMIM: 164740) protein C-ets-2 isofor ( 609) 3220 631.1 2.6e-180
XP_016872804 (OMIM: 164720) PREDICTED: protein C-e ( 450) 1513 302.0 2.2e-81
NP_001137292 (OMIM: 164720) protein C-ets-1 isofor ( 485) 1513 302.0 2.3e-81
XP_016872803 (OMIM: 164720) PREDICTED: protein C-e ( 485) 1513 302.0 2.3e-81
NP_005229 (OMIM: 164720) protein C-ets-1 isoform 2 ( 441) 1467 293.1 1e-78
XP_011540952 (OMIM: 164720) PREDICTED: protein C-e ( 475) 1465 292.8 1.4e-78
XP_016872805 (OMIM: 164720) PREDICTED: protein C-e ( 418) 1464 292.5 1.5e-78
XP_011540951 (OMIM: 164720) PREDICTED: protein C-e ( 519) 1465 292.8 1.5e-78
NP_001155894 (OMIM: 164720) protein C-ets-1 isofor ( 225) 986 200.2 4.8e-51
NP_001317380 (OMIM: 164720) protein C-ets-1 isofor ( 354) 745 153.9 6.6e-37
XP_016872806 (OMIM: 164720) PREDICTED: protein C-e ( 398) 745 154.0 7.2e-37
XP_011540953 (OMIM: 164720) PREDICTED: protein C-e ( 432) 745 154.0 7.7e-37
NP_001257941 (OMIM: 193067) Friend leukemia integr ( 259) 461 99.1 1.5e-20
NP_001257939 (OMIM: 193067) Friend leukemia integr ( 386) 461 99.2 2.1e-20
XP_016872895 (OMIM: 193067) PREDICTED: Friend leuk ( 419) 461 99.2 2.3e-20
XP_016872894 (OMIM: 193067) PREDICTED: Friend leuk ( 419) 461 99.2 2.3e-20
NP_001161153 (OMIM: 193067) Friend leukemia integr ( 419) 461 99.2 2.3e-20
XP_011541003 (OMIM: 193067) PREDICTED: Friend leuk ( 419) 461 99.2 2.3e-20
NP_002008 (OMIM: 193067) Friend leukemia integrati ( 452) 461 99.3 2.4e-20
XP_011527823 (OMIM: 600609) PREDICTED: GA-binding ( 454) 457 98.5 4.1e-20
XP_011527822 (OMIM: 600609) PREDICTED: GA-binding ( 454) 457 98.5 4.1e-20
XP_016883802 (OMIM: 600609) PREDICTED: GA-binding ( 454) 457 98.5 4.1e-20
NP_002031 (OMIM: 600609) GA-binding protein alpha ( 454) 457 98.5 4.1e-20
NP_001184226 (OMIM: 600609) GA-binding protein alp ( 454) 457 98.5 4.1e-20
XP_005260995 (OMIM: 600609) PREDICTED: GA-binding ( 454) 457 98.5 4.1e-20
NP_891548 (OMIM: 165080) transcriptional regulator ( 479) 453 97.7 7.3e-20
NP_001129626 (OMIM: 165080) transcriptional regula ( 486) 453 97.7 7.4e-20
NP_001230357 (OMIM: 165080) transcriptional regula ( 486) 453 97.7 7.4e-20
NP_001230358 (OMIM: 165080) transcriptional regula ( 363) 451 97.3 7.6e-20
NP_001317954 (OMIM: 165080) transcriptional regula ( 455) 451 97.3 9.1e-20
NP_004440 (OMIM: 165080) transcriptional regulator ( 462) 451 97.3 9.3e-20
NP_059991 (OMIM: 607150) protein FEV [Homo sapiens ( 238) 437 94.4 3.5e-19
NP_001129627 (OMIM: 165080) transcriptional regula ( 387) 435 94.2 6.8e-19
XP_016883778 (OMIM: 165080) PREDICTED: transcripti ( 428) 435 94.2 7.4e-19
XP_011527788 (OMIM: 165080) PREDICTED: transcripti ( 435) 435 94.2 7.5e-19
XP_016883776 (OMIM: 165080) PREDICTED: transcripti ( 459) 435 94.2 7.8e-19
XP_011541004 (OMIM: 193067) PREDICTED: Friend leuk ( 410) 431 93.4 1.2e-18
NP_001156619 (OMIM: 600541) ETS translocation vari ( 454) 429 93.1 1.7e-18
NP_001156624 (OMIM: 600541) ETS translocation vari ( 374) 423 91.9 3.3e-18
NP_001156623 (OMIM: 600541) ETS translocation vari ( 419) 423 91.9 3.6e-18
NP_001156622 (OMIM: 600541) ETS translocation vari ( 437) 423 91.9 3.7e-18
NP_001156620 (OMIM: 600541) ETS translocation vari ( 459) 423 91.9 3.9e-18
NP_001156621 (OMIM: 600541) ETS translocation vari ( 459) 423 91.9 3.9e-18
XP_011513472 (OMIM: 600541) PREDICTED: ETS translo ( 472) 423 91.9 4e-18
XP_005249692 (OMIM: 600541) PREDICTED: ETS translo ( 477) 423 91.9 4e-18
NP_004947 (OMIM: 600541) ETS translocation variant ( 477) 423 91.9 4e-18
XP_005249693 (OMIM: 600541) PREDICTED: ETS translo ( 477) 423 91.9 4e-18
>>NP_005230 (OMIM: 164740) protein C-ets-2 isoform 1 [Ho (469 aa)
initn: 3220 init1: 3220 opt: 3220 Z-score: 3359.7 bits: 631.0 E(85289): 2.1e-180
Smith-Waterman score: 3220; 100.0% identity (100.0% similar) in 469 aa overlap (1-469:1-469)
10 20 30 40 50 60
pF1KB4 MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSIS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 HDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 HDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 EFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 EFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 NSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 NSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB4 KSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 KSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQS
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB4 SLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 SLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB4 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
370 380 390 400 410 420
430 440 450 460
pF1KB4 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
:::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
430 440 450 460
>>XP_016883779 (OMIM: 164740) PREDICTED: protein C-ets-2 (469 aa)
initn: 3220 init1: 3220 opt: 3220 Z-score: 3359.7 bits: 631.0 E(85289): 2.1e-180
Smith-Waterman score: 3220; 100.0% identity (100.0% similar) in 469 aa overlap (1-469:1-469)
10 20 30 40 50 60
pF1KB4 MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSIS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 HDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 HDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 EFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 EFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 NSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 NSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB4 KSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 KSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQS
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB4 SLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 SLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB4 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
370 380 390 400 410 420
430 440 450 460
pF1KB4 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
:::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
430 440 450 460
>>XP_005260992 (OMIM: 164740) PREDICTED: protein C-ets-2 (469 aa)
initn: 3220 init1: 3220 opt: 3220 Z-score: 3359.7 bits: 631.0 E(85289): 2.1e-180
Smith-Waterman score: 3220; 100.0% identity (100.0% similar) in 469 aa overlap (1-469:1-469)
10 20 30 40 50 60
pF1KB4 MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSIS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 HDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 HDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 EFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 EFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 NSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 NSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB4 KSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 KSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQS
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB4 SLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 SLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB4 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
370 380 390 400 410 420
430 440 450 460
pF1KB4 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
:::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
430 440 450 460
>>NP_001243224 (OMIM: 164740) protein C-ets-2 isoform 2 (609 aa)
initn: 3220 init1: 3220 opt: 3220 Z-score: 3358.1 bits: 631.1 E(85289): 2.6e-180
Smith-Waterman score: 3220; 100.0% identity (100.0% similar) in 469 aa overlap (1-469:141-609)
10 20 30
pF1KB4 MNDFGIKNMDQVAPVANSYRGTLKRQPAFD
::::::::::::::::::::::::::::::
NP_001 PSPSPGAHRAAAGAEQPPSRPSAGPARSGRMNDFGIKNMDQVAPVANSYRGTLKRQPAFD
120 130 140 150 160 170
40 50 60 70 80 90
pF1KB4 TFDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANCELPLLTPCSKAVMSQALKATFSGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TFDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANCELPLLTPCSKAVMSQALKATFSGF
180 190 200 210 220 230
100 110 120 130 140 150
pF1KB4 KKEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGKERFLEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KKEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGKERFLEL
240 250 260 270 280 290
160 170 180 190 200 210
pF1KB4 APDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPHWINSNTLGFGTEQAPYGMQTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 APDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPHWINSNTLGFGTEQAPYGMQTQ
300 310 320 330 340 350
220 230 240 250 260 270
pF1KB4 NYPKGGLLDSMCPASTPSVLSSEQEFQMFPKSRLSSVSVTYCSVSQDFPGSNLNLLTNNS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 NYPKGGLLDSMCPASTPSVLSSEQEFQMFPKSRLSSVSVTYCSVSQDFPGSNLNLLTNNS
360 370 380 390 400 410
280 290 300 310 320 330
pF1KB4 GTPKDHDSPENGADSFESSDSLLQSWNSQSSLLDVQRVPSFESFEDDCSQSLCLNKPTMS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GTPKDHDSPENGADSFESSDSLLQSWNSQSSLLDVQRVPSFESFEDDCSQSLCLNKPTMS
420 430 440 450 460 470
340 350 360 370 380 390
pF1KB4 FKDYIQERSDPVEQGKPVIPAAVLAGFTGSGPIQLWQFLLELLSDKSCQSFISWTGDGWE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FKDYIQERSDPVEQGKPVIPAAVLAGFTGSGPIQLWQFLLELLSDKSCQSFISWTGDGWE
480 490 500 510 520 530
400 410 420 430 440 450
pF1KB4 FKLADPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTSGKRYVYRFVCDLQNLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FKLADPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTSGKRYVYRFVCDLQNLL
540 550 560 570 580 590
460
pF1KB4 GFTPEELHAILGVQPDTED
:::::::::::::::::::
NP_001 GFTPEELHAILGVQPDTED
600
>>XP_016872804 (OMIM: 164720) PREDICTED: protein C-ets-1 (450 aa)
initn: 1464 init1: 877 opt: 1513 Z-score: 1582.0 bits: 302.0 E(85289): 2.2e-81
Smith-Waterman score: 1519; 54.7% identity (76.5% similar) in 439 aa overlap (45-469:20-450)
20 30 40 50 60 70
pF1KB4 VANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANCELPLLTPC
.:.. :::::::. : ..:::::
XP_016 MNCGFQSNYHQQRPCYPFWDEMATQEVPTGLEHCVSDMECADVPLLTPS
10 20 30 40
80 90 100 110 120 130
pF1KB4 SKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGM
:: .:::::::::::: :::.::::::.: :.: .: .:..::.::::: .:..:.: :
XP_016 SKEMMSQALKATFSGFTKEQQRLGIPKDPRQWTETHVRDWVMWAVNEFSLKGVDFQKFCM
50 60 70 80 90 100
140 150 160 170 180 190
pF1KB4 NGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPH--WIN
:: :: :::. :::::::::::::::::: . ::. . :. :. . :. . .
XP_016 NGAALCALGKDCFLELAPDFVGDILWEHLEILQKEDVK----PYQVNGVNPAYPESRYTS
110 120 130 140 150 160
200 210 220 230 240
pF1KB4 SNTLGFGTEQA----PYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFPKSRL----
. ...: :.: : .. .. . ... : :. .:: . : . .:. :
XP_016 DYFISYGIEHAQCVPPSEFSEPSFITESY-QTLHPISSEELLSLKYE-NDYPSVILRDPL
170 180 190 200 210 220
250 260 270 280 290 300
pF1KB4 --SSVSVTYCSVSQDF-PGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQSS
.... : ...:. .:. . .. : .:: :. .:..: : : :::.::::
XP_016 QTDTLQNDYFAIKQEVVTPDNMCMGRTSRGKLGGQDSFES-IESYDSCDRLTQSWSSQSS
230 240 250 260 270 280
310 320 330 340 350 360
pF1KB4 LLDVQRVPSFESFE-DDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
. ..:::::..::. .: .: .:: .::::...:.: ... :::::::.:::.:::
XP_016 FNSLQRVPSYDSFDSEDYPAALPNHKPKGTFKDYVRDRAD-LNKDKPVIPAAALAGYTGS
290 300 310 320 330 340
370 380 390 400 410 420
pF1KB4 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
:::::::::::::.:::::::::::::::::::.::::::::::::::::::::::::::
XP_016 GPIQLWQFLLELLTDKSCQSFISWTGDGWEFKLSDPDEVARRWGKRKNKPKMNYEKLSRG
350 360 370 380 390 400
430 440 450 460
pF1KB4 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
:::::::::::::.:::::::::::::.:::.:::::::.: :.::...
XP_016 LRYYYDKNIIHKTAGKRYVYRFVCDLQSLLGYTPEELHAMLDVKPDADE
410 420 430 440 450
>>NP_001137292 (OMIM: 164720) protein C-ets-1 isoform 1 (485 aa)
initn: 1464 init1: 877 opt: 1513 Z-score: 1581.6 bits: 302.0 E(85289): 2.3e-81
Smith-Waterman score: 1519; 54.7% identity (76.5% similar) in 439 aa overlap (45-469:55-485)
20 30 40 50 60 70
pF1KB4 VANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANCELPLLTPC
.:.. :::::::. : ..:::::
NP_001 QGPSNTYEDPRMNCGFQSNYHQQRPCYPFWDEMATQEVPTGLEHCVSDMECADVPLLTPS
30 40 50 60 70 80
80 90 100 110 120 130
pF1KB4 SKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGM
:: .:::::::::::: :::.::::::.: :.: .: .:..::.::::: .:..:.: :
NP_001 SKEMMSQALKATFSGFTKEQQRLGIPKDPRQWTETHVRDWVMWAVNEFSLKGVDFQKFCM
90 100 110 120 130 140
140 150 160 170 180 190
pF1KB4 NGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPH--WIN
:: :: :::. :::::::::::::::::: . ::. . :. :. . :. . .
NP_001 NGAALCALGKDCFLELAPDFVGDILWEHLEILQKEDVK----PYQVNGVNPAYPESRYTS
150 160 170 180 190 200
200 210 220 230 240
pF1KB4 SNTLGFGTEQA----PYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFPKSRL----
. ...: :.: : .. .. . ... : :. .:: . : . .:. :
NP_001 DYFISYGIEHAQCVPPSEFSEPSFITESY-QTLHPISSEELLSLKYE-NDYPSVILRDPL
210 220 230 240 250
250 260 270 280 290 300
pF1KB4 --SSVSVTYCSVSQDF-PGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQSS
.... : ...:. .:. . .. : .:: :. .:..: : : :::.::::
NP_001 QTDTLQNDYFAIKQEVVTPDNMCMGRTSRGKLGGQDSFES-IESYDSCDRLTQSWSSQSS
260 270 280 290 300 310
310 320 330 340 350 360
pF1KB4 LLDVQRVPSFESFE-DDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
. ..:::::..::. .: .: .:: .::::...:.: ... :::::::.:::.:::
NP_001 FNSLQRVPSYDSFDSEDYPAALPNHKPKGTFKDYVRDRAD-LNKDKPVIPAAALAGYTGS
320 330 340 350 360 370
370 380 390 400 410 420
pF1KB4 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
:::::::::::::.:::::::::::::::::::.::::::::::::::::::::::::::
NP_001 GPIQLWQFLLELLTDKSCQSFISWTGDGWEFKLSDPDEVARRWGKRKNKPKMNYEKLSRG
380 390 400 410 420 430
430 440 450 460
pF1KB4 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
:::::::::::::.:::::::::::::.:::.:::::::.: :.::...
NP_001 LRYYYDKNIIHKTAGKRYVYRFVCDLQSLLGYTPEELHAMLDVKPDADE
440 450 460 470 480
>>XP_016872803 (OMIM: 164720) PREDICTED: protein C-ets-1 (485 aa)
initn: 1464 init1: 877 opt: 1513 Z-score: 1581.6 bits: 302.0 E(85289): 2.3e-81
Smith-Waterman score: 1519; 54.7% identity (76.5% similar) in 439 aa overlap (45-469:55-485)
20 30 40 50 60 70
pF1KB4 VANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANCELPLLTPC
.:.. :::::::. : ..:::::
XP_016 QGPSNTYEDPRMNCGFQSNYHQQRPCYPFWDEMATQEVPTGLEHCVSDMECADVPLLTPS
30 40 50 60 70 80
80 90 100 110 120 130
pF1KB4 SKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGM
:: .:::::::::::: :::.::::::.: :.: .: .:..::.::::: .:..:.: :
XP_016 SKEMMSQALKATFSGFTKEQQRLGIPKDPRQWTETHVRDWVMWAVNEFSLKGVDFQKFCM
90 100 110 120 130 140
140 150 160 170 180 190
pF1KB4 NGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPH--WIN
:: :: :::. :::::::::::::::::: . ::. . :. :. . :. . .
XP_016 NGAALCALGKDCFLELAPDFVGDILWEHLEILQKEDVK----PYQVNGVNPAYPESRYTS
150 160 170 180 190 200
200 210 220 230 240
pF1KB4 SNTLGFGTEQA----PYGMQTQNYPKGGLLDSMCPASTPSVLSSEQEFQMFPKSRL----
. ...: :.: : .. .. . ... : :. .:: . : . .:. :
XP_016 DYFISYGIEHAQCVPPSEFSEPSFITESY-QTLHPISSEELLSLKYE-NDYPSVILRDPL
210 220 230 240 250
250 260 270 280 290 300
pF1KB4 --SSVSVTYCSVSQDF-PGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQSS
.... : ...:. .:. . .. : .:: :. .:..: : : :::.::::
XP_016 QTDTLQNDYFAIKQEVVTPDNMCMGRTSRGKLGGQDSFES-IESYDSCDRLTQSWSSQSS
260 270 280 290 300 310
310 320 330 340 350 360
pF1KB4 LLDVQRVPSFESFE-DDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGS
. ..:::::..::. .: .: .:: .::::...:.: ... :::::::.:::.:::
XP_016 FNSLQRVPSYDSFDSEDYPAALPNHKPKGTFKDYVRDRAD-LNKDKPVIPAAALAGYTGS
320 330 340 350 360 370
370 380 390 400 410 420
pF1KB4 GPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRG
:::::::::::::.:::::::::::::::::::.::::::::::::::::::::::::::
XP_016 GPIQLWQFLLELLTDKSCQSFISWTGDGWEFKLSDPDEVARRWGKRKNKPKMNYEKLSRG
380 390 400 410 420 430
430 440 450 460
pF1KB4 LRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
:::::::::::::.:::::::::::::.:::.:::::::.: :.::...
XP_016 LRYYYDKNIIHKTAGKRYVYRFVCDLQSLLGYTPEELHAMLDVKPDADE
440 450 460 470 480
>>NP_005229 (OMIM: 164720) protein C-ets-1 isoform 2 [Ho (441 aa)
initn: 1413 init1: 877 opt: 1467 Z-score: 1534.3 bits: 293.1 E(85289): 1e-78
Smith-Waterman score: 1473; 54.7% identity (77.1% similar) in 424 aa overlap (60-469:26-441)
30 40 50 60 70 80
pF1KB4 DTFDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANCELPLLTPCSKAVMSQALKATFSG
: : ..::::: :: .:::::::::::
NP_005 MKAAVDLKPTLTIIKTEKVDLELFPSPDMECADVPLLTPSSKEMMSQALKATFSG
10 20 30 40 50
90 100 110 120 130 140
pF1KB4 FKKEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGKERFLE
: :::.::::::.: :.: .: .:..::.::::: .:..:.: ::: :: :::. :::
NP_005 FTKEQQRLGIPKDPRQWTETHVRDWVMWAVNEFSLKGVDFQKFCMNGAALCALGKDCFLE
60 70 80 90 100 110
150 160 170 180 190 200
pF1KB4 LAPDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPH--WINSNTLGFGTEQAPYGM
::::::::::::::: . ::. . :. :. . :. . .. ...: :.: .
NP_005 LAPDFVGDILWEHLEILQKEDVKP----YQVNGVNPAYPESRYTSDYFISYGIEHAQC-V
120 130 140 150 160 170
210 220 230 240 250
pF1KB4 QTQNYPKGGLL----DSMCPASTPSVLSSEQEFQMFPKSRL------SSVSVTYCSVSQD
... . ... ... : :. .:: . : . .:. : .... : ...:.
NP_005 PPSEFSEPSFITESYQTLHPISSEELLSLKYE-NDYPSVILRDPLQTDTLQNDYFAIKQE
180 190 200 210 220
260 270 280 290 300 310
pF1KB4 F-PGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQSSLLDVQRVPSFESFE-
.:. . .. : .:: :. .:..: : : :::.::::. ..:::::..::.
NP_005 VVTPDNMCMGRTSRGKLGGQDSFES-IESYDSCDRLTQSWSSQSSFNSLQRVPSYDSFDS
230 240 250 260 270 280
320 330 340 350 360 370
pF1KB4 DDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGSGPIQLWQFLLELLSD
.: .: .:: .::::...:.: ... :::::::.:::.::::::::::::::::.:
NP_005 EDYPAALPNHKPKGTFKDYVRDRAD-LNKDKPVIPAAALAGYTGSGPIQLWQFLLELLTD
290 300 310 320 330 340
380 390 400 410 420 430
pF1KB4 KSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTSG
::::::::::::::::::.:::::::::::::::::::::::::::::::::::::::.:
NP_005 KSCQSFISWTGDGWEFKLSDPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTAG
350 360 370 380 390 400
440 450 460
pF1KB4 KRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
::::::::::::.:::.:::::::.: :.::...
NP_005 KRYVYRFVCDLQSLLGYTPEELHAMLDVKPDADE
410 420 430 440
>>XP_011540952 (OMIM: 164720) PREDICTED: protein C-ets-1 (475 aa)
initn: 1413 init1: 877 opt: 1465 Z-score: 1531.7 bits: 292.8 E(85289): 1.4e-78
Smith-Waterman score: 1471; 53.8% identity (77.4% similar) in 429 aa overlap (56-469:55-475)
30 40 50 60 70 80
pF1KB4 QPAFDTFDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANC-ELPLLTPCSKAVMSQALK
. .. . . .: ..::::: :: .::::::
XP_011 PSPGLNSGAAELQLLSFVFPASLPCNGQGLIGTMCEPDMECADVPLLTPSSKEMMSQALK
30 40 50 60 70 80
90 100 110 120 130 140
pF1KB4 ATFSGFKKEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGK
:::::: :::.::::::.: :.: .: .:..::.::::: .:..:.: ::: :: :::
XP_011 ATFSGFTKEQQRLGIPKDPRQWTETHVRDWVMWAVNEFSLKGVDFQKFCMNGAALCALGK
90 100 110 120 130 140
150 160 170 180 190 200
pF1KB4 ERFLELAPDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPH--WINSNTLGFGTEQ
. :::::::::::::::::: . ::. . :. :. . :. . .. ...: :.
XP_011 DCFLELAPDFVGDILWEHLEILQKEDVKP----YQVNGVNPAYPESRYTSDYFISYGIEH
150 160 170 180 190 200
210 220 230 240 250
pF1KB4 APYGMQTQNYPKGGLL----DSMCPASTPSVLSSEQEFQMFPKSRL------SSVSVTYC
: . ... . ... ... : :. .:: . : . .:. : .... :
XP_011 AQC-VPPSEFSEPSFITESYQTLHPISSEELLSLKYE-NDYPSVILRDPLQTDTLQNDYF
210 220 230 240 250
260 270 280 290 300 310
pF1KB4 SVSQDF-PGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQSSLLDVQRVPSF
...:. .:. . .. : .:: :. .:..: : : :::.::::. ..:::::.
XP_011 AIKQEVVTPDNMCMGRTSRGKLGGQDSFES-IESYDSCDRLTQSWSSQSSFNSLQRVPSY
260 270 280 290 300 310
320 330 340 350 360 370
pF1KB4 ESFE-DDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGSGPIQLWQFLL
.::. .: .: .:: .::::...:.: ... :::::::.:::.:::::::::::::
XP_011 DSFDSEDYPAALPNHKPKGTFKDYVRDRAD-LNKDKPVIPAAALAGYTGSGPIQLWQFLL
320 330 340 350 360 370
380 390 400 410 420 430
pF1KB4 ELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNII
:::.:::::::::::::::::::.::::::::::::::::::::::::::::::::::::
XP_011 ELLTDKSCQSFISWTGDGWEFKLSDPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNII
380 390 400 410 420 430
440 450 460
pF1KB4 HKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED
:::.:::::::::::::.:::.:::::::.: :.::...
XP_011 HKTAGKRYVYRFVCDLQSLLGYTPEELHAMLDVKPDADE
440 450 460 470
>>XP_016872805 (OMIM: 164720) PREDICTED: protein C-ets-1 (418 aa)
initn: 1413 init1: 877 opt: 1464 Z-score: 1531.5 bits: 292.5 E(85289): 1.5e-78
Smith-Waterman score: 1470; 54.7% identity (77.3% similar) in 422 aa overlap (62-469:5-418)
40 50 60 70 80 90
pF1KB4 FDGSLFAVFPSLNEEQTLQEVPTGLDSISHDSANCELPLLTPCSKAVMSQALKATFSGFK
: ..::::: :: .::::::::::::
XP_016 MCEPDMECADVPLLTPSSKEMMSQALKATFSGFT
10 20 30
100 110 120 130 140 150
pF1KB4 KEQRRLGIPKNPWLWSEQQVCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGKERFLELA
:::.::::::.: :.: .: .:..::.::::: .:..:.: ::: :: :::. :::::
XP_016 KEQQRLGIPKDPRQWTETHVRDWVMWAVNEFSLKGVDFQKFCMNGAALCALGKDCFLELA
40 50 60 70 80 90
160 170 180 190 200
pF1KB4 PDFVGDILWEHLEQMIKENQEKTEDQYEENSHLTSVPH--WINSNTLGFGTEQAPYGMQT
::::::::::::: . ::. . :. :. . :. . .. ...: :.: .
XP_016 PDFVGDILWEHLEILQKEDVKP----YQVNGVNPAYPESRYTSDYFISYGIEHAQC-VPP
100 110 120 130 140
210 220 230 240 250
pF1KB4 QNYPKGGLL----DSMCPASTPSVLSSEQEFQMFPKSRL------SSVSVTYCSVSQDF-
... . ... ... : :. .:: . : . .:. : .... : ...:.
XP_016 SEFSEPSFITESYQTLHPISSEELLSLKYEND-YPSVILRDPLQTDTLQNDYFAIKQEVV
150 160 170 180 190 200
260 270 280 290 300 310
pF1KB4 PGSNLNLLTNNSGTPKDHDSPENGADSFESSDSLLQSWNSQSSLLDVQRVPSFESFE-DD
.:. . .. : .:: :. .:..: : : :::.::::. ..:::::..::. .:
XP_016 TPDNMCMGRTSRGKLGGQDSFES-IESYDSCDRLTQSWSSQSSFNSLQRVPSYDSFDSED
210 220 230 240 250 260
320 330 340 350 360 370
pF1KB4 CSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIPAAVLAGFTGSGPIQLWQFLLELLSDKS
.: .:: .::::...:.: ... :::::::.:::.::::::::::::::::.:::
XP_016 YPAALPNHKPKGTFKDYVRDRAD-LNKDKPVIPAAALAGYTGSGPIQLWQFLLELLTDKS
270 280 290 300 310 320
380 390 400 410 420 430
pF1KB4 CQSFISWTGDGWEFKLADPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTSGKR
::::::::::::::::.:::::::::::::::::::::::::::::::::::::::.:::
XP_016 CQSFISWTGDGWEFKLSDPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTAGKR
330 340 350 360 370 380
440 450 460
pF1KB4 YVYRFVCDLQNLLGFTPEELHAILGVQPDTED
::::::::::.:::.:::::::.: :.::...
XP_016 YVYRFVCDLQSLLGYTPEELHAMLDVKPDADE
390 400 410
469 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 21:05:31 2016 done: Sat Nov 5 21:05:32 2016
Total Scan time: 8.010 Total Display time: 0.060
Function used was FASTA [36.3.4 Apr, 2011]