FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9877, 546 aa
1>>>pF1KB9877 546 - 546 aa - 546 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.9704+/-0.000418; mu= -4.2416+/- 0.026
mean_var=395.3926+/-81.577, 0's: 0 Z-trim(123.0): 2 B-trim: 728 in 1/60
Lambda= 0.064500
statistics sampled from 42065 (42068) to 42065 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.493), width: 16
Scan time: 12.770
The best scores are:                                       opt bits E(85289)
NP_057511 (OMIM: 609522) RNA polymerase II transcr ( 753) 1420 146.3 3.3e-34
NP_003189 (OMIM: 600786) transcription elongation  ( 798)  869  95.0 9.2e-19
>>NP_057511 (OMIM: 609522) RNA polymerase II transcripti (753 aa)
initn: 3024 init1: 1420 opt: 1420 Z-score: 735.2 bits: 146.3 E(85289): 3.3e-34
Smith-Waterman score: 1871; 58.8% identity (62.0% similar) in 577 aa overlap (1-385:1-577)
10 20 30 40 50 60
pF1KB9 MAAGSTTLRAVGKLQVRLATKTEPKKLEKYLQKLSALPMTADILAETGIRKTVKRLRKHQ
::::::::.:: ::::::::::::::::::::::::::::::::::::::::::::::::
NP_057 MAAGSTTLHAVEKLQVRLATKTEPKKLEKYLQKLSALPMTADILAETGIRKTVKRLRKHQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 HVGDFARDLAARWKKLVLVDRNTGPDPQDPEESASRQRFGEALQEREKAWGFPENATAPR
::::::::::::::::::::::: : ::::::::::::::::::..::::::::::::::
NP_057 HVGDFARDLAARWKKLVLVDRNTRPGPQDPEESASRQRFGEALQDQEKAWGFPENATAPR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 SPSHSPEHRRTARRTPPGQQRPHPRSPSREPRAERKRPRMAPADSGPHRDPPTRTAPLPM
:::::::::::::::::::::::::: ::::::::: ::.:::::: .: ::::::: :
NP_057 SPSHSPEHRRTARRTPPGQQRPHPRSHSREPRAERKCPRIAPADSGRYRASPTRTAPLRM
130 140 150 160 170 180
190 200 210 220 230
pF1KB9 PEGPEPAVPGEQPGRGHAHAAQGGPLLGQGCQGQPQGEAVGSHSKGHKSSR---------
:::::::.::.::::::.::::::::: ::::::::.:: ::::::::::
NP_057 PEGPEPAAPGKQPGRGHTHAAQGGPLLCPGCQGQPQGKAVVSHSKGHKSSRQEKRPLCAQ
190 200 210 220 230 240
pF1KB9 --------------GA--------------------------------------------
::
NP_057 GDWHSPTLIREKSCGACLREETPRMPSWASARDRQPSDFKTDKEGGQAGSGQRVPALEEA
250 260 270 280 290 300
pF1KB9 ------------------------------------------------------------
NP_057 PDSHQKRPQHSHSNKKRPSLDGRDPGNGTHGLSPEEKEQLSNDRETQEGKPPTAHLDRTS
310 320 330 340 350 360
pF1KB9 ------------------------------------------------------------
NP_057 VSSLSEVEEVDMAEEFEQPTLSCEKYLTYDQLRKQKKKTGKSATTALGDKQRKANESKGT
370 380 390 400 410 420
240 250 260 270 280
pF1KB9 -----SAQKSPPVQESQSERLQAAGADSAGPKTVPSHVFSELWDPSEAWMQANYDLLSAF
::.: :::::::::::::::::::::::::::::::::: :::::::::: ::
NP_057 RESWDSAKKLPPVQESQSERLQAAGADSAGPKTVPSHVFSELWDLSEAWMQANYDPLSDS
430 440 450 460 470 480
290 300 310 320 330 340
pF1KB9 EAMTSQANPEALSAPTLQEEAAFPGRRVNAKMPVYSGSRPACQLQVPTLRQQCLRVPRNN
..:::::.:::::.: ..::::::::::::::::::::::::::::::::::: .: :::
NP_057 DSMTSQAKPEALSSPKFREEAAFPGRRVNAKMPVYSGSRPACQLQVPTLRQQCAQVLRNN
490 500 510 520 530 540
350 360 370 380 390 400
pF1KB9 PDALGDVEGVPYSVLEPVLEGWTPDQPYRTEKDNAALARETDELWRIHCLQDFKEEKPQE
::::.:: ::: ::::::::: ::: :: .::: ::
NP_057 PDALSDVGEVPYWVLEPVLEGWRPDQLYRRKKDNHALVRETDELRRNHCFQDFKEEKPQE
550 560 570 580 590 600
410 420 430 440 450 460
pF1KB9 HESWRELYLRLRDAREQRLRVVTTKIRSARENKPSGRQTKMICFNSVAKTPYDASRRQEK
NP_057 NKTWREQYLRLPDAPEQRLRVMTTNIRSARGNNPNGREAKMICFKSVAKTPYDTSRRQEK
610 620 630 640 650 660
>--
initn: 776 init1: 574 opt: 750 Z-score: 398.3 bits: 83.9 E(85289): 1.9e-15
Smith-Waterman score: 750; 69.1% identity (82.9% similar) in 175 aa overlap (387-546:579-753)
360 370 380 390 400 410
pF1KB9 GVPYSVLEPVLEGWTPDQPYRTEKDNAALARETDELWRIHCLQDFKEEKPQEHESWRELY
:::::: : ::.::::::::::...::: :
NP_057 EVPYWVLEPVLEGWRPDQLYRRKKDNHALVRETDELRRNHCFQDFKEEKPQENKTWREQY
550 560 570 580 590 600
420 430 440 450 460 470
pF1KB9 LRLRDAREQRLRVVTTKIRSARENKPSGRQTKMICFNSVAKTPYDASRRQEKSAGAADPG
::: :: ::::::.::.::::: :.:.::..:::::.::::::::.::::::::: :::
NP_057 LRLPDAPEQRLRVMTTNIRSARGNNPNGREAKMICFKSVAKTPYDTSRRQEKSAGDADPE
610 620 630 640 650 660
480 490 500 510 520
pF1KB9 NGEMEPAPKPAGSSQAPSGLGDGDGGSVSGGG---------------SSNRHAAPADKTR
:::..:: ::::::..::. ... :: :... :::.::::: :::
NP_057 NGEIKPASKPAGSSHTPSSQSSSGGGRDSSSSILRWLPEKRANPCLSSSNEHAAPAAKTR
670 680 690 700 710 720
530 540
pF1KB9 KQAAKKVAPLMAKAIRDYKGRFSRR
::::::::::::::::::: :::::
NP_057 KQAAKKVAPLMAKAIRDYKRRFSRR
730 740 750
>>NP_003189 (OMIM: 600786) transcription elongation fact (798 aa)
initn: 1420 init1: 718 opt: 869 Z-score: 457.8 bits: 95.0 E(85289): 9.2e-19
Smith-Waterman score: 962; 38.8% identity (64.6% similar) in 492 aa overlap (82-546:314-798)
60 70 80 90 100 110
pF1KB9 TVKRLRKHQHVGDFARDLAARWKKLVLVDRNTGPDPQDPEESASRQRFGEALQEREKAWG
:. : . . ..: : .:...
NP_003 KSDEKASVVSREKSHKALSKEENRRPPSGDNAREKPPSSGVKKEKDREGSSLKKK----C
290 300 310 320 330
120 130 140 150 160
pF1KB9 FPENATAPRSPSHSPEHRRTAR----RTPPGQQRPHPRSPSRE--PRAERKRPRMAPADS
.: . .: . ..:.:: . .. : . . . . :....: .
NP_003 LPPSEAASDNHLKKPKHRDPEKAKLDKSKQGLDSFDTGKGAGDLLPKVKEKGSNNLKTPE
340 350 360 370 380 390
170 180 190 200 210 220
pF1KB9 GPHRDPPTRTAPLPMPEGPEPAVPGE--QPGRG-HAHAAQGGPLLGQGCQGQPQGEAVGS
: . : . .:. : . : :: . ... . : . . .. :.:.
NP_003 GKVKTNLDRKSLGSLPKVEETDMEDEFEQPTMSFESYLSYDQPRKKKKKIVKTSATALGD
400 410 420 430 440 450
230 240 250 260 270
pF1KB9 HS--KGHKSSRGA---SAQKSPPVQESQSERLQAAGADSAGPKTVPSHVFSELWDPSEAW
.. :. ..: : :.:: : :....::. :::: : . ::. :. : :
NP_003 KGLKKNDSKSTGKNLDSVQKLPKVNKTKSEK--PAGADLAKLRKVPD-VLPVLPDLPLPA
460 470 480 490 500 510
280 290 300 310 320 330
pF1KB9 MQANYDLLSAFEAMTS-QANPEALSAPTLQEEAAFPGRRVNAKMPVYSGSRPACQLQVPT
.:::: : ..: ..: : . .:.:.: .:::.: :::.:.:: :::::. : .. :
NP_003 IQANYRPLPSLELISSFQPKRKAFSSPQEEEEAGFTGRRMNSKMQVYSGSKCAYLPKMMT
520 530 540 550 560 570
340 350 360 370 380 390
pF1KB9 LRQQCLRVPRNNPDALGDVEGVPYSVLEPVLEGWTPDQPYRTEKDNAALARETDELWRIH
:.:::.:: .:: :.. .: :::::::::::: :::: :: :. : .: .:::.::..:
NP_003 LHQQCIRVLKNNIDSIFEVGGVPYSVLEPVLERCTPDQLYRIEEYNHVLIEETDQLWKVH
580 590 600 610 620 630
400 410 420 430 440 450
pF1KB9 CLQDFKEEKPQEHESWRELYLRLRDAREQRLRVVTTKIRSARENKPSGRQTKMICFNSVA
: .:::::.:.:.:::::.::::.:::::::::.: .:. :. :::.:::.:: ::::
NP_003 CHRDFKEERPEEYESWREMYLRLQDAREQRLRVLTKNIQFAHANKPKGRQAKMAFVNSVA
640 650 660 670 680 690
460 470 480 490 500 510
pF1KB9 KTPYDASRRQEK--SAGAADPGNGEMEPAPKPAGSSQAP-SGLG---DGDGGSVSGGGSS
: : :. ::::: ..::: : . ...::: : :::.: :... . . . .: ..:
NP_003 KPPRDVRRRQEKFGTGGAAVPEKIKIKPAPYPMGSSHASASSISFNPSPEEPAYDGPSTS
700 710 720 730 740 750
520 530 540
pF1KB9 NRHAAPADKT------RKQAAKKVAPLMAKAIRDYKGRFSRR
. : ::. .. :: ..::.::.:::.:. .:.:::::
NP_003 SAHLAPVVSSTVSYDPRKPTVKKIAPMMAKTIKAFKNRFSRR
760 770 780 790
>--
initn: 573 init1: 404 opt: 570 Z-score: 307.4 bits: 67.2 E(85289): 2.2e-10
Smith-Waterman score: 640; 46.1% identity (68.8% similar) in 269 aa overlap (1-248:27-288)
10 20 30
pF1KB9 MAAGSTTLRAVGKLQVRLATKTEPKKLEKYLQKL
::: :. :..: :::.:::.. .:::: :::.::
NP_003 MHGGRSCGPRTRREPSSGEEAAPVTAMAAESA-LQVVEKLQARLAANPDPKKLLKYLKKL
10 20 30 40 50
40 50 60 70 80 90
pF1KB9 SALPMTADILAETGIRKTVKRLRKHQHVGDFARDLAARWKKLVLVDRNTGPDPQDPEESA
:.::.:.:::::::. :::. ::::.:::.:::::.:.::::: :.::. :: :: :.:
NP_003 STLPITVDILAETGVGKTVNSLRKHEHVGSFARDLVAQWKKLVPVERNAEPDEQDFEKSN
60 70 80 90 100 110
100 110 120 130 140 150
pF1KB9 SRQRFGEALQEREKAWG-FPENATAPRSPSHSPEHRRTARRTPPGQQRPHPRSPSREPRA
::.: .:::..:. : . :. : : :.::.::. .: .::: : ..: :
NP_003 SRKRPRDALQKEEEMEGDYQETWKATGSRSYSPDHRQKKHRKLSELERPHKVSHGHERRD
120 130 140 150 160 170
160 170 180 190
pF1KB9 ERKR-PRMAP--------ADSGPHRDPPTRTAPLPM--------PEGPEPAVPGEQPGRG
:::: ::.: .: : ..::. :.: : : :: : ..::.:
NP_003 ERKRCHRMSPTYSSDPESSDYGHVQSPPSCTSPHQMYVDHYRSLEEDQEPIVSHQKPGKG
180 190 200 210 220 230
200 210 220 230 240 250
pF1KB9 HAHAAQGGPLLGQGCQ---GQPQGEAVGSHSKGHKSSRGASAQKSPPVQESQSERLQAAG
:..: : :: . . :.:.:..: :..: ::::. . . ::. ...:.
NP_003 HSNAFQD--RLGASQERHLGEPHGKGVVSQNKEHKSSH----KDKRPVDAKSDEKASVVS
240 250 260 270 280 290
260 270 280 290 300 310
pF1KB9 ADSAGPKTVPSHVFSELWDPSEAWMQANYDLLSAFEAMTSQANPEALSAPTLQEEAAFPG
NP_003 REKSHKALSKEENRRPPSGDNAREKPPSSGVKKEKDREGSSLKKKCLPPSEAASDNHLKK
300 310 320 330 340 350
546 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 19:57:46 2016 done: Fri Nov 4 19:57:48 2016
Total Scan time: 12.770 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
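
Note on the bits and E(85289) columns: the four E() values in this report appear consistent with the relation E = D * m * n * 2^(-bits), where D is the number of library sequences (85289), m is the query length (546 aa), and n is the length of the matched library sequence (753 or 798 aa). That relation is an assumption inferred from the numbers printed above, not something the report states. The sketch below is a minimal illustration of it. The function names `parse_stats_line` and `estimate_evalue` are hypothetical helpers and are not part of the FASTA package.

```python
import re

# Values copied from the report above.
QUERY_LEN = 546        # "Query: pF1KB9877, 546 aa"
LIBRARY_SEQS = 85289   # "60827320 residues in 85289 sequences"

# (subject length, bit score, reported E-value) for the four alignments shown.
ALIGNMENTS = [
    (753, 146.3, 3.3e-34),  # NP_057511, first alignment
    (753, 83.9, 1.9e-15),   # NP_057511, second alignment (after ">--")
    (798, 95.0, 9.2e-19),   # NP_003189, first alignment
    (798, 67.2, 2.2e-10),   # NP_003189, second alignment (after ">--")
]

# Matches the tail of an "initn: ... bits: 146.3 E(85289): 3.3e-34" line.
STATS_RE = re.compile(
    r"bits:\s*([0-9.]+)\s+E\((\d+)\):\s*([0-9.]+(?:[eE][+-]?\d+)?)"
)


def parse_stats_line(line):
    """Return (bits, library size, E-value) from an alignment statistics line."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("no bits/E() fields found in line")
    return float(match.group(1)), int(match.group(2)), float(match.group(3))


def estimate_evalue(bits, query_len, subject_len, n_library_seqs):
    """Assumed relation (inferred from this report): E = D * m * n * 2**(-bits)."""
    return n_library_seqs * query_len * subject_len * 2.0 ** (-bits)


if __name__ == "__main__":
    for subject_len, bits, reported in ALIGNMENTS:
        estimate = estimate_evalue(bits, QUERY_LEN, subject_len, LIBRARY_SEQS)
        # Ratios near 1 mean the assumed relation reproduces the printed value
        # up to the rounding of the bit score to one decimal place.
        print(f"bits={bits:6.1f}  reported={reported:.1e}  "
              f"estimated={estimate:.1e}  ratio={estimate / reported:.2f}")
```

Because the bit scores are printed to one decimal place, the estimates can only be expected to agree with the reported E() values to within a few percent.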