FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9619, 347 aa
1>>>pF1KB9619 347 - 347 aa - 347 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.5166+/-0.000346; mu= 5.1961+/- 0.022
mean_var=280.8366+/-56.243, 0's: 0 Z-trim(124.1): 24 B-trim: 0 in 0/57
Lambda= 0.076533
statistics sampled from 45112 (45140) to 45112 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.803), E-opt: 0.2 (0.529), width: 16
Scan time: 10.930
The best scores are: opt bits E(85289)
NP_002220 (OMIM: 165161) transcription factor jun- ( 347) 2336 270.5 3.9e-72
NP_002219 (OMIM: 165160) transcription factor AP-1 ( 331) 711 91.0 3.9e-18
NP_001273897 (OMIM: 165162) transcription factor j ( 304) 660 85.4 1.8e-16
NP_005345 (OMIM: 165162) transcription factor jun- ( 347) 660 85.4 2e-16
>>NP_002220 (OMIM: 165161) transcription factor jun-B [H (347 aa)
initn: 2336 init1: 2336 opt: 2336 Z-score: 1416.1 bits: 270.5 E(85289): 3.9e-72
Smith-Waterman score: 2336; 100.0% identity (100.0% similar) in 347 aa overlap (1-347:1-347)
10 20 30 40 50 60
pF1KB9 MCTKMEQPFYHDDSYTATGYGRAPGGLSLHDYKLLKPSLAVNLADPYRSLKAPGARGPGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MCTKMEQPFYHDDSYTATGYGRAPGGLSLHDYKLLKPSLAVNLADPYRSLKAPGARGPGP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 EGGGGGSYFSGQGSDTGASLKLASSELERLIVPNSNGVITTTPTPPGQYFYPRGGGSGGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 EGGGGGSYFSGQGSDTGASLKLASSELERLIVPNSNGVITTTPTPPGQYFYPRGGGSGGG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 AGGAGGGVTEEQEGFADGFVKALDDLHKMNHVTPPNVSLGATGGPPAGPGGVYAGPEPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 AGGAGGGVTEEQEGFADGFVKALDDLHKMNHVTPPNVSLGATGGPPAGPGGVYAGPEPPP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 VYTNLSSYSPASASSGGAGAAVGTGSSYPTTTISYLPHAPPFAGGHPAQLGLGRGASTFK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 VYTNLSSYSPASASSGGAGAAVGTGSSYPTTTISYLPHAPPFAGGHPAQLGLGRGASTFK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 EEPQTVPEARSRDATPPVSPINMEDQERIKVERKRLRNRLAATKCRKRKLERIARLEDKV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 EEPQTVPEARSRDATPPVSPINMEDQERIKVERKRLRNRLAATKCRKRKLERIARLEDKV
250 260 270 280 290 300
310 320 330 340
pF1KB9 KTLKAENAGLSSTAGLLREQVAQLKQKVMTHVSNGCQLLLGVKGHAF
:::::::::::::::::::::::::::::::::::::::::::::::
NP_002 KTLKAENAGLSSTAGLLREQVAQLKQKVMTHVSNGCQLLLGVKGHAF
310 320 330 340
>>NP_002219 (OMIM: 165160) transcription factor AP-1 [Ho (331 aa)
initn: 876 init1: 454 opt: 711 Z-score: 446.6 bits: 91.0 E(85289): 3.9e-18
Smith-Waterman score: 859; 47.7% identity (67.9% similar) in 365 aa overlap (1-347:1-331)
10 20 30 40 50 60
pF1KB9 MCTKMEQPFYHDDSYTATGYGRAPGGLSLHDYKLLKPSLAVNLADPYRSLKAPGARGPGP
: .::: :: ::. .:. : . . :.:: :...::::: ::: : :. .
NP_002 MTAKMETTFY-DDALNASFLPSESGPYGYSNPKILKQSMTLNLADPVGSLK-PHLRAKNS
10 20 30 40 50
70 80 90 100 110 120
pF1KB9 EGGGGGSYFSGQGSDTGASLKLASSELERLIVPNSNGVITTTPTPPGQYFYPRGGGSGGG
. ... :.: ::::: ::::::. .::: ::::::: :.. :..
NP_002 D------LLTSP--DVGL-LKLASPELERLIIQSSNGHITTTPTPT-QFLCPKN------
60 70 80 90 100
130 140 150 160
pF1KB9 AGGAGGGVTEEQEGFADGFVKALDDLHKMNHVTPPNVSL------GATGGPPA-------
::.::::::.:::.:: .::..: : :.:. :: ::
NP_002 -------VTDEQEGFAEGFVRALAELHSQN--TLPSVTSAAQPVNGAGMVAPAVASVAGG
110 120 130 140 150
170 180 190 200 210 220
pF1KB9 -GPGGVYAG-PEPPPVYTNLSSYSPASASSGGAGAAVGT-GSSYPTTTISYLPHAPPFAG
: :: :. ::::.:::...:.. ::::.. . :. : ..:. . ::
NP_002 SGSGGFSASLHSEPPVYANLSNFNPGALSSGGGAPSYGAAGLAFPAQ--PQQQQQPPH--
160 170 180 190 200
230 240 250 260 270 280
pF1KB9 GH-PAQLGLGRGA-STFKEEPQTVPEARSRDATPPVSPINMEDQERIKVERKRLRNRLAA
: : :. . . ...::::::::: .. :::.:::.::.:::::.::::.:::.::
NP_002 -HLPQQMPVQHPRLQALKEEPQTVPEMPGE--TPPLSPIDMESQERIKAERKRMRNRIAA
210 220 230 240 250 260
290 300 310 320 330 340
pF1KB9 TKCRKRKLERIARLEDKVKTLKAENAGLSSTAGLLREQVAQLKQKVMTHVSNGCQLLLGV
.::::::::::::::.:::::::.:. :.:::..:::::::::::::.::..::::.:
NP_002 SKCRKRKLERIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNHVNSGCQLMLTQ
270 280 290 300 310 320
pF1KB9 KGHAF
. ..:
NP_002 QLQTF
330
>>NP_001273897 (OMIM: 165162) transcription factor jun-D (304 aa)
initn: 605 init1: 545 opt: 660 Z-score: 416.6 bits: 85.4 E(85289): 1.8e-16
Smith-Waterman score: 791; 49.1% identity (69.5% similar) in 318 aa overlap (35-339:19-296)
10 20 30 40 50 60
pF1KB9 MEQPFYHDDSYTATGYGRAPGGLSLHDYKLLKPSLAVNLADPYRSLKAPGARGPGPEGGG
:::. : : :.: :: . .: :
NP_001 MMKKDALTLSLSEQVAAALKPAA----APPPTPLRADGAPSAAPPDGL
10 20 30 40
70 80 90 100 110 120
pF1KB9 GGSYFSGQGSDTGASLKLASSELERLIVPNSNGVITTTPTPPGQYFYPRGGGSGGGAGGA
.: : : ::::: ::::::. .:::..::::: .:..::. ..:
NP_001 LAS------PDLGL-LKLASPELERLII-QSNGLVTTTPTS-SQFLYPKVAAS-------
50 60 70 80
130 140 150 160 170
pF1KB9 GGGVTEEQEGFADGFVKALDDLHKMNHVTPPNVSLGATGGPPAGPGGVYAGPEPP-----
:::: ::.::::::.::::.:.. ... .:... .::.:. .: ::
NP_001 -----EEQE-FAEGFVKALEDLHKQNQLGA-GAAAAAAAAAAGGPSGTATGSAPPGELAP
90 100 110 120 130 140
180 190 200 210 220 230
pF1KB9 -------PVYTNLSSYSPASASSGGAGAAVGTGS-SYPTTTISYLPHAPPFAGGHPAQLG
:::.:::::. ::::.: :... .. . . . : :: : : :
NP_001 AAAAPEAPVYANLSSYA------GGAGGAGGAATVAFAAEPVPFPPPPPPGALGPP----
150 160 170 180 190
240 250 260 270 280 290
pF1KB9 LGRGASTFKEEPQTVPEARSRDATPPVSPINMEDQERIKVERKRLRNRLAATKCRKRKLE
: :. .:.::::::.. : .::.:::.:. :::::.::::::::.::.::::::::
NP_001 --RLAA-LKDEPQTVPDVPSFGESPPLSPIDMDTQERIKAERKRLRNRIAASKCRKRKLE
200 210 220 230 240
300 310 320 330 340
pF1KB9 RIARLEDKVKTLKAENAGLSSTAGLLREQVAQLKQKVMTHVSNGCQLLLGVKGHAF
::.:::.::::::..:. :.:::.:::::::::::::..::..:::::
NP_001 RISRLEEKVKTLKSQNTELASTASLLREQVAQLKQKVLSHVNSGCQLLPQHQVPAY
250 260 270 280 290 300
>>NP_005345 (OMIM: 165162) transcription factor jun-D is (347 aa)
initn: 605 init1: 545 opt: 660 Z-score: 416.0 bits: 85.4 E(85289): 2e-16
Smith-Waterman score: 804; 45.0% identity (66.7% similar) in 369 aa overlap (5-339:1-339)
10 20 30 40
pF1KB9 MCTKMEQPFYHDDSYTATGYGRAPGGLSLHDYKLLKP---------------SLAVNLAD
:: ::: :.. .. : : . .: :. . : : .:...:..
NP_005 METPFYGDEALSGLGGGASGSGGSFASPGRLFPGAPPTAAAGSMMKKDALTLSLSE
10 20 30 40 50
50 60 70 80 90
pF1KB9 PYRS-LKAPGARGPGP-EGGGGGSYFSGQG----SDTGASLKLASSELERLIVPNSNGVI
. :: .: : : .. :. : .: : : ::::: ::::::. .:::..
NP_005 QVAAALKPAAAPPPTPLRADGAPSAAPPDGLLASPDLGL-LKLASPELERLII-QSNGLV
60 70 80 90 100 110
100 110 120 130 140 150
pF1KB9 TTTPTPPGQYFYPRGGGSGGGAGGAGGGVTEEQEGFADGFVKALDDLHKMNHVTPPNVSL
::::: .:..::. ..: :::: ::.::::::.::::.:.. ...
NP_005 TTTPTS-SQFLYPKVAAS------------EEQE-FAEGFVKALEDLHKQNQLGA-GAAA
120 130 140 150
160 170 180 190 200
pF1KB9 GATGGPPAGPGGVYAGPEPP------------PVYTNLSSYSPASASSGGAGAAVGTGS-
.:... .::.:. .: :: :::.:::::. ::::.: :...
NP_005 AAAAAAAGGPSGTATGSAPPGELAPAAAAPEAPVYANLSSYA------GGAGGAGGAATV
160 170 180 190 200 210
210 220 230 240 250 260
pF1KB9 SYPTTTISYLPHAPPFAGGHPAQLGLGRGASTFKEEPQTVPEARSRDATPPVSPINMEDQ
.. . . . : :: : : : : :. .:.::::::.. : .::.:::.:. :
NP_005 AFAAEPVPFPPPPPPGALGPP------RLAA-LKDEPQTVPDVPSFGESPPLSPIDMDTQ
220 230 240 250 260
270 280 290 300 310 320
pF1KB9 ERIKVERKRLRNRLAATKCRKRKLERIARLEDKVKTLKAENAGLSSTAGLLREQVAQLKQ
::::.::::::::.::.::::::::::.:::.::::::..:. :.:::.:::::::::::
NP_005 ERIKAERKRLRNRIAASKCRKRKLERISRLEEKVKTLKSQNTELASTASLLREQVAQLKQ
270 280 290 300 310 320
330 340
pF1KB9 KVMTHVSNGCQLLLGVKGHAF
::..::..:::::
NP_005 KVLSHVNSGCQLLPQHQVPAY
330 340
347 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:45:03 2016 done: Fri Nov 4 17:45:05 2016
Total Scan time: 10.930 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]