FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9703, 328 aa
1>>>pF1KB9703 328 - 328 aa - 328 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.4444+/-0.000285; mu= 8.9147+/- 0.018
mean_var=192.1958+/-38.216, 0's: 0 Z-trim(124.7): 65 B-trim: 23 in 1/58
Lambda= 0.092513
statistics sampled from 46719 (46784) to 46719 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.833), E-opt: 0.2 (0.549), width: 16
Scan time: 9.530
The best scores are:                                       opt  bits E(85289)
NP_835455 (OMIM: 607194,609069,615935) pancreas tr ( 328) 2275 315.1 1.3e-85
XP_011513798 (OMIM: 101400,123100,180750,601622) P ( 202)  253  45.0 0.00016
NP_000465 (OMIM: 101400,123100,180750,601622) twis ( 202)  253  45.0 0.00016
NP_001157877 (OMIM: 607539,609432,615416) class A  ( 235)  252  45.0 0.00019
NP_006152 (OMIM: 601726) neurogenin-1 [Homo sapien ( 237)  239  43.2 0.00065
XP_005260965 (OMIM: 606386) PREDICTED: oligodendro ( 323)  237  43.1 0.00096
NP_005797 (OMIM: 606386) oligodendrocyte transcrip ( 323)  237  43.1 0.00096
NP_476527 (OMIM: 200110,209885,227260,607556) twis ( 160)  227  41.4  0.0015
NP_001258822 (OMIM: 200110,209885,227260,607556) t ( 160)  227  41.4  0.0015
NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapien ( 272)  230  42.1  0.0016
NP_001277335 (OMIM: 187040,613065) T-cell acute ly ( 172)  220  40.5   0.003
XP_016857682 (OMIM: 187040,613065) PREDICTED: T-ce ( 172)  220  40.5   0.003
NP_803238 (OMIM: 608606) class A basic helix-loop- ( 189)  220  40.6  0.0032
XP_016857677 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  223  41.2  0.0036
XP_016857676 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  223  41.2  0.0036
NP_001274276 (OMIM: 187040,613065) T-cell acute ly ( 331)  223  41.2  0.0036
NP_001277333 (OMIM: 187040,613065) T-cell acute ly ( 331)  223  41.2  0.0036
XP_016857678 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  223  41.2  0.0036
XP_016857679 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  223  41.2  0.0036
NP_001277334 (OMIM: 187040,613065) T-cell acute ly ( 331)  223  41.2  0.0036
XP_016857680 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  223  41.2  0.0036
XP_005271217 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  223  41.2  0.0036
NP_001277332 (OMIM: 187040,613065) T-cell acute ly ( 331)  223  41.2  0.0036
NP_003180 (OMIM: 187040,613065) T-cell acute lymph ( 331)  223  41.2  0.0036
NP_005412 (OMIM: 186855,613065) T-cell acute lymph ( 108)  205  38.3  0.0087
>>NP_835455 (OMIM: 607194,609069,615935) pancreas transc (328 aa)
initn: 2275 init1: 2275 opt: 2275 Z-score: 1658.1 bits: 315.1 E(85289): 1.3e-85
Smith-Waterman score: 2275; 99.7% identity (99.7% similar) in 328 aa overlap (1-328:1-328)
               10        20        30        40        50        60
pF1KB9 MDAVLLEHFPGGLDAFPSSYFDEDDFFTDQSSRDPLEDGDELLADEQAEVEFLSHQLHEY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_835 MDAVLLEHFPGGLDAFPSSYFDEDDFFTDQSSRDPLEDGDELLADEQAEVEFLSHQLHEY
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB9 CYRDGACLLLQPAPPAAPLALAPPSSGGLGEPDDGGGGGYCCETGAPPGGFPYSPGSPPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_835 CYRDGACLLLQPAPPAAPLALAPPSSGGLGEPDDGGGGGYCCETGAPPGGFPYSPGSPPS
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB9 CLAYPCAGAAVLSPGARLRGLSGAAAAAARRRRRVRSEAELQQLRQAANVRERRRMQSIN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_835 CLAYPCAGAAVLSPGARLRGLSGAAAAAARRRRRVRSEAELQQLRQAANVRERRRMQSIN
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB9 DAFEGLRSHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQADLPLRGGGAGGCGGPGGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_835 DAFEGLRSHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQADLPLRGGGAGGCGGPGGGG
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB9 RLGGDSPGSQAQKVIICHRGTRPPSPSDPDYGLPPLAGHSLSWTDEKQLKEQNIIRTAKV
       :::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::
NP_835 RLGGDSPGSQAQKVIICHRGTRSPSPSDPDYGLPPLAGHSLSWTDEKQLKEQNIIRTAKV
              250       260       270       280       290       300
              310       320
pF1KB9 WTPEDPRKLNSKSSFNNIENEPPFEFVS
       ::::::::::::::::::::::::::::
NP_835 WTPEDPRKLNSKSSFNNIENEPPFEFVS
              310       320
>>XP_011513798 (OMIM: 101400,123100,180750,601622) PREDI (202 aa)
initn: 217 init1: 146 opt: 253 Z-score: 202.2 bits: 45.0 E(85289): 0.00016
Smith-Waterman score: 253; 40.4% identity (59.6% similar) in 151 aa overlap (78-222:24-166)
50 60 70 80 90 100
pF1KB9 AEVEFLSHQLHEYCYRDGACLLLQPAPPAAPLALAPPSS--GGLGEPDD----GGGGGYC
: :::. :: . .. :::.:
XP_011 MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRRSAGGGAGPG
10 20 30 40 50
110 120 130 140 150 160
pF1KB9 CETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLRGLSGAAAAAARRRRRVRSEAEL
.:. :: :::: : : : : : .:...... .: ::
XP_011 GAAGGGVGG-GDEPGSP----AQGKRGKK--SAGCGGGGGAGGGGGSSSGGGSPQSYEEL
60 70 80 90 100
170 180 190 200 210 220
pF1KB9 QQLRQAANVRERRRMQSINDAFEGLRSHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQA
: : ::::::.: ::.:.:: .::. ::::: .: :::..::.:: ::.:: ...:.
XP_011 QTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDK-LSKIQTLKLAARYIDFLYQVLQS
110 120 130 140 150 160
230 240 250 260 270 280
pF1KB9 DLPLRGGGAGGCGGPGGGGRLGGDSPGSQAQKVIICHRGTRPPSPSDPDYGLPPLAGHSL
:
XP_011 DELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH
170 180 190 200
>>NP_000465 (OMIM: 101400,123100,180750,601622) twist-re (202 aa)
initn: 217 init1: 146 opt: 253 Z-score: 202.2 bits: 45.0 E(85289): 0.00016
Smith-Waterman score: 253; 40.4% identity (59.6% similar) in 151 aa overlap (78-222:24-166)
50 60 70 80 90 100
pF1KB9 AEVEFLSHQLHEYCYRDGACLLLQPAPPAAPLALAPPSS--GGLGEPDD----GGGGGYC
: :::. :: . .. :::.:
NP_000 MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRRSAGGGAGPG
10 20 30 40 50
110 120 130 140 150 160
pF1KB9 CETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLRGLSGAAAAAARRRRRVRSEAEL
.:. :: :::: : : : : : .:...... .: ::
NP_000 GAAGGGVGG-GDEPGSP----AQGKRGKK--SAGCGGGGGAGGGGGSSSGGGSPQSYEEL
60 70 80 90 100
170 180 190 200 210 220
pF1KB9 QQLRQAANVRERRRMQSINDAFEGLRSHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQA
: : ::::::.: ::.:.:: .::. ::::: .: :::..::.:: ::.:: ...:.
NP_000 QTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDK-LSKIQTLKLAARYIDFLYQVLQS
110 120 130 140 150 160
230 240 250 260 270 280
pF1KB9 DLPLRGGGAGGCGGPGGGGRLGGDSPGSQAQKVIICHRGTRPPSPSDPDYGLPPLAGHSL
:
NP_000 DELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH
170 180 190 200
>>NP_001157877 (OMIM: 607539,609432,615416) class A basi (235 aa)
initn: 186 init1: 148 opt: 252 Z-score: 200.7 bits: 45.0 E(85289): 0.00019
Smith-Waterman score: 252; 37.5% identity (55.4% similar) in 184 aa overlap (105-277:4-183)
80 90 100 110 120
pF1KB9 PAAPLALAPPSSGGLGEPDDGGGGGYCCETGAPPGGFPYSPGSPPSC--LAYPC----AG
::: :. :. : :. :: .
NP_001 MLRGAPGLGLTARKGAEDSAEDLGGPCPEPGGD
10 20 30
130 140 150 160 170 180
pF1KB9 AAVLSPGARLRGLSGAAAAAARRRRR-VRSEAELQQLRQAANVRERRRMQSINDAFEGLR
..::. .. . . : :.::: : :::.:. :.:::::::.:. . :.::..::
NP_001 SGVLGANGASCSRGEAEEPAGRRRARPVRSKAR----RMAANVRERKRILDYNEAFNALR
40 50 60 70 80
190 200 210 220 230 240
pF1KB9 SHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQADLPLRGG-GAGGCGGPGGGGRLG--G
. :::::. ::: :: : :: ...:. :: : : ::.. : : :
NP_001 RALRHDLGGKRLSKIATLRRAIHRIAALSLVLRASPAPRGPCGHLECHGPAARGDTGDTG
90 100 110 120 130 140
250 260 270 280 290 300
pF1KB9 DSPGSQAQKVIICHRGTRPPSPSDPD-YGLPPLAGHSLSWTDEKQLKEQNIIRTAKVWTP
:: : . ..:: :: : . :: :
NP_001 ASPPPPAGPSLARPDAARPSVPSAPRCASCPPHAPLARPSAVAEGPGLAQASGGSWRRCP
150 160 170 180 190 200
310 320
pF1KB9 EDPRKLNSKSSFNNIENEPPFEFVS
NP_001 GASSAGPPPWPRGYLRSAPGMGHPRS
210 220 230
>>NP_006152 (OMIM: 601726) neurogenin-1 [Homo sapiens] (237 aa)
initn: 220 init1: 160 opt: 239 Z-score: 191.3 bits: 43.2 E(85289): 0.00065
Smith-Waterman score: 240; 36.5% identity (57.3% similar) in 178 aa overlap (114-282:40-197)
90 100 110 120 130 140
pF1KB9 PSSGGLGEPDDGGGGGYCCETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLRGLSG
: ..:: : :: .: .... : .
NP_006 SDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPP---APARRGAPNISRASEVPGAQD
10 20 30 40 50 60
150 160 170 180 190
pF1KB9 AAAAAARRR--RRVRSEAELQQLRQA----ANVRERRRMQSINDAFEGLRSHIPTLPYEK
::: :::::: :..::.. :: ::: ::...: :...::: .:..: .
NP_006 DEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDT
70 80 90 100 110 120
200 210 220 230 240 250
pF1KB9 RLSKVDTLRLAIGYINFLSELVQ-ADLPLRGGGAGGCGGPGGGGRLGGDSPGSQAQKVII
.:.:..:::.: .:: :.: .. :: : ::::.: : . .
NP_006 KLTKIETLRFAYNYIWALAETLRLADQ----------GLPGGGARERLLPP-----QCVP
130 140 150 160 170
260 270 280 290 300 310
pF1KB9 CHRGTRPPSP-SDPD-YGLPPLAGHSLSWTDEKQLKEQNIIRTAKVWTPEDPRKLNSKSS
: : :::: :: . .: :. ::
NP_006 CLPG--PPSPASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHT
180 190 200 210 220
>>XP_005260965 (OMIM: 606386) PREDICTED: oligodendrocyte (323 aa)
initn: 237 init1: 142 opt: 237 Z-score: 188.2 bits: 43.1 E(85289): 0.00096
Smith-Waterman score: 238; 30.0% identity (53.0% similar) in 230 aa overlap (64-277:4-226)
40 50 60 70 80 90
pF1KB9 DPLEDGDELLADEQAEVEFLSHQLHEYCYRDGACLLLQPAPPAAPLALAPP-SSGGLGEP
:.. . .:. : . : :.:. :
XP_005 MDSDASLVSSRPSSPEPDDLFLPARSKGSSGSA
10 20 30
100 110 120 130 140
pF1KB9 DDGGGGGYCCETGAPPGGFPYSPGSPPSCLAYP---CAGAAVLSPGARLRG-LSGAAAAA
:: . . :: :. : :.: .:.. : .. . :.:::..
XP_005 FTGGTVSSSTPSDCPPELSAELRGAMGSAGAHPGDKLGGSGFKSSSSSTSSSTSSAAASS
40 50 60 70 80 90
150 160 170 180 190 200
pF1KB9 ARRRRRVRSEAELQQLRQAANVRERRRMQSINDAFEGLRSHIPTL--PYEKRLSKVDTLR
... .. .: :::::: : :::.::...: :..::: .: : ..:::. ::
XP_005 TKKDKKQMTEPELQQLRLKINSRERKRMHDLNIAMDGLREVMPYAHGPSVRKLSKIATLL
100 110 120 130 140 150
210 220 230 240 250
pF1KB9 LAIGYI----NFLSELVQADLPLRGGGAGG-----CGGPGGGGRLGGDSPGSQAQKVIIC
:: .:: : : :. . . :: .: ::: . .. : :.. :. .
XP_005 LARNYILMLTNSLEEMKRLVSEIYGGHHAGFHPSACGGLAHSAPL----PAATAHPAAAA
160 170 180 190 200
260 270 280 290 300 310
pF1KB9 HRGTRPPSPSDPDYGLPPLAGHSLSWTDEKQLKEQNIIRTAKVWTPEDPRKLNSKSSFNN
: ... :. : ::: :
XP_005 H-AAHHPAVHHPI--LPPAAAAAAAAAAAAAVSSASLPGSGLPSVGSIRPPHGLLKSPSA
210 220 230 240 250 260
>>NP_005797 (OMIM: 606386) oligodendrocyte transcription (323 aa)
initn: 237 init1: 142 opt: 237 Z-score: 188.2 bits: 43.1 E(85289): 0.00096
Smith-Waterman score: 238; 30.0% identity (53.0% similar) in 230 aa overlap (64-277:4-226)
40 50 60 70 80 90
pF1KB9 DPLEDGDELLADEQAEVEFLSHQLHEYCYRDGACLLLQPAPPAAPLALAPP-SSGGLGEP
:.. . .:. : . : :.:. :
NP_005 MDSDASLVSSRPSSPEPDDLFLPARSKGSSGSA
10 20 30
100 110 120 130 140
pF1KB9 DDGGGGGYCCETGAPPGGFPYSPGSPPSCLAYP---CAGAAVLSPGARLRG-LSGAAAAA
:: . . :: :. : :.: .:.. : .. . :.:::..
NP_005 FTGGTVSSSTPSDCPPELSAELRGAMGSAGAHPGDKLGGSGFKSSSSSTSSSTSSAAASS
40 50 60 70 80 90
150 160 170 180 190 200
pF1KB9 ARRRRRVRSEAELQQLRQAANVRERRRMQSINDAFEGLRSHIPTL--PYEKRLSKVDTLR
... .. .: :::::: : :::.::...: :..::: .: : ..:::. ::
NP_005 TKKDKKQMTEPELQQLRLKINSRERKRMHDLNIAMDGLREVMPYAHGPSVRKLSKIATLL
100 110 120 130 140 150
210 220 230 240 250
pF1KB9 LAIGYI----NFLSELVQADLPLRGGGAGG-----CGGPGGGGRLGGDSPGSQAQKVIIC
:: .:: : : :. . . :: .: ::: . .. : :.. :. .
NP_005 LARNYILMLTNSLEEMKRLVSEIYGGHHAGFHPSACGGLAHSAPL----PAATAHPAAAA
160 170 180 190 200
260 270 280 290 300 310
pF1KB9 HRGTRPPSPSDPDYGLPPLAGHSLSWTDEKQLKEQNIIRTAKVWTPEDPRKLNSKSSFNN
: ... :. : ::: :
NP_005 H-AAHHPAVHHPI--LPPAAAAAAAAAAAAAVSSASLPGSGLPSVGSIRPPHGLLKSPSA
210 220 230 240 250 260
>>NP_476527 (OMIM: 200110,209885,227260,607556) twist-re (160 aa)
initn: 235 init1: 146 opt: 227 Z-score: 184.7 bits: 41.4 E(85289): 0.0015
Smith-Waterman score: 227; 48.9% identity (70.0% similar) in 90 aa overlap (133-222:44-124)
110 120 130 140 150 160
pF1KB9 ETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLRGLSGAAAAAARRRRRVRSEAELQ
:: :: .:. .: .: :::
NP_476 SLGTSEEELERQPKRFGRKRRYSKKSSEDGSPTPGKRGKKGSPSA--------QSFEELQ
20 30 40 50 60
170 180 190 200 210 220
pF1KB9 QLRQAANVRERRRMQSINDAFEGLRSHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQAD
. : ::::::.: ::.:.:: .::. ::::: .: :::..::.:: ::.:: ...:.:
NP_476 SQRILANVRERQRTQSLNEAFAALRKIIPTLPSDK-LSKIQTLKLAARYIDFLYQVLQSD
70 80 90 100 110 120
230 240 250 260 270 280
pF1KB9 LPLRGGGAGGCGGPGGGGRLGGDSPGSQAQKVIICHRGTRPPSPSDPDYGLPPLAGHSLS
NP_476 EMDNKMTSCSYVAHERLSYAFSVWRMEGAWSMSASH
130 140 150 160
>>NP_001258822 (OMIM: 200110,209885,227260,607556) twist (160 aa)
initn: 235 init1: 146 opt: 227 Z-score: 184.7 bits: 41.4 E(85289): 0.0015
Smith-Waterman score: 227; 48.9% identity (70.0% similar) in 90 aa overlap (133-222:44-124)
110 120 130 140 150 160
pF1KB9 ETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLRGLSGAAAAAARRRRRVRSEAELQ
:: :: .:. .: .: :::
NP_001 SLGTSEEELERQPKRFGRKRRYSKKSSEDGSPTPGKRGKKGSPSA--------QSFEELQ
20 30 40 50 60
170 180 190 200 210 220
pF1KB9 QLRQAANVRERRRMQSINDAFEGLRSHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQAD
. : ::::::.: ::.:.:: .::. ::::: .: :::..::.:: ::.:: ...:.:
NP_001 SQRILANVRERQRTQSLNEAFAALRKIIPTLPSDK-LSKIQTLKLAARYIDFLYQVLQSD
70 80 90 100 110 120
230 240 250 260 270 280
pF1KB9 LPLRGGGAGGCGGPGGGGRLGGDSPGSQAQKVIICHRGTRPPSPSDPDYGLPPLAGHSLS
NP_001 EMDNKMTSCSYVAHERLSYAFSVWRMEGAWSMSASH
130 140 150 160
>>NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapiens] (272 aa)
initn: 231 init1: 171 opt: 230 Z-score: 184.1 bits: 42.1 E(85289): 0.0016
Smith-Waterman score: 244; 31.2% identity (52.3% similar) in 266 aa overlap (52-275:2-254)
30 40 50 60 70 80
pF1KB9 DEDDFFTDQSSRDPLEDGDELLADEQAEVEFLSHQLHEYCYRDGACLLLQPAPPAAPLAL
:.. . : .. . .:: : :: ::
NP_076 MFVKSETLELKEEEDVLVLLGSASPALA-AL
10 20 30
90 100 110 120 130
pF1KB9 APPSSGGLGEPDD--GGGGGYCCETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLR
.: ::.. : .. :..:: . :: : :. . ::: : :::
NP_076 TPLSSSADEEEEEEPGASGGARRQRGAEAG-----QGARGGV----AAGAEGCRP-ARLL
40 50 60 70 80
140 150 160 170 180 190
pF1KB9 GL-------SGAAAAAARRRRRVRSEAELQQLRQ-AANVRERRRMQSINDAFEGLRSHIP
:: . : :..: . ... .... :. :: ::: ::...: :...:: .:
NP_076 GLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLP
90 100 110 120 130 140
200 210 220 230 240
pF1KB9 TLPYEKRLSKVDTLRLAIGYINFLSELVQ-ADLPLRGGGAGGCGG----------PGGGG
:.: . .:.:..:::.: .:: :.: .. :: :::.:: : :::..
NP_076 TFPEDAKLTKIETLRFAHNYIWALTETLRLADHC--GGGGGGLPGALFSEAVLLSPGGAS
150 160 170 180 190
250 260 270
pF1KB9 RL---GGDSPG---------SQAQKVIICHRGTRP---------PSPSDPDYGLPPLAGH
.::::. : : . . .: : :. :: :: ::
NP_076 AALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPDK
200 210 220 230 240 250
280 290 300 310 320
pF1KB9 SLSWTDEKQLKEQNIIRTAKVWTPEDPRKLNSKSSFNNIENEPPFEFVS
NP_076 HRYAPHLPIARDCI
260 270
328 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 01:27:55 2016 done: Sat Nov 5 01:27:57 2016
Total Scan time: 9.530 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]