FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9630, 237 aa
1>>>pF1KB9630 237 - 237 aa - 237 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0663+/-0.000262; mu= 9.6306+/- 0.017
mean_var=158.3319+/-32.058, 0's: 0 Z-trim(125.1): 61 B-trim: 1967 in 1/59
Lambda= 0.101927
statistics sampled from 47910 (47978) to 47910 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.843), E-opt: 0.2 (0.563), width: 16
Scan time: 7.020
The best scores are: opt bits E(85289)
NP_006152 (OMIM: 601726) neurogenin-1 [Homo sapien ( 237) 1621 248.7 6.5e-66
XP_016871769 (OMIM: 604882,610370) PREDICTED: neur ( 214) 472 79.7 4.4e-15
NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo ( 214) 472 79.7 4.4e-15
NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapien ( 272) 437 74.7 1.8e-13
NP_067014 (OMIM: 611635) neurogenic differentiatio ( 331) 312 56.4 7.3e-08
NP_002491 (OMIM: 125853,601724,606394) neurogenic ( 356) 292 53.4 5.9e-07
NP_006151 (OMIM: 601725) neurogenic differentiatio ( 382) 286 52.6 1.1e-06
NP_073565 (OMIM: 611513) neurogenic differentiatio ( 337) 281 51.8 1.7e-06
NP_005797 (OMIM: 606386) oligodendrocyte transcrip ( 323) 244 46.3 7.3e-05
XP_005260965 (OMIM: 606386) PREDICTED: oligodendro ( 323) 244 46.3 7.3e-05
NP_005163 (OMIM: 601461) protein atonal homolog 1 ( 354) 242 46.1 9.6e-05
NP_835455 (OMIM: 607194,609069,615935) pancreas tr ( 328) 241 45.9 0.0001
NP_542173 (OMIM: 609331) class E basic helix-loop- ( 241) 237 45.2 0.00012
NP_803238 (OMIM: 608606) class A basic helix-loop- ( 189) 219 42.5 0.00063
XP_005259969 (OMIM: 151440) PREDICTED: protein lyl ( 206) 215 41.9 0.001
NP_005574 (OMIM: 151440) protein lyl-1 [Homo sapie ( 280) 215 42.0 0.0013
XP_016882306 (OMIM: 151440) PREDICTED: protein lyl ( 302) 215 42.1 0.0013
XP_016882305 (OMIM: 151440) PREDICTED: protein lyl ( 351) 215 42.1 0.0015
NP_005161 (OMIM: 601886) achaete-scute homolog 2 [ ( 193) 201 39.8 0.004
XP_006716679 (OMIM: 609067) PREDICTED: basic helix ( 201) 200 39.7 0.0046
NP_001073983 (OMIM: 609067) basic helix-loop-helix ( 201) 200 39.7 0.0046
NP_786923 (OMIM: 609323) oligodendrocyte transcrip ( 272) 195 39.1 0.0095
>>NP_006152 (OMIM: 601726) neurogenin-1 [Homo sapiens] (237 aa)
initn: 1621 init1: 1621 opt: 1621 Z-score: 1304.4 bits: 248.7 E(85289): 6.5e-66
Smith-Waterman score: 1621; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:1-237)
10 20 30 40 50 60
pF1KB9 MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_006 MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_006 VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_006 SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA
130 140 150 160 170 180
190 200 210 220 230
pF1KB9 SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_006 SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
190 200 210 220 230
>>XP_016871769 (OMIM: 604882,610370) PREDICTED: neurogen (214 aa)
initn: 453 init1: 412 opt: 472 Z-score: 391.8 bits: 79.7 E(85289): 4.4e-15
Smith-Waterman score: 489; 53.7% identity (70.9% similar) in 175 aa overlap (42-213:34-193)
20 30 40 50 60 70
pF1KB9 LDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPAR-RGAPNISRASEVPGAQDDEQE
:.::.:.: :: : ..: : ..
XP_016 QPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRG--NCAEAEEGGCRGAPRKL
10 20 30 40 50 60
80 90 100 110 120 130
pF1KB9 RRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTK
: :: ::.: .:: : . ::::: ::::::::::::::.::::::.:::.::::.::::
XP_016 RARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK
70 80 90 100 110 120
140 150 160 170 180 190
pF1KB9 IETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGA
:::::::.::::::..:::.::..: . : :: .: ::... .:::
XP_016 IETLRFAHNYIWALTQTLRIADHSLYA------LEPP--APHCGELGSPGGSPGDWGS--
130 140 150 160 170
200 210 220 230
pF1KB9 AAASPLSDPSS--PAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
::.:. .: :::: . :::
XP_016 -LYSPVSQAGSLSPAAS--LEERPGLLGATFSACLSPGSLAFSDFL
180 190 200 210
>>NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo sap (214 aa)
initn: 453 init1: 412 opt: 472 Z-score: 391.8 bits: 79.7 E(85289): 4.4e-15
Smith-Waterman score: 489; 53.7% identity (70.9% similar) in 175 aa overlap (42-213:34-193)
20 30 40 50 60 70
pF1KB9 LDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPAR-RGAPNISRASEVPGAQDDEQE
:.::.:.: :: : ..: : ..
NP_066 QPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRG--NCAEAEEGGCRGAPRKL
10 20 30 40 50 60
80 90 100 110 120 130
pF1KB9 RRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTK
: :: ::.: .:: : . ::::: ::::::::::::::.::::::.:::.::::.::::
NP_066 RARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK
70 80 90 100 110 120
140 150 160 170 180 190
pF1KB9 IETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGA
:::::::.::::::..:::.::..: . : :: .: ::... .:::
NP_066 IETLRFAHNYIWALTQTLRIADHSLYA------LEPP--APHCGELGSPGGSPGDWGS--
130 140 150 160 170
200 210 220 230
pF1KB9 AAASPLSDPSS--PAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
::.:. .: :::: . :::
NP_066 -LYSPVSQAGSLSPAAS--LEERPGLLGATFSACLSPGSLAFSDFL
180 190 200 210
>>NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapiens] (272 aa)
initn: 519 init1: 373 opt: 437 Z-score: 362.6 bits: 74.7 E(85289): 1.8e-13
Smith-Waterman score: 437; 46.2% identity (67.2% similar) in 186 aa overlap (33-215:51-233)
10 20 30 40 50 60
pF1KB9 ARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASAS-GPPAPARRGAPNISRASEV
:: :..: :. : . . :: . : ...
NP_076 GSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CRPARL
30 40 50 60 70
70 80 90 100 110
pF1KB9 PGAQDDEQER-RRRRGRTR-VRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVL
: : ..: : :. .: ... .. ....::.:::.::::::::::::::::: ::
NP_076 LGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVL
80 90 100 110 120 130
120 130 140 150 160 170
pF1KB9 PSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSP
:.::.:.:::::::::::.::::::.:::::::. :::. : . : :: :
NP_076 PTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASA
140 150 160 170 180 190
180 190 200 210 220 230
pF1KB9 ASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
: : :.. . :: : .::: : . . .:
NP_076 A--LSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD
200 210 220 230 240 250
NP_076 KHRYAPHLPIARDCI
260 270
>>NP_067014 (OMIM: 611635) neurogenic differentiation fa (331 aa)
initn: 296 init1: 268 opt: 312 Z-score: 262.2 bits: 56.4 E(85289): 7.3e-08
Smith-Waterman score: 312; 39.0% identity (57.6% similar) in 177 aa overlap (65-231:60-226)
40 50 60 70 80 90
pF1KB9 LQQAASASGPPAPARRGAPNISRASEVPGAQDDEQERRRRRG-RTRVRSEALLHSLRRSR
.... :. .::: . . ..: :. .: .:
NP_067 EVKEEESRPGTYGMLSSLTEEHDSIEEEEEEEEDGEKPKRRGPKKKKMTKARLERFR-AR
30 40 50 60 70 80
100 110 120 130 140 150
pF1KB9 RVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTKIETLRFAYNYIWALAETLRLADQ
::::: :::.:::.:: ::: :: :.: . ::.::::::.: ::::::.:.:. . :
NP_067 RVKANARERTRMHGLNDALDNLRRVMPCYSKTQKLSKIETLRLARNYIWALSEVLE-TGQ
90 100 110 120 130 140
160 170 180 190 200
pF1KB9 GLPGGGARERLLPPQCVP-------CLP-GPPSPASDAESWGSGAAAASPLSDPSSPAAS
: : : : : :: :: : . . ::. : : .
NP_067 TPEGKGFVEMLCKGLSQPTSNLVAGCLQLGPQSVLLEKHE------DKSPICD--SAISV
150 160 170 180 190
210 220 230
pF1KB9 EDFTYR-PGDPVFSFPSLPKDLLHTTPCFIPYH
..:.:. :: : . . ::: :
NP_067 HNFNYQSPGLPSPPYGHMETHLLHLKPQVFKSLGESSFGSHLPDCSTPPYEGPLTPPLSI
200 210 220 230 240 250
>>NP_002491 (OMIM: 125853,601724,606394) neurogenic diff (356 aa)
initn: 336 init1: 278 opt: 292 Z-score: 245.9 bits: 53.4 E(85289): 5.9e-07
Smith-Waterman score: 292; 55.3% identity (77.6% similar) in 85 aa overlap (65-149:75-158)
40 50 60 70 80 90
pF1KB9 LQQAASASGPPAPARRGAPNISRASEVPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRR
.::.:. .:: . . ..: :. .. ::
NP_002 AMNAEEDSLRNGGEEEDEDEDLEEEEEEEEEDDDQKPKRRGPKKKKMTKARLERFKL-RR
50 60 70 80 90 100
100 110 120 130 140 150
pF1KB9 VKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTKIETLRFAYNYIWALAETLRLADQG
.::: :::::::.:::::: ::.:.: . ::.::::::.: ::::::.: ::
NP_002 MKANARERNRMHGLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSGKSP
110 120 130 140 150 160
160 170 180 190 200 210
pF1KB9 LPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGD
NP_002 DLVSFVQTLCKGLSQPTTNLVAGCLQLNPRTFLPEQNQDMPPHLPTASASFPVHPYSYQS
170 180 190 200 210 220
>>NP_006151 (OMIM: 601725) neurogenic differentiation fa (382 aa)
initn: 341 init1: 280 opt: 286 Z-score: 240.7 bits: 52.6 E(85289): 1.1e-06
Smith-Waterman score: 286; 54.9% identity (72.5% similar) in 91 aa overlap (60-149:89-178)
30 40 50 60 70 80
pF1KB9 EDCARLQQAASASGPPAPARRGAPNISRASEVPGAQDDEQERRRRRG-RTRVRSEALLHS
: : .. : :: ..:: . : ..: :.
NP_006 PLRGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLER
60 70 80 90 100 110
90 100 110 120 130 140
pF1KB9 LRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTKIETLRFAYNYIWALAETL
. :: ::: :::::::.:::::: ::.:.: . ::.::::::.: ::::::.: :
NP_006 -SKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEIL
120 130 140 150 160 170
150 160 170 180 190 200
pF1KB9 RLADQGLPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGAAAASPLSDPSSPAASEDF
:
NP_006 RSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMH
180 190 200 210 220 230
>>NP_073565 (OMIM: 611513) neurogenic differentiation fa (337 aa)
initn: 297 init1: 268 opt: 281 Z-score: 237.5 bits: 51.8 E(85289): 1.7e-06
Smith-Waterman score: 284; 40.2% identity (56.1% similar) in 164 aa overlap (65-213:67-221)
40 50 60 70 80 90
pF1KB9 LQQAASASGPPAPARRGAPNISRASEVPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRR
..::. :::: . .. : . ::
NP_073 FSKQIVLRGKSIKRAPGEETEKEEEEEDREEEDENGLPRRRGLRKKKTTKLRLERVKFRR
40 50 60 70 80 90
100 110 120 130 140 150
pF1KB9 VKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTKIETLRFAYNYIWALAETLRLADQG
.:: :::::::.:: ::: ::.:.: . ::.::::::.: ::::::.: ::.
NP_073 QEANARERNRMHGLNDALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRI----
100 110 120 130 140 150
160 170 180 190
pF1KB9 LPGGGARERLLPPQCVPCLPGPPSPASD---------AESW--GSGAAAA----SPLSDP
: : :: : : .:... :.:. :.:. :: :: :
NP_073 ----GKRPDLLTFVQNLC-KGLSQPTTNLVAGCLQLNARSFLMGQGGEAAHHTRSPYSTF
160 170 180 190 200
200 210 220 230
pF1KB9 SSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
: : ..: ::
NP_073 YPPYHSPELTTPPGHGTLDNSKSMKPYNYCSAYESFYESTSPECASPQFEGPLSPPPINY
210 220 230 240 250 260
>>NP_005797 (OMIM: 606386) oligodendrocyte transcription (323 aa)
initn: 212 init1: 122 opt: 244 Z-score: 208.3 bits: 46.3 E(85289): 7.3e-05
Smith-Waterman score: 244; 29.2% identity (53.0% similar) in 236 aa overlap (17-237:29-258)
10 20 30 40
pF1KB9 MPARLETCISDLDCASSSGSDLSGFL---TDEEDC-----ARLQQAAS
:::: ..: . :: :.:. : .
NP_005 MDSDASLVSSRPSSPEPDDLFLPARSKGSSGSAFTGGTVSSSTPSDCPPELSAELRGAMG
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB9 ASGPPAPARRGAPNISRASEVPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDR
..: . :. ... .: ... . . . .: :..:: .: :.:
NP_005 SAGAHPGDKLGGSGFKSSSSSTSSSTSSAAASSTKKDKKQMTEPELQQLR----LKINSR
70 80 90 100 110
110 120 130 140 150
pF1KB9 ERNRMHNLNAALDALRSVLPSF--PDDTKLTKIETLRFAYNYIWALAETL----RLADQG
::.:::.:: :.:.:: :.: :. ::.:: :: .: ::: :...: ::...
NP_005 ERKRMHDLNIAMDGLREVMPYAHGPSVRKLSKIATLLLARNYILMLTNSLEEMKRLVSEI
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB9 LPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGAAAASP-LSDPSSPAASEDFTYRPG
:: . . : : . : ::. :. ... :: : . : : :. . .
NP_005 Y--GGHHAGFHPSACGGLAHSAPLPAATAHPAAAAHAAHHPAVHHPILPPAAAAAAAAAA
180 190 200 210 220 230
220 230
pF1KB9 DPVFSFPSLPKDLLHTTPCFIPYH
. : ::: . : .. . : :
NP_005 AAAVSSASLPGSGLPSVGSIRPPHGLLKSPSAAAAAPLGGGGGGSGASGGFQHWGGMPCP
240 250 260 270 280 290
>>XP_005260965 (OMIM: 606386) PREDICTED: oligodendrocyte (323 aa)
initn: 212 init1: 122 opt: 244 Z-score: 208.3 bits: 46.3 E(85289): 7.3e-05
Smith-Waterman score: 244; 29.2% identity (53.0% similar) in 236 aa overlap (17-237:29-258)
10 20 30 40
pF1KB9 MPARLETCISDLDCASSSGSDLSGFL---TDEEDC-----ARLQQAAS
:::: ..: . :: :.:. : .
XP_005 MDSDASLVSSRPSSPEPDDLFLPARSKGSSGSAFTGGTVSSSTPSDCPPELSAELRGAMG
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB9 ASGPPAPARRGAPNISRASEVPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDR
..: . :. ... .: ... . . . .: :..:: .: :.:
XP_005 SAGAHPGDKLGGSGFKSSSSSTSSSTSSAAASSTKKDKKQMTEPELQQLR----LKINSR
70 80 90 100 110
110 120 130 140 150
pF1KB9 ERNRMHNLNAALDALRSVLPSF--PDDTKLTKIETLRFAYNYIWALAETL----RLADQG
::.:::.:: :.:.:: :.: :. ::.:: :: .: ::: :...: ::...
XP_005 ERKRMHDLNIAMDGLREVMPYAHGPSVRKLSKIATLLLARNYILMLTNSLEEMKRLVSEI
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB9 LPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGAAAASP-LSDPSSPAASEDFTYRPG
:: . . : : . : ::. :. ... :: : . : : :. . .
XP_005 Y--GGHHAGFHPSACGGLAHSAPLPAATAHPAAAAHAAHHPAVHHPILPPAAAAAAAAAA
180 190 200 210 220 230
220 230
pF1KB9 DPVFSFPSLPKDLLHTTPCFIPYH
. : ::: . : .. . : :
XP_005 AAAVSSASLPGSGLPSVGSIRPPHGLLKSPSAAAAAPLGGGGGGSGASGGFQHWGGMPCP
240 250 260 270 280 290
237 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:49:44 2016 done: Fri Nov 4 17:49:45 2016
Total Scan time: 7.020 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]