FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9631, 272 aa
  1>>>pF1KB9631 272 - 272 aa - 272 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.6985+/-0.00026; mu= 6.7252+/- 0.016
 mean_var=173.2169+/-34.990, 0's: 0 Z-trim(124.9): 61 B-trim: 0 in 0/58
 Lambda= 0.097449
 statistics sampled from 47357 (47421) to 47357 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.836), E-opt: 0.2 (0.556), width: 16
 Scan time: 9.260

The best scores are:                                       opt  bits E(85289)
NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapien ( 272) 1811 265.4 7.8e-71
NP_006152 (OMIM: 601726) neurogenin-1 [Homo sapien ( 237)  437  72.2 9.9e-13
NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo ( 214)  414  69.0 8.6e-12
XP_016871769 (OMIM: 604882,610370) PREDICTED: neur ( 214)  414  69.0 8.6e-12
NP_002491 (OMIM: 125853,601724,606394) neurogenic  ( 356)  297  52.7 1.1e-06
NP_006151 (OMIM: 601725) neurogenic differentiatio ( 382)  295  52.4 1.5e-06
NP_073565 (OMIM: 611513) neurogenic differentiatio ( 337)  268  48.6 1.8e-05
NP_067014 (OMIM: 611635) neurogenic differentiatio ( 331)  267  48.4 2e-05
NP_005163 (OMIM: 601461) protein atonal homolog 1  ( 354)  262  47.8 3.4e-05
NP_835455 (OMIM: 607194,609069,615935) pancreas tr ( 328)  236  44.1 0.00041
NP_803238 (OMIM: 608606) class A basic helix-loop- ( 189)  226  42.5 0.00071
NP_005797 (OMIM: 606386) oligodendrocyte transcrip ( 323)  228  43.0 0.00088
XP_005260965 (OMIM: 606386) PREDICTED: oligodendro ( 323)  228  43.0 0.00088
NP_786923 (OMIM: 609323) oligodendrocyte transcrip ( 272)  224  42.3 0.0011
NP_001073983 (OMIM: 609067) basic helix-loop-helix ( 201)  219  41.5 0.0015
XP_006716679 (OMIM: 609067) PREDICTED: basic helix ( 201)  219  41.5 0.0015
NP_689627 (OMIM: 613483) class E basic helix-loop- ( 381)  211  40.6 0.0052
NP_660161 (OMIM: 609875) protein
atonal homolog 7 ( 152) 203 39.2 0.0057 NP_005161 (OMIM: 601886) achaete-scute homolog 2 [ ( 193) 204 39.4 0.0062 >>NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapiens] (272 aa) initn: 1811 init1: 1811 opt: 1811 Z-score: 1392.7 bits: 265.4 E(85289): 7.8e-71 Smith-Waterman score: 1811; 100.0% identity (100.0% similar) in 272 aa overlap (1-272:1-272) 10 20 30 40 50 60 pF1KB9 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_076 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_076 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_076 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_076 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP 190 200 210 220 230 240 250 260 270 pF1KB9 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI :::::::::::::::::::::::::::::::: NP_076 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI 250 260 270 >>NP_006152 (OMIM: 601726) neurogenin-1 [Homo sapiens] (237 aa) initn: 476 init1: 373 opt: 437 Z-score: 349.5 bits: 72.2 E(85289): 9.9e-13 Smith-Waterman score: 437; 46.2% identity (67.7% similar) in 186 aa overlap (51-233:33-215) 30 40 50 60 70 pF1KB9 GSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CRPARL :: :..: :. : . . :: . : ... NP_006 ARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASAS-GPPAPARRGAPNISRASEV 10 20 30 40 50 60 80 90 100 110 120 130 pF1KB9 LGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVL : : ..: : :. .: ... .. 
....::.:::.::::::::::::::::: :: NP_006 PGAQDDEQER-RRRRGRTR-VRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVL 70 80 90 100 110 140 150 160 170 180 190 pF1KB9 PTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASA :.::.:.:::::::::::.::::::.:::::::. :::. : . : :: : NP_006 PSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSP 120 130 140 150 160 170 200 210 220 230 240 250 pF1KB9 ALS--SSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD : . : :.. . :: : .::: : . . .: NP_006 ASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH 180 190 200 210 220 230 260 270 pF1KB9 KHRYAPHLPIARDCI >>NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo sap (214 aa) initn: 423 init1: 372 opt: 414 Z-score: 332.6 bits: 69.0 E(85289): 8.6e-12 Smith-Waterman score: 427; 48.4% identity (64.7% similar) in 190 aa overlap (64-247:43-212) 40 50 60 70 80 90 pF1KB9 SSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CR--PARLLGLVHDCKRRP ::. : . :: :: : .: . :: NP_066 VTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRA------RRG 20 30 40 50 60 100 110 120 130 140 150 pF1KB9 SRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTK .:.: :. .... ...:: :::.::::::::::.:::::: ::::::.:::::: NP_066 GRSRP-----KSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK 70 80 90 100 110 120 160 170 180 190 200 210 pF1KB9 IETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASAALSSSGDSPSP :::::::::::::::.:::.::: . :. .: : :::: : :: : NP_066 IETLRFAHNYIWALTQTLRIADHSLYALEP-PAPHCGE--LGSPGG------SPGDWGSL 130 140 150 160 170 220 230 240 250 260 pF1KB9 ASTWSCTNSPAPSSSVSSNST---SPYSCTLSPASPAGSDMDYWQPPPPDKHRYAPHLPI : : ..: .:..:. . 
.: :::.: : :: NP_066 YSPVSQAGSLSPAASLEERPGLLGATFSACLSPGSLAFSDFL 180 190 200 210 270 pF1KB9 ARDCI >>XP_016871769 (OMIM: 604882,610370) PREDICTED: neurogen (214 aa) initn: 423 init1: 372 opt: 414 Z-score: 332.6 bits: 69.0 E(85289): 8.6e-12 Smith-Waterman score: 427; 48.4% identity (64.7% similar) in 190 aa overlap (64-247:43-212) 40 50 60 70 80 90 pF1KB9 SSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CR--PARLLGLVHDCKRRP ::. : . :: :: : .: . :: XP_016 VTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRA------RRG 20 30 40 50 60 100 110 120 130 140 150 pF1KB9 SRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTK .:.: :. .... ...:: :::.::::::::::.:::::: ::::::.:::::: XP_016 GRSRP-----KSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK 70 80 90 100 110 120 160 170 180 190 200 210 pF1KB9 IETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASAALSSSGDSPSP :::::::::::::::.:::.::: . :. .: : :::: : :: : XP_016 IETLRFAHNYIWALTQTLRIADHSLYALEP-PAPHCGE--LGSPGG------SPGDWGSL 130 140 150 160 170 220 230 240 250 260 pF1KB9 ASTWSCTNSPAPSSSVSSNST---SPYSCTLSPASPAGSDMDYWQPPPPDKHRYAPHLPI : : ..: .:..:. . .: :::.: : :: XP_016 YSPVSQAGSLSPAASLEERPGLLGATFSACLSPGSLAFSDFL 180 190 200 210 270 pF1KB9 ARDCI >>NP_002491 (OMIM: 125853,601724,606394) neurogenic diff (356 aa) initn: 326 init1: 271 opt: 297 Z-score: 240.8 bits: 52.7 E(85289): 1.1e-06 Smith-Waterman score: 297; 38.0% identity (57.8% similar) in 192 aa overlap (85-265:76-253) 60 70 80 90 100 110 pF1KB9 RGAEAGQGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRR : ..:.: :. .. : ..:.: :: NP_002 MNAEEDSLRNGGEEEDEDEDLEEEEEEEEEDDDQKPKR-RGPKKKKMTKARLERFK-LRR 50 60 70 80 90 100 120 130 140 150 160 170 pF1KB9 LKANNRERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHC .::: :::::::.:::::: ::.:.: . . ::.::::::.:.::::::.: :: NP_002 MKANARERNRMHGLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILR----- 110 120 130 140 150 180 190 200 210 220 pF1KB9 GGGGGGLPGALFSEAVLLSPGGA--SAALSSSGDSPSPASTWSCTNSPAPS---SSVSSN .: : : : . : : . .. : .. . .: . :. : .. 
.: NP_002 ---SGKSPD-LVSFVQTLCKGLSQPTTNLVAGCLQLNPRTFLPEQNQDMPPHLPTASASF 160 170 180 190 200 210 230 240 250 260 270 pF1KB9 STSPYSCTLSPA--SPAGSDMD----YWQPPPPDKHRYAPHLPIARDCI . ::: ::. :: . :: . ::: : :. : NP_002 PVHPYS-YQSPGLPSPPYGTMDSSHVFHVKPPP--HAYSAALEPFFESPLTDCTSPSFDG 220 230 240 250 260 270 NP_002 PLSPPLSINGNFSFKHEPSAEFEKNYAFTMHYPAATLAGAQSHGSIFSGTAAPRCEIPID 280 290 300 310 320 330 >>NP_006151 (OMIM: 601725) neurogenic differentiation fa (382 aa) initn: 351 init1: 268 opt: 295 Z-score: 238.8 bits: 52.4 E(85289): 1.5e-06 Smith-Waterman score: 295; 44.4% identity (64.4% similar) in 135 aa overlap (45-169:46-178) 20 30 40 50 60 pF1KB9 DVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQ-----RGAEAGQGARG---- ::: : :: :: :. ... . NP_006 PKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPLRGEEGTEATLAEVKE 20 30 40 50 60 70 70 80 90 100 110 120 pF1KB9 -GVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNR : .: : . . :: . .::.. :. .. : ..: .: :: ::: ::::: NP_006 EGELGGEEEEEEEEEEGLDEAEGERPKK-RGPKKRKMTKARLER-SKLRRQKANARERNR 80 90 100 110 120 130 130 140 150 160 170 180 pF1KB9 MHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGA ::.:::::: ::.:.: . . ::.::::::.:.::::::.: :: NP_006 MHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSGKRPDLVSYVQTLC 140 150 160 170 180 190 190 200 210 220 230 240 pF1KB9 LFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPA NP_006 KGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQA 200 210 220 230 240 250 >>NP_073565 (OMIM: 611513) neurogenic differentiation fa (337 aa) initn: 302 init1: 255 opt: 268 Z-score: 219.1 bits: 48.6 E(85289): 1.8e-05 Smith-Waterman score: 290; 34.6% identity (57.1% similar) in 191 aa overlap (92-256:75-264) 70 80 90 100 110 120 pF1KB9 GARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRE : :.. . : ..:.: :: .:: :: NP_073 GKSIKRAPGEETEKEEEEEDREEEDENGLPRRRGLRKKKTTKLRLERVK-FRRQEANARE 50 60 70 80 90 100 130 140 150 160 170 pF1KB9 RNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADH-------- :::::.:: ::: ::.:.: . . 
::.::::::.:.::::::.: ::.. . NP_073 RNRMHGLNDALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRIGKRPDLLTFVQ 110 120 130 140 150 160 180 190 200 210 220 pF1KB9 --CGGGGGGLPG------ALFSEAVLLSPGGASAALSSSGDSP--SPASTWSCTNSPAPS : : . . : ... :.. :: .: . : : : . :. :. . NP_073 NLCKGLSQPTTNLVAGCLQLNARSFLMGQGGEAAHHTRSPYSTFYPPYHSPELTTPPGHG 170 180 190 200 210 220 230 240 250 260 270 pF1KB9 SSVSSNSTSPYS-CT-----LSPASPAGSDMDYWQP--PPPDKHRYAPHLPIARDCI . .:.: .::. :. .:: .. .. : ::: NP_073 TLDNSKSMKPYNYCSAYESFYESTSPECASPQFEGPLSPPPINYNGIFSLKQEETLDYGK 230 240 250 260 270 280 NP_073 NYNYGMHYCAVPPRGPLGQGAMFRLPTDSHFPYDLHLRSQSLTMQDELNAVFHN 290 300 310 320 330 >>NP_067014 (OMIM: 611635) neurogenic differentiation fa (331 aa) initn: 331 init1: 245 opt: 267 Z-score: 218.4 bits: 48.4 E(85289): 2e-05 Smith-Waterman score: 267; 33.3% identity (59.4% similar) in 192 aa overlap (89-271:66-244) 60 70 80 90 100 110 pF1KB9 AGQGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKAN .:.: :. .. : ..:.. .::.::: NP_067 SRPGTYGMLSSLTEEHDSIEEEEEEEEDGEKPKR-RGPKKKKMTKARLERFR-ARRVKAN 40 50 60 70 80 90 120 130 140 150 160 170 pF1KB9 NRERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGG :::.:::.:: ::: ::.:.: . . ::.::::::.:.::::::.:.:. .. : : NP_067 ARERTRMHGLNDALDNLRRVMPCYSKTQKLSKIETLRLARNYIWALSEVLETGQTPEGKG 100 110 120 130 140 150 180 190 200 210 220 230 pF1KB9 GGLPGALFSEAVLLSPGGASAALSSSGDSPSPASTW---SCTNSPAPSSSVSSNSTSPYS : : . . . .. : .. . .: :. .:: .:..: .. . : NP_067 -------FVEMLCKGLSQPTSNLVAGCLQLGPQSVLLEKHEDKSPICDSAISVHNFNYQS 160 170 180 190 200 240 250 260 270 pF1KB9 CTLSPASPAG---SDMDYWQPP---PPDKHRYAPHLPIARDCI : :. : : . . . .: . .. 
::: :: NP_067 PGL-PSPPYGHMETHLLHLKPQVFKSLGESSFGSHLP---DCSTPPYEGPLTPPLSISGN 210 220 230 240 250 260 NP_067 FSLKQDGSPDLEKSYSFMPHYPSSSLSSGHVHSTPFQAGTPRYDVPIDMSYDSYPHHGIG 270 280 290 300 310 320 >>NP_005163 (OMIM: 601461) protein atonal homolog 1 [Hom (354 aa) initn: 262 init1: 209 opt: 262 Z-score: 214.2 bits: 47.8 E(85289): 3.4e-05 Smith-Waterman score: 276; 36.0% identity (56.5% similar) in 214 aa overlap (24-225:84-282) 10 20 30 40 50 pF1KB9 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARR :: :.: : : ..: : . .:: NP_005 SLLDSTDPRAWLAPTLQGICTARAAQYLLHSPELGA-----SEAAAPRDEVDGRGELVRR 60 70 80 90 100 60 70 80 90 100 110 pF1KB9 QRGAEAGQGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTR . :. ... . : : . . :. :.: : . :: :: : .. :. ..: : NP_005 SSGGASSSKSPGPVKVREQLCKLKG--GVVVD-ELGCSRQRAPS-----SKQVNGVQKQR 110 120 130 140 150 160 120 130 140 150 160 170 pF1KB9 RLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADH :: :: ::: :::.:: :.: ::.:.:.: .: ::.: :::..:. :: ::.: :. . NP_005 RLAANARERRRMHGLNHAFDQLRNVIPSFNNDKKLSKYETLQMAQIYINALSELLQTPS- 170 180 190 200 210 180 190 200 210 220 pF1KB9 CGGGGGGLPGA--------LFSEAVLLSPGG---ASAALSSSGDSPSPASTWSC-TNSPA :: : : : . : . .: :..: ..:: : :. :: : : NP_005 -GGEQPPPPPASCKSDHHHLRTAASYEGGAGNATAAGAQQASGGSQRPTPPGSCRTRFSA 220 230 240 250 260 270 230 240 250 260 270 pF1KB9 PSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI :.:. NP_005 PASAGGYSVQLDALHFSTFEDSALTAMMAQKNLSPSLPGSILQPVQEENSKTSPRSHRSD 280 290 300 310 320 330 >>NP_835455 (OMIM: 607194,609069,615935) pancreas transc (328 aa) initn: 262 init1: 171 opt: 236 Z-score: 194.9 bits: 44.1 E(85289): 0.00041 Smith-Waterman score: 244; 30.8% identity (52.3% similar) in 266 aa overlap (2-254:52-275) 10 20 30 pF1KB9 MFVKSETLELKEEEDVLVLLGSASPALA-AL :.. . : .. . .:: : :: :: NP_835 DEDDFFTDQSSRDPLEDGDELLADEQAEVEFLSHQLHEYCYRDGACLLLQPAPPAAPLAL 30 40 50 60 70 80 40 50 60 70 80 pF1KB9 TPLSSSADEEEEEEPGASGGARRQRGAEAG-----QGARGGV----AAGAEGCRP-ARLL .: ::.. : .. :..:: . :: : :. . 
::: : ::: NP_835 APPSSGGLGEPDD--GGGGGYCCETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLR 90 100 110 120 130 90 100 110 120 130 140 pF1KB9 GLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLP :: . : :..: . ... .... :. :: ::: ::...: :...:: .: NP_835 GL-------SGAAAAAARRRRRVRSEAELQQLRQ-AANVRERRRMQSINDAFEGLRSHIP 140 150 160 170 180 190 150 160 170 180 190 pF1KB9 TFPEDAKLTKIETLRFAHNYIWALTETLRLADHC--GGGGGGLPGALFSEAVLLSPGGAS :.: . .:.:..:::.: .:: :.: .. :: :::.:: : :::.. NP_835 TLPYEKRLSKVDTLRLAIGYINFLSELVQ-ADLPLRGGGAGGCGG----------PGGGG 200 210 220 230 240 200 210 220 230 240 250 pF1KB9 AALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPDK .::::. :.. . : . ::. :: :: :: NP_835 ---RLGGDSPG------------------SQAQKVIICHRGTRSPSPSDPDYGLPPLAGH 250 260 270 260 270 pF1KB9 HRYAPHLPIARDCI NP_835 SLSWTDEKQLKEQNIIRTAKVWTPEDPRKLNSKSSFNNIENEPPFEFVS 280 290 300 310 320 272 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:50:59 2016 done: Fri Nov 4 17:51:01 2016 Total Scan time: 9.260 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]