FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9627, 382 aa
  1>>>pF1KB9627 382 - 382 aa - 382 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 9.2658+/-0.000893; mu= 3.3527+/- 0.055
 mean_var=293.7861+/-58.818, 0's: 0 Z-trim(117.5): 27  B-trim: 663 in 1/54
 Lambda= 0.074827
 statistics sampled from 18256 (18282) to 18256 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.833), E-opt: 0.2 (0.562), width: 16
 Scan time: 2.810

The best scores are:                                      opt  bits E(32554)
CCDS11338.1 NEUROD2 gene_id:4761|Hs108|chr17      ( 382) 2634 296.9 2.1e-80
CCDS2283.1 NEUROD1 gene_id:4760|Hs108|chr2        ( 356) 1087 129.8 3.7e-30
CCDS5434.1 NEUROD6 gene_id:63974|Hs108|chr7       ( 337)  788  97.5 1.8e-20
CCDS8886.1 NEUROD4 gene_id:58158|Hs108|chr12      ( 331)  653  82.9 4.4e-16

>>CCDS11338.1 NEUROD2 gene_id:4761|Hs108|chr17 (382 aa)
 initn: 2634 init1: 2634 opt: 2634 Z-score: 1557.1 bits: 296.9 E(32554): 2.1e-80
Smith-Waterman score: 2634; 100.0% identity (100.0% similar) in 382 aa overlap (1-382:1-382)

               10        20        30        40        50        60
pF1KB9 MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 RGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLERSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 RGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLERSK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 LRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 
KRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 KRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 YPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAYETLYAAAGGGGASPDYNSSEYEGPLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAYETLYAAAGGGGASPDYNSSEYEGPLS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 PPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPGSRPTGHGLVFGSSAVRGGVHSENLLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPGSRPTGHGLVFGSSAVRGGVHSENLLS 310 320 330 340 350 360 370 380 pF1KB9 YDMHLHHDRGPMYEELNAFFHN :::::::::::::::::::::: CCDS11 YDMHLHHDRGPMYEELNAFFHN 370 380 >>CCDS2283.1 NEUROD1 gene_id:4760|Hs108|chr2 (356 aa) initn: 1073 init1: 809 opt: 1087 Z-score: 655.0 bits: 129.8 E(32554): 3.7e-30 Smith-Waterman score: 1087; 51.2% identity (69.5% similar) in 361 aa overlap (31-382:6-356) 10 20 30 40 50 60 pF1KB9 MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPL :..: : : .: :. ... CCDS22 MTKSYSESGLMGEPQPQGP-PSWTDECLSSQDEEH 10 20 30 70 80 90 100 110 pF1KB9 RGEEGTEATLAEVKEEGEL--GGEEEEE----EEEEEGLDEAEGERPKKRGPKKRKMTKA .... . . :: : :::::.: ::::: .: . ..::.:::::.::::: CCDS22 EADKKEDDLETMNAEEDSLRNGGEEEDEDEDLEEEEEEEEEDDDQKPKRRGPKKKKMTKA 40 50 60 70 80 90 120 130 140 150 160 170 pF1KB9 RLERSKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALS :::: ::::.::::::::::: :::::::::::::::::::::::::::::::::::::: CCDS22 RLERFKLRRMKANARERNRMHGLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALS 100 110 120 130 140 150 180 190 200 210 220 230 pF1KB9 EILRSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPF ::::::: :::::.::::::::::::::::::::::: :.:: ::. : .. ... : CCDS22 EILRSGKSPDLVSFVQTLCKGLSQPTTNLVAGCLQLNPRTFLPEQNQDMPPHLPTASASF 160 170 180 190 200 210 240 250 260 270 280 290 pF1KB9 AMHPYPYPCSRLAGAQCQAAGGLGGGAAHALRT--HGYCAAYETLYAAAGGGGASPDYNS .::: : : : . .. 
. .. :.: :: : .. . .::. CCDS22 PVHPYSYQSP---GLPSPPYGTMDSSHVFHVKPPPHAYSAALEPFFESPLTDCTSPS--- 220 230 240 250 260 300 310 320 330 340 350 pF1KB9 SEYEGPLSPPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPGSRPTGHGLVF-GSSAVRG ..::::::: .:::::.:.. : . ::.: ..::: : . .:: .: :..: : CCDS22 --FDGPLSPPLSINGNFSFKHEPSAEFEKNYAFTMHYPAATLAGAQSHGSIFSGTAAPRC 270 280 290 300 310 320 360 370 380 pF1KB9 GVHSENLLSYDMHLHHDRGPMYEELNAFFHN . .:..:.: : ::.: : .:::.::. CCDS22 EIPIDNIMSFDSHSHHER-VMSAQLNAIFHD 330 340 350 >>CCDS5434.1 NEUROD6 gene_id:63974|Hs108|chr7 (337 aa) initn: 1037 init1: 696 opt: 788 Z-score: 480.8 bits: 97.5 E(32554): 1.8e-20 Smith-Waterman score: 1014; 52.0% identity (72.4% similar) in 333 aa overlap (51-382:34-337) 30 40 50 60 70 80 pF1KB9 WGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPLRGEEGTEATLAEVKEEGELG : .: . :::. .: :...: : CCDS54 LPFDESVVMPESQMCRKFSRECEDQKQIKKPESFSKQIVLRGKSIKRAPGEETEKEEE-- 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 GEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLERSKLRRQKANARERNRMHDLNAA ::..:::.:.:: :..:: .:.: :: :::: :.:::.:::::::::: :: : CCDS54 -EEDREEEDENGL-------PRRRGLRKKKTTKLRLERVKFRRQEANARERNRMHGLNDA 70 80 90 100 110 150 160 170 180 190 200 pF1KB9 LDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSGKRPDLVSYVQTLCKGLSQPT :::::::::::::::::::::::::::::::::::::: ::::::...::.::::::::: CCDS54 LDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRIGKRPDLLTFVQNLCKGLSQPT 120 130 140 150 160 170 210 220 230 240 250 260 pF1KB9 TNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQAAGGLGGG :::::::::::.:.:: ::...: : . .:.. :: .:. . : : . CCDS54 TNLVAGCLQLNARSFLMGQGGEAA---HHTRSPYSTFYPPYHSPELTTP--PGHGTLDN- 180 190 200 210 220 270 280 290 300 310 pF1KB9 AAHALRTHGYCAAYETLYAAAGGGGASPDYNSSEYEGPLSPP-LCLNGNFSLKQDSSPDH ..... ..::.:::..: .. ::. : ..::::::: . :: :::::. . :. CCDS54 -SKSMKPYNYCSAYESFYEST-----SPECASPQFEGPLSPPPINYNGIFSLKQEETLDY 230 240 250 260 270 280 320 330 340 350 360 370 pF1KB9 EKSYHYSMHYSALPGSRPTGHGLVFGSSAVRGGVHSENLLSYDMHLHHDRGPMYEELNAF :.:.:.::: :.: : :.: .: : . ... . ::.::. . 
: .:::: CCDS54 GKNYNYGMHYCAVPPRGPLGQGAMF-----R--LPTDSHFPYDLHLRSQSLTMQDELNAV 290 300 310 320 330 380 pF1KB9 FHN ::: CCDS54 FHN >>CCDS8886.1 NEUROD4 gene_id:58158|Hs108|chr12 (331 aa) initn: 924 init1: 642 opt: 653 Z-score: 402.2 bits: 82.9 E(32554): 4.4e-16 Smith-Waterman score: 849; 48.4% identity (67.4% similar) in 316 aa overlap (65-380:39-329) 40 50 60 70 80 90 pF1KB9 DAPPPPPPAPGPGAPGPARAAKPVPLRGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLD :: . :. . :: . . ::::::::.: CCDS88 KEMGELVNTPSWMDKGLGSQNEVKEEESRPGTYGMLSSLTEEHD--SIEEEEEEEEDG-- 10 20 30 40 50 60 100 110 120 130 140 150 pF1KB9 EAEGERPKKRGPKKRKMTKARLERSKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKT :.::.:::::.::::::::: . :: :::::::.::: :: ::::::.:.:::::: CCDS88 ----EKPKRRGPKKKKMTKARLERFRARRVKANARERTRMHGLNDALDNLRRVMPCYSKT 70 80 90 100 110 120 160 170 180 190 200 210 pF1KB9 QKLSKIETLRLAKNYIWALSEILRSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRN ::::::::::::.::::::::.:..:. :. ..:. :::::::::.:::::::::. .. CCDS88 QKLSKIETLRLARNYIWALSEVLETGQTPEGKGFVEMLCKGLSQPTSNLVAGCLQLGPQS 130 140 150 160 170 180 220 230 240 250 260 270 pF1KB9 FLTEQGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAY : :. : . . ...: . : : . : .: : : .. CCDS88 VLLEKHED---KSPICDSAISVHNFNYQSPGLPSPP------YGHMETHLL--HLKPQVF 190 200 210 220 280 290 300 310 320 330 pF1KB9 ETLYAAAGGGGASPDYNSSEYEGPLSPPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPG ..: . .. :. :: .. :::::.::: ..::::::::.::: :::: . :: . CCDS88 KSL-GESSFGSHLPDCSTPPYEGPLTPPLSISGNFSLKQDGSPDLEKSYSFMPHYPSSSL 230 240 250 260 270 280 340 350 360 370 380 pF1KB9 SRPTGHGLVFGSSAVRGGVHSENLLSYDMHLHHDRGPMYEELNAFFHN : :. : ... : : . .::: . :: : .::. : CCDS88 SSGHVHSTPFQAGTPRYDVPID--MSYDSYPHHGIGT---QLNTVFTE 290 300 310 320 330 382 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:48:04 2016 done: Fri Nov 4 17:48:05 2016 Total Scan time: 2.810 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]