FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9631, 272 aa 1>>>pF1KB9631 272 - 272 aa - 272 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4093+/-0.000656; mu= 8.5281+/- 0.040 mean_var=162.3655+/-32.518, 0's: 0 Z-trim(117.2): 43 B-trim: 0 in 0/54 Lambda= 0.100653 statistics sampled from 17888 (17932) to 17888 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.551), width: 16 Scan time: 3.120 The best scores are: opt bits E(32554) CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 ( 272) 1811 273.6 1.1e-73 CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 ( 237) 437 74.0 1.1e-13 CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 ( 214) 414 70.6 1e-12 >>CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 (272 aa) initn: 1811 init1: 1811 opt: 1811 Z-score: 1436.6 bits: 273.6 E(32554): 1.1e-73 Smith-Waterman score: 1811; 100.0% identity (100.0% similar) in 272 aa overlap (1-272:1-272) 10 20 30 40 50 60 pF1KB9 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP 190 200 210 220 230 240 250 260 270 pF1KB9 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI :::::::::::::::::::::::::::::::: CCDS36 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI 250 260 270 >>CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 (237 aa) initn: 476 init1: 373 opt: 437 Z-score: 359.1 bits: 74.0 E(32554): 1.1e-13 Smith-Waterman score: 437; 46.2% identity (67.7% similar) in 186 aa overlap (51-233:33-215) 30 40 50 60 70 pF1KB9 GSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CRPARL :: :..: :. : . . :: . : ... CCDS41 ARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASAS-GPPAPARRGAPNISRASEV 10 20 30 40 50 60 80 90 100 110 120 130 pF1KB9 LGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVL : : ..: : :. .: ... .. ....::.:::.::::::::::::::::: :: CCDS41 PGAQDDEQER-RRRRGRTR-VRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVL 70 80 90 100 110 140 150 160 170 180 190 pF1KB9 PTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASA :.::.:.:::::::::::.::::::.:::::::. :::. : . : :: : CCDS41 PSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSP 120 130 140 150 160 170 200 210 220 230 240 250 pF1KB9 ALS--SSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD : . : :.. . :: : .::: : . . .: CCDS41 ASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH 180 190 200 210 220 230 260 270 pF1KB9 KHRYAPHLPIARDCI >>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 (214 aa) initn: 423 init1: 372 opt: 414 Z-score: 341.7 bits: 70.6 E(32554): 1e-12 Smith-Waterman score: 427; 48.4% identity (64.7% similar) in 190 aa overlap (64-247:43-212) 40 50 60 70 80 90 pF1KB9 SSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CR--PARLLGLVHDCKRRP ::. : . :: :: : .: . :: CCDS31 VTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRA------RRG 20 30 40 50 60 100 110 120 130 140 150 pF1KB9 SRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTK .:.: :. .... ...:: :::.::::::::::.:::::: ::::::.:::::: CCDS31 GRSRP-----KSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK 70 80 90 100 110 120 160 170 180 190 200 210 pF1KB9 IETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASAALSSSGDSPSP :::::::::::::::.:::.::: . :. .: : :::: : :: : CCDS31 IETLRFAHNYIWALTQTLRIADHSLYALEP-PAPHCGE--LGSPGG------SPGDWGSL 130 140 150 160 170 220 230 240 250 260 pF1KB9 ASTWSCTNSPAPSSSVSSNST---SPYSCTLSPASPAGSDMDYWQPPPPDKHRYAPHLPI : : ..: .:..:. . .: :::.: : :: CCDS31 YSPVSQAGSLSPAASLEERPGLLGATFSACLSPGSLAFSDFL 180 190 200 210 270 pF1KB9 ARDCI 272 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:50:59 2016 done: Fri Nov 4 17:50:59 2016 Total Scan time: 3.120 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]