FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9632, 214 aa 1>>>pF1KB9632 214 - 214 aa - 214 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0454+/-0.000632; mu= 13.5465+/- 0.039 mean_var=123.3933+/-24.393, 0's: 0 Z-trim(116.2): 47 B-trim: 0 in 0/52 Lambda= 0.115459 statistics sampled from 16699 (16747) to 16699 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.514), width: 16 Scan time: 2.670 The best scores are: opt bits E(32554) CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 ( 214) 1442 249.8 9.4e-67 CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 ( 237) 474 88.6 3.5e-18 CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 ( 272) 414 78.7 3.9e-15 >>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 (214 aa) initn: 1442 init1: 1442 opt: 1442 Z-score: 1311.9 bits: 249.8 E(32554): 9.4e-67 Smith-Waterman score: 1442; 99.5% identity (99.5% similar) in 214 aa overlap (1-214:1-214) 10 20 30 40 50 60 pF1KB9 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 LRARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 LRARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 KIETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGSLYSPVSQAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 KIETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGSLYSPVSQAG 130 140 150 160 170 180 190 200 210 pF1KB9 SLSPAASLEERPGLLGATSSACLSPGSLAFSDFL :::::::::::::::::: ::::::::::::::: CCDS31 SLSPAASLEERPGLLGATFSACLSPGSLAFSDFL 190 200 210 >>CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 (237 aa) initn: 442 init1: 412 opt: 474 Z-score: 439.9 bits: 88.6 E(32554): 3.5e-18 Smith-Waterman score: 489; 53.7% identity (70.9% similar) in 175 aa overlap (34-193:42-213) 10 20 30 40 50 60 pF1KB9 QPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRG--NCAEAEEGGCRGAPRKL :.::.:.: :: : ..: : .. CCDS41 LDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPAR-RGAPNISRASEVPGAQDDEQE 20 30 40 50 60 70 70 80 90 100 110 120 pF1KB9 RARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK : :: ::.: .:: : . ::::: ::::::::::::::.::::::.:::.::::.:::: CCDS41 RRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTK 80 90 100 110 120 130 130 140 150 160 170 pF1KB9 IETLRFAHNYIWALTQTLRIADHSLYA------LEPP--APHCGELGSPGGSPGDWGS-- :::::::.::::::..:::.::..: . : :: .: ::... .::: CCDS41 IETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGA 140 150 160 170 180 190 180 190 200 210 pF1KB9 -LYSPVSQAGSLSPAAS--LEERPGLLGATSSACLSPGSLAFSDFL ::.:. .: :::: . ::: CCDS41 AAASPLSDPSS--PAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH 200 210 220 230 >>CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 (272 aa) initn: 419 init1: 372 opt: 414 Z-score: 385.2 bits: 78.7 E(32554): 3.9e-15 Smith-Waterman score: 428; 48.4% identity (64.6% similar) in 192 aa overlap (43-211:64-239) 20 30 40 50 60 pF1KB9 VTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRA------RRG ::. : . :: :: : .: . :: CCDS36 SSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CR--PARLLGLVHDCKRRP 40 50 60 70 80 90 70 80 90 100 110 120 pF1KB9 GRSRP-----KSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK .:.: :. .... ...:: :::.::::::::::.:::::: ::::::.:::::: CCDS36 SRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTK 100 110 120 130 140 150 130 140 150 160 170 pF1KB9 IETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGS---LYSP--- :::::::::::::::.:::.::: :: :. :: :: : : :: CCDS36 IETLRFAHNYIWALTETLRLADH-----------CG--GGGGGLPGALFSEAVLLSPGGA 160 170 180 190 180 190 200 210 pF1KB9 ---VSQAG-SLSPAA--SLEERPGLLGATSSACLSPGSLAFSDFL .:..: : :::. : . :. ...:: :: : ..: CCDS36 SAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD 200 210 220 230 240 250 CCDS36 KHRYAPHLPIARDCI 260 270 214 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:51:35 2016 done: Fri Nov 4 17:51:36 2016 Total Scan time: 2.670 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]