FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9630, 237 aa 1>>>pF1KB9630 237 - 237 aa - 237 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.2516+/-0.000635; mu= 8.6255+/- 0.039 mean_var=154.4853+/-31.323, 0's: 0 Z-trim(117.4): 44 B-trim: 837 in 1/53 Lambda= 0.103188 statistics sampled from 18050 (18095) to 18050 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.84), E-opt: 0.2 (0.556), width: 16 Scan time: 2.810 The best scores are: opt bits E(32554) CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 ( 237) 1621 251.6 3.4e-67 CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 ( 214) 472 80.5 9.7e-16 CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 ( 272) 437 75.4 4.3e-14 >>CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 (237 aa) initn: 1621 init1: 1621 opt: 1621 Z-score: 1319.9 bits: 251.6 E(32554): 3.4e-67 Smith-Waterman score: 1621; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:1-237) 10 20 30 40 50 60 pF1KB9 MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA 130 140 150 160 170 180 190 200 210 220 230 pF1KB9 SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH 190 200 210 220 230 >>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 (214 aa) initn: 453 init1: 412 opt: 472 Z-score: 396.1 bits: 80.5 E(32554): 9.7e-16 Smith-Waterman score: 489; 53.7% identity (70.9% similar) in 175 aa overlap (42-213:34-193) 20 30 40 50 60 70 pF1KB9 LDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPAR-RGAPNISRASEVPGAQDDEQE :.::.:.: :: : ..: : .. CCDS31 QPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRG--NCAEAEEGGCRGAPRKL 10 20 30 40 50 60 80 90 100 110 120 130 pF1KB9 RRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTK : :: ::.: .:: : . ::::: ::::::::::::::.::::::.:::.::::.:::: CCDS31 RARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK 70 80 90 100 110 120 140 150 160 170 180 190 pF1KB9 IETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGA :::::::.::::::..:::.::..: . : :: .: ::... .::: CCDS31 IETLRFAHNYIWALTQTLRIADHSLYA------LEPP--APHCGELGSPGGSPGDWGS-- 130 140 150 160 170 200 210 220 230 pF1KB9 AAASPLSDPSS--PAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH ::.:. .: :::: . ::: CCDS31 -LYSPVSQAGSLSPAAS--LEERPGLLGATFSACLSPGSLAFSDFL 180 190 200 210 >>CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 (272 aa) initn: 519 init1: 373 opt: 437 Z-score: 366.5 bits: 75.4 E(32554): 4.3e-14 Smith-Waterman score: 437; 46.2% identity (67.2% similar) in 186 aa overlap (33-215:51-233) 10 20 30 40 50 60 pF1KB9 ARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASAS-GPPAPARRGAPNISRASEV :: :..: :. : . . :: . : ... CCDS36 GSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CRPARL 30 40 50 60 70 70 80 90 100 110 pF1KB9 PGAQDDEQER-RRRRGRTR-VRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVL : : ..: : :. .: ... .. ....::.:::.::::::::::::::::: :: CCDS36 LGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVL 80 90 100 110 120 130 120 130 140 150 160 170 pF1KB9 PSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSP :.::.:.:::::::::::.::::::.:::::::. :::. : . : :: : CCDS36 PTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASA 140 150 160 170 180 190 180 190 200 210 220 230 pF1KB9 ASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH : : :.. . :: : .::: : . . .: CCDS36 A--LSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD 200 210 220 230 240 250 CCDS36 KHRYAPHLPIARDCI 260 270 237 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:49:43 2016 done: Fri Nov 4 17:49:44 2016 Total Scan time: 2.810 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]