FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9630, 237 aa
1>>>pF1KB9630 237 - 237 aa - 237 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.2516+/-0.000635; mu= 8.6255+/- 0.039
mean_var=154.4853+/-31.323, 0's: 0 Z-trim(117.4): 44 B-trim: 837 in 1/53
Lambda= 0.103188
statistics sampled from 18050 (18095) to 18050 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.84), E-opt: 0.2 (0.556), width: 16
Scan time: 2.810
The best scores are:                                  opt  bits E(32554)
CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5    ( 237) 1621 251.6 3.4e-67
CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 ( 214)  472  80.5 9.7e-16
CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4   ( 272)  437  75.4 4.3e-14
>>CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5 (237 aa)
initn: 1621 init1: 1621 opt: 1621 Z-score: 1319.9 bits: 251.6 E(32554): 3.4e-67
Smith-Waterman score: 1621; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:1-237)
10 20 30 40 50 60
pF1KB9 MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA
130 140 150 160 170 180
190 200 210 220 230
pF1KB9 SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
190 200 210 220 230
>>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10 (214 aa)
initn: 453 init1: 412 opt: 472 Z-score: 396.1 bits: 80.5 E(32554): 9.7e-16
Smith-Waterman score: 489; 53.7% identity (70.9% similar) in 175 aa overlap (42-213:34-193)
20 30 40 50 60 70
pF1KB9 LDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPAR-RGAPNISRASEVPGAQDDEQE
:.::.:.: :: : ..: : ..
CCDS31 QPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRG--NCAEAEEGGCRGAPRKL
10 20 30 40 50 60
80 90 100 110 120 130
pF1KB9 RRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLPSFPDDTKLTK
: :: ::.: .:: : . ::::: ::::::::::::::.::::::.:::.::::.::::
CCDS31 RARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK
70 80 90 100 110 120
140 150 160 170 180 190
pF1KB9 IETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPASDAESWGSGA
:::::::.::::::..:::.::..: . : :: .: ::... .:::
CCDS31 IETLRFAHNYIWALTQTLRIADHSLYA------LEPP--APHCGELGSPGGSPGDWGS--
130 140 150 160 170
200 210 220 230
pF1KB9 AAASPLSDPSS--PAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
::.:. .: :::: . :::
CCDS31 -LYSPVSQAGSLSPAAS--LEERPGLLGATFSACLSPGSLAFSDFL
180 190 200 210
>>CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4 (272 aa)
initn: 519 init1: 373 opt: 437 Z-score: 366.5 bits: 75.4 E(32554): 4.3e-14
Smith-Waterman score: 437; 46.2% identity (67.2% similar) in 186 aa overlap (33-215:51-233)
10 20 30 40 50 60
pF1KB9 ARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASAS-GPPAPARRGAPNISRASEV
:: :..: :. : . . :: . : ...
CCDS36 GSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CRPARL
30 40 50 60 70
70 80 90 100 110
pF1KB9 PGAQDDEQER-RRRRGRTR-VRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVL
: : ..: : :. .: ... .. ....::.:::.::::::::::::::::: ::
CCDS36 LGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVL
80 90 100 110 120 130
120 130 140 150 160 170
pF1KB9 PSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSP
:.::.:.:::::::::::.::::::.:::::::. :::. : . : :: :
CCDS36 PTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASA
140 150 160 170 180 190
180 190 200 210 220 230
pF1KB9 ASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH
: : :.. . :: : .::: : . . .:
CCDS36 A--LSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD
200 210 220 230 240 250
CCDS36 KHRYAPHLPIARDCI
260 270
237 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:49:43 2016 done: Fri Nov 4 17:49:44 2016
Total Scan time: 2.810 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]