FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB0993, 278 aa
1>>>pF1KB0993 278 - 278 aa - 278 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2746+/-0.000663; mu= 17.2221+/- 0.040
mean_var=86.7300+/-16.696, 0's: 0 Z-trim(112.7): 7 B-trim: 0 in 0/52
Lambda= 0.137718
statistics sampled from 13426 (13430) to 13426 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.766), E-opt: 0.2 (0.413), width: 16
Scan time: 2.610
The best scores are:                                  opt  bits E(32554)
CCDS4875.1 CNPY3 gene_id:10695|Hs108|chr6     ( 278) 1836 373.8 7.4e-104
CCDS34701.1 CNPY4 gene_id:245812|Hs108|chr7   ( 248)  665 141.1 7.5e-34
>>CCDS4875.1 CNPY3 gene_id:10695|Hs108|chr6 (278 aa)
initn: 1836 init1: 1836 opt: 1836 Z-score: 1978.0 bits: 373.8 E(32554): 7.4e-104
Smith-Waterman score: 1836; 100.0% identity (100.0% similar) in 278 aa overlap (1-278:1-278)
               10        20        30        40        50        60
pF1KB0 MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELK
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 SAFEETGKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTGSN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 SAFEETGKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTGSN
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB0 RFAKGMSETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIEDW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 RFAKGMSETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIEDW
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB0 YRNHQEEDLTEFLCANHVLKGKDTSCLAEQWSGKKGDTAALGGKKSKKKSSRAKAAGGRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YRNHQEEDLTEFLCANHVLKGKDTSCLAEQWSGKKGDTAALGGKKSKKKSSRAKAAGGRS
              190       200       210       220       230       240
              250       260       270
pF1KB0 SSSKQRKELGGLEGDPSPEEDEGIQKASPLTHSPPDEL
       ::::::::::::::::::::::::::::::::::::::
CCDS48 SSSKQRKELGGLEGDPSPEEDEGIQKASPLTHSPPDEL
              250       260       270
>>CCDS34701.1 CNPY4 gene_id:245812|Hs108|chr7 (248 aa)
initn: 639 init1: 489 opt: 665 Z-score: 721.2 bits: 141.1 E(32554): 7.5e-34
Smith-Waterman score: 665; 44.9% identity (74.4% similar) in 227 aa overlap (17-238:6-231)
10 20 30 40 50 60
pF1KB0 MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELK
: .::.:.: . : .. :..: :::::::::: ...::.
CCDS34 MGPVRLGILLFLFLAVHEAWAGMLKEEDDDTERLPSKCEVCKLLSTELQ
10 20 30 40
70 80 90 100 110
pF1KB0 SAFEETGKTKEVIGTGYGILD--QKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTG
. . .::...::. : .:: .. : :. :. :: :. :..:.:.::::.: :: :
CCDS34 AELSRTGRSREVLELGQ-VLDTGKRKRHVPYSVSETRLEEALENLCERILDYSVHAERKG
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB0 SNRFAKGMSETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIE
: :.:::.:.:. ::..::.::::: . :: :::.: :.::. :::::....::::...
CCDS34 SLRYAKGQSQTMATLKGLVQKGVKVDLGIPLELWDEPSVEVTYLKKQCETMLEEFEDIVG
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB0 DWYRNHQEEDLTEFLCANHVLKGKDTSCLAEQWSGKK---GDTAALGGKKSKKKSSRAKA
::: .:::. : .::: .::: . .:.:: : :.::. :. . : ...... . .
CCDS34 DWYFHHQEQPLQNFLCEGHVLPAAETACLQETWTGKEITDGEEKTEGEEEQEEEEEEEEE
170 180 190 200 210 220
240 250 260 270
pF1KB0 AGGRSSSSKQRKELGGLEGDPSPEEDEGIQKASPLTHSPPDEL
::
CCDS34 EGGDKMTKTGSHPKLDREDL
230 240
278 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 17:42:55 2016 done: Sat Nov 5 17:42:55 2016
Total Scan time: 2.610 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]