FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB0993, 278 aa
  1>>>pF1KB0993 278 - 278 aa - 278 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2746+/-0.000663; mu= 17.2221+/- 0.040
 mean_var=86.7300+/-16.696, 0's: 0 Z-trim(112.7): 7 B-trim: 0 in 0/52
 Lambda= 0.137718
 statistics sampled from 13426 (13430) to 13426 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.766), E-opt: 0.2 (0.413), width: 16
 Scan time: 2.610

The best scores are:                                 opt   bits  E(32554)
CCDS4875.1 CNPY3 gene_id:10695|Hs108|chr6    ( 278)  1836  373.8 7.4e-104
CCDS34701.1 CNPY4 gene_id:245812|Hs108|chr7  ( 248)   665  141.1 7.5e-34

>>CCDS4875.1 CNPY3 gene_id:10695|Hs108|chr6 (278 aa)
 initn: 1836 init1: 1836 opt: 1836 Z-score: 1978.0 bits: 373.8 E(32554): 7.4e-104
Smith-Waterman score: 1836; 100.0% identity (100.0% similar) in 278 aa overlap (1-278:1-278)

               10        20        30        40        50        60
pF1KB0 MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 SAFEETGKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTGSN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 SAFEETGKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTGSN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB0 RFAKGMSETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIEDW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 RFAKGMSETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIEDW
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB0 YRNHQEEDLTEFLCANHVLKGKDTSCLAEQWSGKKGDTAALGGKKSKKKSSRAKAAGGRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YRNHQEEDLTEFLCANHVLKGKDTSCLAEQWSGKKGDTAALGGKKSKKKSSRAKAAGGRS
              190       200       210       220       230       240

              250       260       270
pF1KB0 SSSKQRKELGGLEGDPSPEEDEGIQKASPLTHSPPDEL
       ::::::::::::::::::::::::::::::::::::::
CCDS48 SSSKQRKELGGLEGDPSPEEDEGIQKASPLTHSPPDEL
              250       260       270

>>CCDS34701.1 CNPY4 gene_id:245812|Hs108|chr7 (248 aa)
 initn: 639 init1: 489 opt: 665 Z-score: 721.2 bits: 141.1 E(32554): 7.5e-34
Smith-Waterman score: 665; 44.9% identity (74.4% similar) in 227 aa overlap (17-238:6-231)

       10 20 30 40 50 60
pF1KB0 MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELK
       : .::.:.: . : .. :..: :::::::::: ...::.
CCDS34 MGPVRLGILLFLFLAVHEAWAGMLKEEDDDTERLPSKCEVCKLLSTELQ
       10 20 30 40

       70 80 90 100 110
pF1KB0 SAFEETGKTKEVIGTGYGILD--QKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTG
       . . .::...::. : .:: .. : :. :. :: :. :..:.:.::::.: :: :
CCDS34 AELSRTGRSREVLELGQ-VLDTGKRKRHVPYSVSETRLEEALENLCERILDYSVHAERKG
       50 60 70 80 90 100

       120 130 140 150 160 170
pF1KB0 SNRFAKGMSETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIE
       : :.:::.:.:. ::..::.::::: . :: :::.: :.::. :::::....::::...
CCDS34 SLRYAKGQSQTMATLKGLVQKGVKVDLGIPLELWDEPSVEVTYLKKQCETMLEEFEDIVG
       110 120 130 140 150 160

       180 190 200 210 220 230
pF1KB0 DWYRNHQEEDLTEFLCANHVLKGKDTSCLAEQWSGKK---GDTAALGGKKSKKKSSRAKA
       ::: .:::. : .::: .::: . .:.:: : :.::. :. . : ...... . .
CCDS34 DWYFHHQEQPLQNFLCEGHVLPAAETACLQETWTGKEITDGEEKTEGEEEQEEEEEEEEE
       170 180 190 200 210 220

       240 250 260 270
pF1KB0 AGGRSSSSKQRKELGGLEGDPSPEEDEGIQKASPLTHSPPDEL
       ::
CCDS34 EGGDKMTKTGSHPKLDREDL
       230 240

278 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 17:42:55 2016 done: Sat Nov 5 17:42:55 2016
 Total Scan time: 2.610 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]