FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA1154, 476 aa
1>>>pF1KA1154 476 - 476 aa - 476 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.5178+/- 0.001; mu= 9.2146+/- 0.061
mean_var=174.9004+/-34.446, 0's: 0 Z-trim(109.9): 33 B-trim: 0 in 0/53
Lambda= 0.096979
statistics sampled from 11180 (11198) to 11180 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.344), width: 16
Scan time: 3.550
The best scores are:                                  opt  bits E(32554)
CCDS4550.1 DCDC2 gene_id:51473|Hs108|chr6     ( 476) 3093 445.1 7.4e-125
CCDS74481.1 DCDC2C gene_id:728597|Hs108|chr2  ( 364)  579  93.3 4.7e-19
CCDS44100.1 DCDC2B gene_id:149069|Hs108|chr1  ( 349)  538  87.5 2.4e-17
>>CCDS4550.1 DCDC2 gene_id:51473|Hs108|chr6 (476 aa)
initn: 3093 init1: 3093 opt: 3093 Z-score: 2355.1 bits: 445.1 E(32554): 7.4e-125
Smith-Waterman score: 3093; 99.8% identity (100.0% similar) in 476 aa overlap (1-476:1-476)
10 20 30 40 50 60
pF1KA1 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA1 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNTEVKPV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNTEVKPV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA1 IHSRINVSARFRKPLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 IHSRINVSARFRKPLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITLR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA1 SGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYGELLFDKSTMRRPFGQKASS
::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::::
CCDS45 SGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYSELLFDKSTMRRPFGQKASS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA1 LPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKNS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 LPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKNS
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA1 QETIPNSDEGIFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKANKDA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 QETIPNSDEGIFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKANKDA
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA1 EQKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEELQQVN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 EQKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEELQQVN
370 380 390 400 410 420
430 440 450 460 470
pF1KA1 NELQLVLDKERKSQGAGSGQDEADVDPQRPPRPEVKITSPEENENNQQNKDYAAVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 NELQLVLDKERKSQGAGSGQDEADVDPQRPPRPEVKITSPEENENNQQNKDYAAVA
430 440 450 460 470
>>CCDS74481.1 DCDC2C gene_id:728597|Hs108|chr2 (364 aa)
initn: 560 init1: 169 opt: 579 Z-score: 455.6 bits: 93.3 E(32554): 4.7e-19
Smith-Waterman score: 716; 33.7% identity (68.1% similar) in 383 aa overlap (3-380:2-359)
10 20 30 40 50 60
pF1KA1 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
:. . :. .. .:...:::::::::.:.. :. ......::..:...: :..:::
CCDS74 MGTRGPSAPVDTTPAKTIVVYRNGDPFYVGKKFVLSRRRAATFEALLEQLTEQVDVPFG
10 20 30 40 50
70 80 90 100 110
pF1KA1 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNT-EVKP
:: ..:: :::. :: .:.::.:::.:.: ::.:.:. : . ..: .. . :.::
CCDS74 -VRRLFTPTRGHRVLGLDALQAGGKYVAAGRERFKELDYIHI--VPRKPAKIRKLKEIKP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KA1 VIHSRINVSARFRKPLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITL
:.: ::: .... . : ...:: :. : ....::. .:..:: :: . ::. .
CCDS74 VVHCDINVPSKWQTYHRISRHINVFTNGRLFIPPAKIIIPKFSLSDWDIVLATIGEKV-F
120 130 140 150 160 170
180 190 200 210 220 230
pF1KA1 RSGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYGELLFDKSTMRRPFGQKAS
:.:..:.:..:.:. .. .:....:::::: . :: .:: :.
CCDS74 PLGGVRKLFTMNGHLLGDSKDLQDNHFYVAVGLETFKYFPY---------------WKSP
180 190 200 210 220
240 250 260 270 280 290
pF1KA1 SLPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKN
.: : .: .. :....:.. :... .: : : .. ... :.. :
CCDS74 RVPSEVQQRYANVEKNSQRKKKV----DSKGKEPCKYDGIPPKTQ-DSVYYAKEEKKKTL
230 240 250 260 270
300 310 320 330 340 350
pF1KA1 SQETIPNSDEG-IFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKANK
.. . . :: ..:: . .::.:: .:.:....:.::::::: :::: :.:. .. .
CCDS74 AEPLVQRGAEGDVYKAPTPSKETQGALDVKEEHNVQLEVPVDQRQAEIVKEDEEIHENTP
280 290 300 310 320 330
360 370 380 390 400 410
pF1KA1 DAE---QKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEE
: : .::: . . :.:.. .::
CCDS74 DFEGNKDKED-ARLCEDVERKMAREWKPVD
340 350 360
>>CCDS44100.1 DCDC2B gene_id:149069|Hs108|chr1 (349 aa)
initn: 697 init1: 275 opt: 538 Z-score: 424.9 bits: 87.5 E(32554): 2.4e-17
Smith-Waterman score: 664; 38.1% identity (64.6% similar) in 339 aa overlap (14-349:6-326)
10 20 30 40 50 60
pF1KA1 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
:..: :.::::::::. : ..:. ... ..:.:: :::..::::.
CCDS44 MAGGSPAAKRVVVYRNGDPFFPGSQLVVTQRRFPTMEAFLCEVTSAVQAPL-
10 20 30 40 50
70 80 90 100 110 120
pF1KA1 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNTEVKPV
::: .::: :: . .: .... :.:::.: : :.::.:: . : : :
CCDS44 AVRALYTPCHGHPVTNLADLKNRGQYVAAGFERFHKLHYLP--HRGKDPGGKSCRLQGPP
60 70 80 90 100
130 140 150 160 170
pF1KA1 IHSRINVSARFRK-PLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITL
. .. .: :. : : : .. ::::..: : . . . ..:. ::...:::. :
CCDS44 VTRHLCDGAIGRQLPAGAPSYIHVFRNGDLVSPPFSLKLSQAASQDWETVLKLLTEKVKL
110 120 130 140 150 160
180 190 200 210 220 230
pF1KA1 RSGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYGELLFDKSTMRRPFGQKAS
.:::: .: :::: . .: :: .:..:::::.:.:: ::: ::: . .. : :
CCDS44 QSGAVCKLCTLEGLPLSAGKELVTGHYYVAVGEDEFKDLPYLELLVPSPSLPRGCWQ---
170 180 190 200 210 220
240 250 260 270 280 290
pF1KA1 SLPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKN
:: :: . .: . : .. : . . . :. .: : .. .:... ..
CCDS44 --PPGSKSRPHR-QGAQGH-RAQVTQPSPKEPDRIK--------PSAFYARPQQTIQPRS
230 240 250 260 270
300 310 320 330 340 350
pF1KA1 SQETI--PNSDEGIFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKAN
. :. :.. :.. : .:.:: :: :: .:::::.: :.::: :.::.:
CCDS44 KLPTLSFPSGVIGVYGAPHRRKETAGALEVADDEDTQTEEPLDQRAAQIVEEALSLENQP
280 290 300 310 320 330
360 370 380 390 400 410
pF1KA1 KDAEQKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEELQ
CCDS44 GAGAAISASAPALPS
340
476 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 20:39:10 2016 done: Wed Nov 2 20:39:11 2016
Total Scan time: 3.550 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]