FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA1154, 476 aa
  1>>>pF1KA1154 476 - 476 aa - 476 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.5178+/- 0.001; mu= 9.2146+/- 0.061
 mean_var=174.9004+/-34.446, 0's: 0 Z-trim(109.9): 33 B-trim: 0 in 0/53
 Lambda= 0.096979
 statistics sampled from 11180 (11198) to 11180 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.344), width: 16
 Scan time: 3.550

The best scores are:                                  opt  bits E(32554)
CCDS4550.1 DCDC2 gene_id:51473|Hs108|chr6     ( 476) 3093 445.1 7.4e-125
CCDS74481.1 DCDC2C gene_id:728597|Hs108|chr2  ( 364)  579  93.3 4.7e-19
CCDS44100.1 DCDC2B gene_id:149069|Hs108|chr1  ( 349)  538  87.5 2.4e-17

>>CCDS4550.1 DCDC2 gene_id:51473|Hs108|chr6 (476 aa)
 initn: 3093 init1: 3093 opt: 3093 Z-score: 2355.1 bits: 445.1 E(32554): 7.4e-125
Smith-Waterman score: 3093; 99.8% identity (100.0% similar) in 476 aa overlap (1-476:1-476)

               10        20        30        40        50        60
pF1KA1 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNTEVKPV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNTEVKPV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 IHSRINVSARFRKPLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 IHSRINVSARFRKPLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITLR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 SGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYGELLFDKSTMRRPFGQKASS
       ::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::::
CCDS45 SGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYSELLFDKSTMRRPFGQKASS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 LPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKNS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 LPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKNS
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA1 QETIPNSDEGIFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKANKDA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 QETIPNSDEGIFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKANKDA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KA1 EQKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEELQQVN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 EQKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEELQQVN
              370       380       390       400       410       420

              430       440       450       460       470
pF1KA1 NELQLVLDKERKSQGAGSGQDEADVDPQRPPRPEVKITSPEENENNQQNKDYAAVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 NELQLVLDKERKSQGAGSGQDEADVDPQRPPRPEVKITSPEENENNQQNKDYAAVA
              430       440       450       460       470

>>CCDS74481.1 DCDC2C gene_id:728597|Hs108|chr2 (364 aa)
 initn: 560 init1: 169 opt: 579 Z-score: 455.6 bits: 93.3 E(32554): 4.7e-19
Smith-Waterman score: 716; 33.7% identity (68.1% similar) in 383 aa overlap (3-380:2-359)

       10 20 30 40 50 60
pF1KA1 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
       :. . :. .. .:...:::::::::.:.. :. ......::..:...: :..:::
CCDS74 MGTRGPSAPVDTTPAKTIVVYRNGDPFYVGKKFVLSRRRAATFEALLEQLTEQVDVPFG
       10 20 30 40 50

       70 80 90 100 110
pF1KA1 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNT-EVKP
       :: ..:: :::. :: .:.::.:::.:.: ::.:.:. : . ..: .. . :.::
CCDS74 -VRRLFTPTRGHRVLGLDALQAGGKYVAAGRERFKELDYIHI--VPRKPAKIRKLKEIKP
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KA1 VIHSRINVSARFRKPLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITL
       :.: ::: .... . : ...:: :. : ....::. .:..:: :: . ::. .
CCDS74 VVHCDINVPSKWQTYHRISRHINVFTNGRLFIPPAKIIIPKFSLSDWDIVLATIGEKV-F
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KA1 RSGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYGELLFDKSTMRRPFGQKAS
       :.:..:.:..:.:. .. .:....:::::: . :: .:: :.
CCDS74 PLGGVRKLFTMNGHLLGDSKDLQDNHFYVAVGLETFKYFPY---------------WKSP
       180 190 200 210 220

       240 250 260 270 280 290
pF1KA1 SLPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKN
       .: : .: .. :....:.. :... .: : : .. ... :.. :
CCDS74 RVPSEVQQRYANVEKNSQRKKKV----DSKGKEPCKYDGIPPKTQ-DSVYYAKEEKKKTL
       230 240 250 260 270

       300 310 320 330 340 350
pF1KA1 SQETIPNSDEG-IFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKANK
       .. . . :: ..:: . .::.:: .:.:....:.::::::: :::: :.:. .. .
CCDS74 AEPLVQRGAEGDVYKAPTPSKETQGALDVKEEHNVQLEVPVDQRQAEIVKEDEEIHENTP
       280 290 300 310 320 330

       360 370 380 390 400 410
pF1KA1 DAE---QKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEE
       : : .::: . . :.:.. .::
CCDS74 DFEGNKDKED-ARLCEDVERKMAREWKPVD
       340 350 360

>>CCDS44100.1 DCDC2B gene_id:149069|Hs108|chr1 (349 aa)
 initn: 697 init1: 275 opt: 538 Z-score: 424.9 bits: 87.5 E(32554): 2.4e-17
Smith-Waterman score: 664; 38.1% identity (64.6% similar) in 339 aa overlap (14-349:6-326)

       10 20 30 40 50 60
pF1KA1 MSGSSARSSHLSQPVVKSVLVYRNGDPFYAGRRVVIHEKKVSSFEVFLKEVTGGVQAPFG
       :..: :.::::::::. : ..:. ... ..:.:: :::..::::.
CCDS44 MAGGSPAAKRVVVYRNGDPFFPGSQLVVTQRRFPTMEAFLCEVTSAVQAPL-
       10 20 30 40 50

       70 80 90 100 110 120
pF1KA1 AVRNIYTPRTGHRIRKLDQIQSGGNYVAGGQEAFKKLNYLDIGEIKKRPMEVVNTEVKPV
       ::: .::: :: . .: .... :.:::.: : :.::.:: . : : :
CCDS44 AVRALYTPCHGHPVTNLADLKNRGQYVAAGFERFHKLHYLP--HRGKDPGGKSCRLQGPP
       60 70 80 90 100

       130 140 150 160 170
pF1KA1 IHSRINVSARFRK-PLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITL
       . .. .: :. : : : .. ::::..: : . . . ..:. ::...:::. :
CCDS44 VTRHLCDGAIGRQLPAGAPSYIHVFRNGDLVSPPFSLKLSQAASQDWETVLKLLTEKVKL
       110 120 130 140 150 160

       180 190 200 210 220 230
pF1KA1 RSGAVHRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYGELLFDKSTMRRPFGQKAS
       .:::: .: :::: . .: :: .:..:::::.:.:: ::: ::: . .. : :
CCDS44 QSGAVCKLCTLEGLPLSAGKELVTGHYYVAVGEDEFKDLPYLELLVPSPSLPRGCWQ---
       170 180 190 200 210 220

       240 250 260 270 280 290
pF1KA1 SLPPIVGSRKSKGSGNDRHSKSTVGSSDNSSPQPLKRKGKKEDVNSEKLTKLKQNVKLKN
       :: :: . .: . : .. : . . . :. .: : .. .:... ..
CCDS44 --PPGSKSRPHR-QGAQGH-RAQVTQPSPKEPDRIK--------PSAFYARPQQTIQPRS
       230 240 250 260 270

       300 310 320 330 340 350
pF1KA1 SQETI--PNSDEGIFKAGAERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKAN
       . :. :.. :.. : .:.:: :: :: .:::::.: :.::: :.::.:
CCDS44 KLPTLSFPSGVIGVYGAPHRRKETAGALEVADDEDTQTEEPLDQRAAQIVEEALSLENQP
       280 290 300 310 320 330

       360 370 380 390 400 410
pF1KA1 KDAEQKEDFSGMNGDLEEEGGREATDAPEQVEEILDHSEQQARPARVNGGTDEENGEELQ
CCDS44 GAGAAISASAPALPS
       340

476 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov 2 20:39:10 2016 done: Wed Nov 2 20:39:11 2016
 Total Scan time: 3.550 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]