FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5834, 481 aa
  1>>>pF1KB5834 481 - 481 aa - 481 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.4265+/-0.000762; mu= 9.8456+/- 0.046
 mean_var=153.8339+/-31.093, 0's: 0 Z-trim(114.1): 25 B-trim: 882 in 1/52
 Lambda= 0.103407
 statistics sampled from 14674 (14696) to 14674 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.451), width: 16
 Scan time: 3.900

The best scores are:                              opt  bits E(32554)
CCDS1954.1 DOK1 gene_id:1796|Hs108|chr2   ( 481) 3323 507.2 1.5e-143
CCDS56125.1 DOK1 gene_id:1796|Hs108|chr2  ( 342) 2367 364.5 1e-100
CCDS82474.1 DOK1 gene_id:1796|Hs108|chr2  ( 177) 1048 167.5 1e-41
CCDS78098.1 DOK3 gene_id:79930|Hs108|chr5 ( 440)  675 112.2 1.2e-24
CCDS4426.1 DOK3 gene_id:79930|Hs108|chr5  ( 496)  675 112.2 1.3e-24
CCDS6016.1 DOK2 gene_id:9046|Hs108|chr8   ( 412)  590  99.5 7.4e-21
CCDS47350.1 DOK3 gene_id:79930|Hs108|chr5 ( 330)  504  86.6 4.6e-17

>>CCDS1954.1 DOK1 gene_id:1796|Hs108|chr2 (481 aa)
 initn: 3323 init1: 3323 opt: 3323 Z-score: 2690.5 bits: 507.2 E(32554): 1.5e-143
Smith-Waterman score: 3323; 100.0% identity (100.0% similar) in 481 aa overlap (1-481:1-481)

       10 20 30 40 50 60
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB5 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KB5 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEA
       130 140 150 160 170 180

       190 200 210 220 230 240
pF1KB5 ERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGND
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 ERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGND
       190 200 210 220 230 240

       250 260 270 280 290 300
pF1KB5 IFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELLDSPPALYAEPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 IFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELLDSPPALYAEPL
       250 260 270 280 290 300

       310 320 330 340 350 360
pF1KB5 DSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLLKAKLTDPKEDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 DSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLLKAKLTDPKEDP
       310 320 330 340 350 360

       370 380 390 400 410 420
pF1KB5 IYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDYAVPPPRSTKPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 IYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDYAVPPPRSTKPL
       370 380 390 400 410 420

       430 440 450 460 470 480
pF1KB5 LAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVGTDKTGVKSEGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 LAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVGTDKTGVKSEGS
       430 440 450 460 470 480

pF1KB5 T
       :
CCDS19 T

>>CCDS56125.1 DOK1 gene_id:1796|Hs108|chr2 (342 aa)
 initn: 2367 init1: 2367 opt: 2367 Z-score: 1921.7 bits: 364.5 E(32554): 1e-100
Smith-Waterman score: 2367; 100.0% identity (100.0% similar) in 342 aa overlap (140-481:1-342)

       110 120 130 140 150 160
pF1KB5 WVQTLCRNAFPKGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCG
                                     ::::::::::::::::::::::::::::::
CCDS56                               MLENSLYSPTWEGSQFWVTVQRTEAAERCG
                                     10 20 30

       170 180 190 200 210 220
pF1KB5 LHGSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LHGSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPG
       40 50 60 70 80 90

       230 240 250 260 270 280
pF1KB5 TFTFQTAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 TFTFQTAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELL
       100 110 120 130 140 150

       290 300 310 320 330 340
pF1KB5 DSPPALYAEPLDSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DSPPALYAEPLDSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLL
       160 170 180 190 200 210

       350 360 370 380 390 400
pF1KB5 KAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 KAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDY
       220 230 240 250 260 270

       410 420 430 440 450 460
pF1KB5 AVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVG
       280 290 300 310 320 330

       470 480
pF1KB5 TDKTGVKSEGST
       ::::::::::::
CCDS56 TDKTGVKSEGST
       340

>>CCDS82474.1 DOK1 gene_id:1796|Hs108|chr2 (177 aa)
 initn: 1048 init1: 1048 opt: 1048 Z-score: 862.2 bits: 167.5 E(32554): 1e-41
Smith-Waterman score: 1048; 100.0% identity (100.0% similar) in 152 aa overlap (1-152:1-152)

       10 20 30 40 50 60
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB5 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KB5 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEA
       ::::::::::::::::::::::::::::::::
CCDS82 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGHVLFRGRPPLPLRPWNLHLPDGTGK
       130 140 150 160 170

       190 200 210 220 230 240
pF1KB5 ERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGND

>>CCDS78098.1 DOK3 gene_id:79930|Hs108|chr5 (440 aa)
 initn: 574 init1: 214 opt: 675 Z-score: 556.0 bits: 112.2 E(32554): 1.2e-24
Smith-Waterman score: 675; 37.8% identity (59.3% similar) in 386 aa overlap (1-375:4-379)

       10 20 30 40 50
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHK--GSSSGGGR-
       .. . .: :. : .:: : :::.::.:: ..: :::::: .. . : ...: :
CCDS78 MDPLETPIKDGILYQQHVKFGKKCWRKVWALLYAGGPSGVARLESWEVRDGGLGAAGDRS
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KB5 -GSSRRLDCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQT
       : .:: . .:::::.:::: :. :. :. . :: : :..:::::::. ::.
CCDS78 AGPGRRGERRVIRLADCVSVLPADGESCPRD-TGAFLLTTTERSHLLAAQ--HRQAWMGP
       70 80 90 100 110

       120 130 140 150 160 170
pF1KB5 LCRNAFP-KGSWTLAPTD-NPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLH
       .:. ::: : . . :: . :: . . : :::.:: : ..: :.::::::: :: :.
CCDS78 ICQLAFPGTGEASSGSTDAQSPKRGLVPMEENSIYSSWQEVGEFPVVVQRTEAATRCQLK
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KB5 GSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTF
       : .: . . . : ... . : :::: .::..: :: .::::::::: :: : :
CCDS78 GPALLVLGPDAIQL----REAKGTQALYSWPYHFLRKFGSDKGVFSFEAGRRCHSGEGLF
       180 190 200 210 220 230

       240 250 260 270 280
pF1KB5 TFQTAQGNDIFQAVETAIHRQKAQGKA---GQGHDVLRADSHEGEVAEGKL-PSPPGPQE
       .:.: . :. .:: :: ::. . : . :: : . . :.: ::::.
CCDS78 AFSTPCAPDLCRAVAGAIARQRERLPELTRPQPCPLPRATSLPSLDTPGELREMPPGPEP
       240 250 260 270 280 290

       290 300 310 320 330 340
pF1KB5 LLDSPPALYAEPL-DSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQ
       . : ::: .:: . : ..: : :. .:. : .. :: .: .
CCDS78 PTSRKMHL-AEPGPQSLPLLLGPEPNDLASGLYASVCKRAS-GPPGNEHLYENLCVLEAS
       300 310 320 330 340 350

       350 360 370 380 390 400
pF1KB5 QLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPAT
       :.. .:.: : : .:. .:
CCDS78 PTLHGGEPEPHEGPGSRSPT-TSPIYHNGQDLSWPGPANDSTLEAQYRRLLELDQVEGTG
       360 370 380 390 400 410

>>CCDS4426.1 DOK3 gene_id:79930|Hs108|chr5 (496 aa)
 initn: 574 init1: 214 opt: 675 Z-score: 555.3 bits: 112.2 E(32554): 1.3e-24
Smith-Waterman score: 675; 37.8% identity (59.3% similar) in 386 aa overlap (1-375:60-435)

       10 20 30
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLY
       .. . .: :. : .:: : :::.::.::
CCDS44 GKCEEFPSSLSSVSPGLEAAALLLAVTMDPLETPIKDGILYQQHVKFGKKCWRKVWALLY
       30 40 50 60 70 80

       40 50 60 70 80
pF1KB5 PASPHGVARLEFFDHK--GSSSGGGR--GSSRRLDCKVIRLAECVSVAPVTVETPPEPGA
       ..: :::::: .. . : ...: : : .:: . .:::::.:::: :. :. :. .
CCDS44 AGGPSGVARLESWEVRDGGLGAAGDRSAGPGRRGERRVIRLADCVSVLPADGESCPRD-T
       90 100 110 120 130 140

       90 100 110 120 130 140
pF1KB5 TAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP-KGSWTLAPTD-NPPKLSALEMLENS
       :: : :..:::::::. ::. .:. ::: : . . :: . :: . . : :::
CCDS44 GAFLLTTTERSHLLAAQ--HRQAWMGPICQLAFPGTGEASSGSTDAQSPKRGLVPMEENS
       150 160 170 180 190 200

       150 160 170 180 190 200
pF1KB5 LYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEAERLTLLTVGAQSQILEPLLSWPYT
       .:: : ..: :.::::::: :: :.: .: . . . : ... . : ::::
CCDS44 IYSSWQEVGEFPVVVQRTEAATRCQLKGPALLVLGPDAIQL----REAKGTQALYSWPYH
       210 220 230 240 250 260

       210 220 230 240 250 260
pF1KB5 LLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGNDIFQAVETAIHRQKAQGKA---GQG
       .::..: :: .::::::::: :: : :.:.: . :. .:: :: ::. . :
CCDS44 FLRKFGSDKGVFSFEAGRRCHSGEGLFAFSTPCAPDLCRAVAGAIARQRERLPELTRPQP
       270 280 290 300 310 320

       270 280 290 300 310
pF1KB5 HDVLRADSHEGEVAEGKL-PSPPGPQELLDSPPALYAEPL-DSLRIAPCPSQDSLYSDPL
       . :: : . . :.: ::::. . : ::: .:: . : ..: :
CCDS44 CPLPRATSLPSLDTPGELREMPPGPEPPTSRKMHL-AEPGPQSLPLLLGPEPNDLASGLY
       330 340 350 360 370 380

       320 330 340 350 360 370
pF1KB5 DSTSAQAGEGVQRKKPLYWDLYEHAQQQLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDL
       :. .:. : .. :: .: . :.. .:.: : : .:. .:
CCDS44 ASVCKRAS-GPPGNEHLYENLCVLEASPTLHGGEPEPHEGPGSRSPT-TSPIYHNGQDLS
       390 400 410 420 430

       380 390 400 410 420 430
pF1KB5 PREPKDAWWCQARVKEEGYELPYNPATDDYAVPPPRSTKPLLAPKPQGPAFPEPGTATGS

CCDS44 WPGPANDSTLEAQYRRLLELDQVEGTGRPDPQAGFKAKLVTLLSRERRKGPAPCDRP
       440 450 460 470 480 490

>>CCDS6016.1 DOK2 gene_id:9046|Hs108|chr8 (412 aa)
 initn: 654 init1: 265 opt: 590 Z-score: 487.9 bits: 99.5 E(32554): 7.4e-21
Smith-Waterman score: 724; 38.7% identity (59.4% similar) in 424 aa overlap (2-410:3-392)

       10 20 30 40 50
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRR
       :::: .: :.::.:. :.::. : :: .: ..::::. . : . ::
CCDS60 MGDGAVKQGFLYLQQQQTFGKKWRRFGASLYGGSDCALARLELQE--------GPEKPRR
       10 20 30 40 50

       60 70 80 90 100 110
pF1KB5 LDC--KVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRN
       . :::::..:. :: . :. ..:: :.: .: .:::: : . :::..:
CCDS60 CEAARKVIRLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQAICLL
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KB5 AFPKGSWTLAPTDNPPKLSALEMLENSLYSPTWE-G--SQFWVTVQRTEAAERCGLHGSY
       ::: :. .. : : : :: ::: . : ..: ::.. :::.::: :.:::
CCDS60 AFPGQRKELSGPEG--KQSRPCMEENELYSSAVTVGPHKEFAVTMRPTEASERCHLRGSY
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KB5 VLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQ
       .::. : : :. . : .::: .:::.::::: ::::::::: :: :.: :.
CCDS60 TLRAGESALELW--GGPEPGTQ-LYDWPYRFLRRFGRDKVTFSFEAGRRCVSGEGNFEFE
       180 190 200 210 220

       240 250 260 270 280 290
pF1KB5 TAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGP-QELLDS-P
       : :::.:: :.: :: :: . : . . . . ..:: : .: .. :: :
CCDS60 TRQGNEIFLALEEAISAQKNAAPA--------TPQPQPATIPASLPRPDSPYSRPHDSLP
       230 240 250 260 270

       300 310 320 330 340
pF1KB5 PALYAEPLDSLRIAPCP-SQDSLYSDPLDSTSAQAGE---GVQRKKP-LYWD-LYEHAQQ
       : . :. :: : .:.. :. :.:... . :. :. : : : ::. ..
CCDS60 PPSPTTPVP----APRPRGQEGEYAVPFDAVARSLGKNFRGILAVPPQLLADPLYDSIEE
       280 290 300 310 320 330

       350 360 370 380 390 400
pF1KB5 QLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPK-DAWWCQARVKEEGYELPY-NP
       : :. : :::::::.: . .::: :.::. .:: :: . .. : . .:
CCDS60 TL------PPRPDHIYDEPEGVAAL---SLYDSPQEPRGEAWRRQATADRDPAGLQHVQP
       340 350 360 370 380

       410 420 430 440 450 460
pF1KB5 ATDDYAVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCG
       : .:..
CCDS60 AGQDFSASGWQPGTEYDNVVLKKGPK
       390 400 410

>>CCDS47350.1 DOK3 gene_id:79930|Hs108|chr5 (330 aa)
 initn: 450 init1: 147 opt: 504 Z-score: 419.9 bits: 86.6 E(32554): 4.6e-17
Smith-Waterman score: 504; 35.9% identity (61.3% similar) in 287 aa overlap (1-273:4-276)

       10 20 30 40 50
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHK--GSSSGGGR-
       .. . .: :. : .:: : :::.::.:: ..: :::::: .. . : ...: :
CCDS47 MDPLETPIKDGILYQQHVKFGKKCWRKVWALLYAGGPSGVARLESWEVRDGGLGAAGDRS
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KB5 -GSSRRLDCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQT
       : .:: . .:::::.:::: :. :. :. . :: : :..:::::::. ::.
CCDS47 AGPGRRGERRVIRLADCVSVLPADGESCPRD-TGAFLLTTTERSHLLAAQ--HRQAWMGP
       70 80 90 100 110

       120 130 140 150 160 170
pF1KB5 LCRNAFP-KGSWTLAPTD-NPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLH
       .:. ::: : . . :: . :: . . : :::.:: : ..: :.::::::: :: :.
CCDS47 ICQLAFPGTGEASSGSTDAQSPKRGLVPMEENSIYSSWQEVGEFPVVVQRTEAATRCQLK
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KB5 GSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTF
       : .: . . . : ... . : :::: .::..: ::.... : : . .
CCDS47 GPALLVLGPDAIQL----REAKGTQALYSWPYHFLRKFGSDKILLG------TP-GVSLL
       180 190 200 210 220

       240 250 260 270 280
pF1KB5 TFQTAQGNDIFQAV--ETAIHRQKAQGKAGQGH------DVLRADSHEGEVAEGKLPSPP
       . . .:. . :. .. .. : .:... ::: . .::
CCDS47 ICKGERTDDVSGIILDESLLRAYSVPGAGGHSRVQDSLGPVLREPTFQGERSFLKTSMLR
       230 240 250 260 270 280

       290 300 310 320 330 340
pF1KB5 GPQELLDSPPALYAEPLDSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEH

CCDS47 SLLCSCSWRHPRSQPRTQASCLQGSDCPAPHRNSTSAAHTLGTS
       290 300 310 320 330

481 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 23:20:03 2016 done: Sat Nov 5 23:20:04 2016
 Total Scan time: 3.900 Total Display time: 0.040

Function used was FASTA [36.3.4 Apr, 2011]