FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5834, 481 aa
1>>>pF1KB5834 481 - 481 aa - 481 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.4265+/-0.000762; mu= 9.8456+/- 0.046
mean_var=153.8339+/-31.093, 0's: 0 Z-trim(114.1): 25 B-trim: 882 in 1/52
Lambda= 0.103407
statistics sampled from 14674 (14696) to 14674 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.451), width: 16
Scan time: 3.900
The best scores are: opt bits E(32554)
CCDS1954.1 DOK1 gene_id:1796|Hs108|chr2 ( 481) 3323 507.2 1.5e-143
CCDS56125.1 DOK1 gene_id:1796|Hs108|chr2 ( 342) 2367 364.5 1e-100
CCDS82474.1 DOK1 gene_id:1796|Hs108|chr2 ( 177) 1048 167.5 1e-41
CCDS78098.1 DOK3 gene_id:79930|Hs108|chr5 ( 440) 675 112.2 1.2e-24
CCDS4426.1 DOK3 gene_id:79930|Hs108|chr5 ( 496) 675 112.2 1.3e-24
CCDS6016.1 DOK2 gene_id:9046|Hs108|chr8 ( 412) 590 99.5 7.4e-21
CCDS47350.1 DOK3 gene_id:79930|Hs108|chr5 ( 330) 504 86.6 4.6e-17
>>CCDS1954.1 DOK1 gene_id:1796|Hs108|chr2 (481 aa)
initn: 3323 init1: 3323 opt: 3323 Z-score: 2690.5 bits: 507.2 E(32554): 1.5e-143
Smith-Waterman score: 3323; 100.0% identity (100.0% similar) in 481 aa overlap (1-481:1-481)
10 20 30 40 50 60
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 ERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGND
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 ERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGND
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 IFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELLDSPPALYAEPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 IFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELLDSPPALYAEPL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB5 DSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLLKAKLTDPKEDP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 DSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLLKAKLTDPKEDP
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB5 IYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDYAVPPPRSTKPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 IYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDYAVPPPRSTKPL
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB5 LAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVGTDKTGVKSEGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 LAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVGTDKTGVKSEGS
430 440 450 460 470 480
pF1KB5 T
:
CCDS19 T
>>CCDS56125.1 DOK1 gene_id:1796|Hs108|chr2 (342 aa)
initn: 2367 init1: 2367 opt: 2367 Z-score: 1921.7 bits: 364.5 E(32554): 1e-100
Smith-Waterman score: 2367; 100.0% identity (100.0% similar) in 342 aa overlap (140-481:1-342)
110 120 130 140 150 160
pF1KB5 WVQTLCRNAFPKGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCG
::::::::::::::::::::::::::::::
CCDS56 MLENSLYSPTWEGSQFWVTVQRTEAAERCG
10 20 30
170 180 190 200 210 220
pF1KB5 LHGSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LHGSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPG
40 50 60 70 80 90
230 240 250 260 270 280
pF1KB5 TFTFQTAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 TFTFQTAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGPQELL
100 110 120 130 140 150
290 300 310 320 330 340
pF1KB5 DSPPALYAEPLDSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DSPPALYAEPLDSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQQLL
160 170 180 190 200 210
350 360 370 380 390 400
pF1KB5 KAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 KAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPATDDY
220 230 240 250 260 270
410 420 430 440 450 460
pF1KB5 AVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCGLSRVG
280 290 300 310 320 330
470 480
pF1KB5 TDKTGVKSEGST
::::::::::::
CCDS56 TDKTGVKSEGST
340
>>CCDS82474.1 DOK1 gene_id:1796|Hs108|chr2 (177 aa)
initn: 1048 init1: 1048 opt: 1048 Z-score: 862.2 bits: 167.5 E(32554): 1e-41
Smith-Waterman score: 1048; 100.0% identity (100.0% similar) in 152 aa overlap (1-152:1-152)
10 20 30 40 50 60
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRRL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 DCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEA
::::::::::::::::::::::::::::::::
CCDS82 KGSWTLAPTDNPPKLSALEMLENSLYSPTWEGHVLFRGRPPLPLRPWNLHLPDGTGK
130 140 150 160 170
190 200 210 220 230 240
pF1KB5 ERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGND
>>CCDS78098.1 DOK3 gene_id:79930|Hs108|chr5 (440 aa)
initn: 574 init1: 214 opt: 675 Z-score: 556.0 bits: 112.2 E(32554): 1.2e-24
Smith-Waterman score: 675; 37.8% identity (59.3% similar) in 386 aa overlap (1-375:4-379)
10 20 30 40 50
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHK--GSSSGGGR-
.. . .: :. : .:: : :::.::.:: ..: :::::: .. . : ...: :
CCDS78 MDPLETPIKDGILYQQHVKFGKKCWRKVWALLYAGGPSGVARLESWEVRDGGLGAAGDRS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB5 -GSSRRLDCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQT
: .:: . .:::::.:::: :. :. :. . :: : :..:::::::. ::.
CCDS78 AGPGRRGERRVIRLADCVSVLPADGESCPRD-TGAFLLTTTERSHLLAAQ--HRQAWMGP
70 80 90 100 110
120 130 140 150 160 170
pF1KB5 LCRNAFP-KGSWTLAPTD-NPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLH
.:. ::: : . . :: . :: . . : :::.:: : ..: :.::::::: :: :.
CCDS78 ICQLAFPGTGEASSGSTDAQSPKRGLVPMEENSIYSSWQEVGEFPVVVQRTEAATRCQLK
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 GSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTF
: .: . . . : ... . : :::: .::..: :: .::::::::: :: : :
CCDS78 GPALLVLGPDAIQL----REAKGTQALYSWPYHFLRKFGSDKGVFSFEAGRRCHSGEGLF
180 190 200 210 220 230
240 250 260 270 280
pF1KB5 TFQTAQGNDIFQAVETAIHRQKAQGKA---GQGHDVLRADSHEGEVAEGKL-PSPPGPQE
.:.: . :. .:: :: ::. . : . :: : . . :.: ::::.
CCDS78 AFSTPCAPDLCRAVAGAIARQRERLPELTRPQPCPLPRATSLPSLDTPGELREMPPGPEP
240 250 260 270 280 290
290 300 310 320 330 340
pF1KB5 LLDSPPALYAEPL-DSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEHAQQ
. : ::: .:: . : ..: : :. .:. : .. :: .: .
CCDS78 PTSRKMHL-AEPGPQSLPLLLGPEPNDLASGLYASVCKRAS-GPPGNEHLYENLCVLEAS
300 310 320 330 340 350
350 360 370 380 390 400
pF1KB5 QLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPKDAWWCQARVKEEGYELPYNPAT
:.. .:.: : : .:. .:
CCDS78 PTLHGGEPEPHEGPGSRSPT-TSPIYHNGQDLSWPGPANDSTLEAQYRRLLELDQVEGTG
360 370 380 390 400 410
>>CCDS4426.1 DOK3 gene_id:79930|Hs108|chr5 (496 aa)
initn: 574 init1: 214 opt: 675 Z-score: 555.3 bits: 112.2 E(32554): 1.3e-24
Smith-Waterman score: 675; 37.8% identity (59.3% similar) in 386 aa overlap (1-375:60-435)
10 20 30
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLY
.. . .: :. : .:: : :::.::.::
CCDS44 GKCEEFPSSLSSVSPGLEAAALLLAVTMDPLETPIKDGILYQQHVKFGKKCWRKVWALLY
30 40 50 60 70 80
40 50 60 70 80
pF1KB5 PASPHGVARLEFFDHK--GSSSGGGR--GSSRRLDCKVIRLAECVSVAPVTVETPPEPGA
..: :::::: .. . : ...: : : .:: . .:::::.:::: :. :. :. .
CCDS44 AGGPSGVARLESWEVRDGGLGAAGDRSAGPGRRGERRVIRLADCVSVLPADGESCPRD-T
90 100 110 120 130 140
90 100 110 120 130 140
pF1KB5 TAFRLDTAQRSHLLAADAPSSAAWVQTLCRNAFP-KGSWTLAPTD-NPPKLSALEMLENS
:: : :..:::::::. ::. .:. ::: : . . :: . :: . . : :::
CCDS44 GAFLLTTTERSHLLAAQ--HRQAWMGPICQLAFPGTGEASSGSTDAQSPKRGLVPMEENS
150 160 170 180 190 200
150 160 170 180 190 200
pF1KB5 LYSPTWEGSQFWVTVQRTEAAERCGLHGSYVLRVEAERLTLLTVGAQSQILEPLLSWPYT
.:: : ..: :.::::::: :: :.: .: . . . : ... . : ::::
CCDS44 IYSSWQEVGEFPVVVQRTEAATRCQLKGPALLVLGPDAIQL----REAKGTQALYSWPYH
210 220 230 240 250 260
210 220 230 240 250 260
pF1KB5 LLRRYGRDKVMFSFEAGRRCPSGPGTFTFQTAQGNDIFQAVETAIHRQKAQGKA---GQG
.::..: :: .::::::::: :: : :.:.: . :. .:: :: ::. . :
CCDS44 FLRKFGSDKGVFSFEAGRRCHSGEGLFAFSTPCAPDLCRAVAGAIARQRERLPELTRPQP
270 280 290 300 310 320
270 280 290 300 310
pF1KB5 HDVLRADSHEGEVAEGKL-PSPPGPQELLDSPPALYAEPL-DSLRIAPCPSQDSLYSDPL
. :: : . . :.: ::::. . : ::: .:: . : ..: :
CCDS44 CPLPRATSLPSLDTPGELREMPPGPEPPTSRKMHL-AEPGPQSLPLLLGPEPNDLASGLY
330 340 350 360 370 380
320 330 340 350 360 370
pF1KB5 DSTSAQAGEGVQRKKPLYWDLYEHAQQQLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDL
:. .:. : .. :: .: . :.. .:.: : : .:. .:
CCDS44 ASVCKRAS-GPPGNEHLYENLCVLEASPTLHGGEPEPHEGPGSRSPT-TSPIYHNGQDLS
390 400 410 420 430
380 390 400 410 420 430
pF1KB5 PREPKDAWWCQARVKEEGYELPYNPATDDYAVPPPRSTKPLLAPKPQGPAFPEPGTATGS
CCDS44 WPGPANDSTLEAQYRRLLELDQVEGTGRPDPQAGFKAKLVTLLSRERRKGPAPCDRP
440 450 460 470 480 490
>>CCDS6016.1 DOK2 gene_id:9046|Hs108|chr8 (412 aa)
initn: 654 init1: 265 opt: 590 Z-score: 487.9 bits: 99.5 E(32554): 7.4e-21
Smith-Waterman score: 724; 38.7% identity (59.4% similar) in 424 aa overlap (2-410:3-392)
10 20 30 40 50
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSRR
:::: .: :.::.:. :.::. : :: .: ..::::. . : . ::
CCDS60 MGDGAVKQGFLYLQQQQTFGKKWRRFGASLYGGSDCALARLELQE--------GPEKPRR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 LDC--KVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCRN
. :::::..:. :: . :. ..:: :.: .: .:::: : . :::..:
CCDS60 CEAARKVIRLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQAICLL
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 AFPKGSWTLAPTDNPPKLSALEMLENSLYSPTWE-G--SQFWVTVQRTEAAERCGLHGSY
::: :. .. : : : :: ::: . : ..: ::.. :::.::: :.:::
CCDS60 AFPGQRKELSGPEG--KQSRPCMEENELYSSAVTVGPHKEFAVTMRPTEASERCHLRGSY
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 VLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTFQ
.::. : : :. . : .::: .:::.::::: ::::::::: :: :.: :.
CCDS60 TLRAGESALELW--GGPEPGTQ-LYDWPYRFLRRFGRDKVTFSFEAGRRCVSGEGNFEFE
180 190 200 210 220
240 250 260 270 280 290
pF1KB5 TAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGP-QELLDS-P
: :::.:: :.: :: :: . : . . . . ..:: : .: .. :: :
CCDS60 TRQGNEIFLALEEAISAQKNAAPA--------TPQPQPATIPASLPRPDSPYSRPHDSLP
230 240 250 260 270
300 310 320 330 340
pF1KB5 PALYAEPLDSLRIAPCP-SQDSLYSDPLDSTSAQAGE---GVQRKKP-LYWD-LYEHAQQ
: . :. :: : .:.. :. :.:... . :. :. : : : ::. ..
CCDS60 PPSPTTPVP----APRPRGQEGEYAVPFDAVARSLGKNFRGILAVPPQLLADPLYDSIEE
280 290 300 310 320 330
350 360 370 380 390 400
pF1KB5 QLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPK-DAWWCQARVKEEGYELPY-NP
: :. : :::::::.: . .::: :.::. .:: :: . .. : . .:
CCDS60 TL------PPRPDHIYDEPEGVAAL---SLYDSPQEPRGEAWRRQATADRDPAGLQHVQP
340 350 360 370 380
410 420 430 440 450 460
pF1KB5 ATDDYAVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDCG
: .:..
CCDS60 AGQDFSASGWQPGTEYDNVVLKKGPK
390 400 410
>>CCDS47350.1 DOK3 gene_id:79930|Hs108|chr5 (330 aa)
initn: 450 init1: 147 opt: 504 Z-score: 419.9 bits: 86.6 E(32554): 4.6e-17
Smith-Waterman score: 504; 35.9% identity (61.3% similar) in 287 aa overlap (1-273:4-276)
10 20 30 40 50
pF1KB5 MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFDHK--GSSSGGGR-
.. . .: :. : .:: : :::.::.:: ..: :::::: .. . : ...: :
CCDS47 MDPLETPIKDGILYQQHVKFGKKCWRKVWALLYAGGPSGVARLESWEVRDGGLGAAGDRS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB5 -GSSRRLDCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQT
: .:: . .:::::.:::: :. :. :. . :: : :..:::::::. ::.
CCDS47 AGPGRRGERRVIRLADCVSVLPADGESCPRD-TGAFLLTTTERSHLLAAQ--HRQAWMGP
70 80 90 100 110
120 130 140 150 160 170
pF1KB5 LCRNAFP-KGSWTLAPTD-NPPKLSALEMLENSLYSPTWEGSQFWVTVQRTEAAERCGLH
.:. ::: : . . :: . :: . . : :::.:: : ..: :.::::::: :: :.
CCDS47 ICQLAFPGTGEASSGSTDAQSPKRGLVPMEENSIYSSWQEVGEFPVVVQRTEAATRCQLK
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 GSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTF
: .: . . . : ... . : :::: .::..: ::.... : : . .
CCDS47 GPALLVLGPDAIQL----REAKGTQALYSWPYHFLRKFGSDKILLG------TP-GVSLL
180 190 200 210 220
240 250 260 270 280
pF1KB5 TFQTAQGNDIFQAV--ETAIHRQKAQGKAGQGH------DVLRADSHEGEVAEGKLPSPP
. . .:. . :. .. .. : .:... ::: . .::
CCDS47 ICKGERTDDVSGIILDESLLRAYSVPGAGGHSRVQDSLGPVLREPTFQGERSFLKTSMLR
230 240 250 260 270 280
290 300 310 320 330 340
pF1KB5 GPQELLDSPPALYAEPLDSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKKPLYWDLYEH
CCDS47 SLLCSCSWRHPRSQPRTQASCLQGSDCPAPHRNSTSAAHTLGTS
290 300 310 320 330
481 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 23:20:03 2016 done: Sat Nov 5 23:20:04 2016
Total Scan time: 3.900 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]