FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0716, 412 aa
1>>>pF1KE0716 412 - 412 aa - 412 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.7928+/-0.00091; mu= -1.6806+/- 0.055
mean_var=263.8747+/-53.380, 0's: 0 Z-trim(115.4): 18 B-trim: 0 in 0/53
Lambda= 0.078954
statistics sampled from 15982 (15997) to 15982 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.793), E-opt: 0.2 (0.491), width: 16
Scan time: 3.130
The best scores are:                                 opt  bits E(32554)
CCDS6016.1 DOK2 gene_id:9046|Hs108|chr8      ( 412) 2854 337.8 1.1e-92
CCDS78098.1 DOK3 gene_id:79930|Hs108|chr5    ( 440)  677  89.8 5.3e-18
CCDS4426.1 DOK3 gene_id:79930|Hs108|chr5     ( 496)  677  89.9 5.8e-18
CCDS1954.1 DOK1 gene_id:1796|Hs108|chr2      ( 481)  590  80.0 5.5e-15
CCDS47350.1 DOK3 gene_id:79930|Hs108|chr5    ( 330)  485  67.9 1.6e-11
>>CCDS6016.1 DOK2 gene_id:9046|Hs108|chr8 (412 aa)
initn: 2854 init1: 2854 opt: 2854 Z-score: 1777.2 bits: 337.8 E(32554): 1.1e-92
Smith-Waterman score: 2854; 100.0% identity (100.0% similar) in 412 aa overlap (1-412:1-412)
               10        20        30        40        50        60
pF1KE0 MGDGAVKQGFLYLQQQQTFGKKWRRFGASLYGGSDCALARLELQEGPEKPRRCEAARKVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 MGDGAVKQGFLYLQQQQTFGKKWRRFGASLYGGSDCALARLELQEGPEKPRRCEAARKVI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 RLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQAICLLAFPGQRKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 RLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQAICLLAFPGQRKE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 LSGPEGKQSRPCMEENELYSSAVTVGPHKEFAVTMRPTEASERCHLRGSYTLRAGESALE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 LSGPEGKQSRPCMEENELYSSAVTVGPHKEFAVTMRPTEASERCHLRGSYTLRAGESALE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 LWGGPEPGTQLYDWPYRFLRRFGRDKVTFSFEAGRRCVSGEGNFEFETRQGNEIFLALEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 LWGGPEPGTQLYDWPYRFLRRFGRDKVTFSFEAGRRCVSGEGNFEFETRQGNEIFLALEE
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 AISAQKNAAPATPQPQPATIPASLPRPDSPYSRPHDSLPPPSPTTPVPAPRPRGQEGEYA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 AISAQKNAAPATPQPQPATIPASLPRPDSPYSRPHDSLPPPSPTTPVPAPRPRGQEGEYA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 VPFDAVARSLGKNFRGILAVPPQLLADPLYDSIEETLPPRPDHIYDEPEGVAALSLYDSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 VPFDAVARSLGKNFRGILAVPPQLLADPLYDSIEETLPPRPDHIYDEPEGVAALSLYDSP
              310       320       330       340       350       360

              370       380       390       400       410
pF1KE0 QEPRGEAWRRQATADRDPAGLQHVQPAGQDFSASGWQPGTEYDNVVLKKGPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 QEPRGEAWRRQATADRDPAGLQHVQPAGQDFSASGWQPGTEYDNVVLKKGPK
              370       380       390       400       410
>>CCDS78098.1 DOK3 gene_id:79930|Hs108|chr5 (440 aa)
initn: 566 init1: 262 opt: 677 Z-score: 436.6 bits: 89.8 E(32554): 5.3e-18
Smith-Waterman score: 729; 39.0% identity (57.2% similar) in 423 aa overlap (6-406:8-393)
10 20 30 40 50
pF1KE0 MGDGAVKQGFLYLQQQQTFGKK-WRRFGASLYGGSDCALARLELQE------GPEKPR
.:.:.:: ::. :::: ::. : ::.:. ..:::: : : :
CCDS78 MDPLETPIKDGILY-QQHVKFGKKCWRKVWALLYAGGPSGVARLESWEVRDGGLGAAGDR
10 20 30 40 50
60 70 80 90 100
pF1KE0 RC----EAARKVIRLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQ
.. :.::::.::. : : :: : ::::.::.: : :: .:::: .: :.
CCDS78 SAGPGRRGERRVIRLADCVSVLPADGE-SCPRDTGAFLLTTTERSHLLAAQ--HRQAWMG
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE0 AICLLAFPGQRKELSGPEGKQSRPC-----MEENELYSSAVTVGPHKEFAVTMRPTEASE
:: ::::: . :: :: : :::: .::: :: :: :... :::.
CCDS78 PICQLAFPGTGEASSGSTDAQS-PKRGLVPMEENSIYSSWQEVG---EFPVVVQRTEAAT
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE0 RCHLRGSYTLRAGESALELWGGPEPGTQ-LYDWPYRFLRRFGRDKVTFSFEAGRRCVSGE
::.:.: : : .:..: ::: ::.:::.:::.:: :: .::::::::: :::
CCDS78 RCQLKGPALLVLGPDAIQL--REAKGTQALYSWPYHFLRKFGSDKGVFSFEAGRRCHSGE
180 190 200 210 220 230
230 240 250 260 270
pF1KE0 GNFEFETRQGNEIFLALEEAISAQKNAAPATPQPQPATIP--ASLPRPDSPYS-RPHDSL
: : : : . .. :. ::. :.. : .::: .: .::: :.: :
CCDS78 GLFAFSTPCAPDLCRAVAGAIARQRERLPELTRPQPCPLPRATSLPSLDTPGELREM---
240 250 260 270 280
280 290 300 310 320 330
pF1KE0 PPPSPTTPVPAPRPRGQEGEYAVPFDAVARSLGKNFRGILAVPPQLLADPLYDSI--EET
::.: :. .. : ..:. .:. :. ::. :: :. . .
CCDS78 -PPGPEPPTSRKMHLAEPGPQSLPL-------------LLGPEPNDLASGLYASVCKRAS
290 300 310 320 330
340 350 360 370 380 390
pF1KE0 LPPRPDHIYDEPEGVAALSLYDSPQEPRGEAWRRQATADRDPAGLQHVQPAGQDFSASGW
:: .:.: :.. .: :: :: ... ..:.:. . . :::.: :
CCDS78 GPPGNEHLY---ENLCVLEA--SPTLHGGEPEPHEGPGSRSPT-TSPIYHNGQDLS---W
340 350 360 370 380
400 410
pF1KE0 QPGTEYDNVVLKKGPK
:: :...
CCDS78 -PGPANDSTLEAQYRRLLELDQVEGTGRPDPQAGFKAKLVTLLSRERRKGPAPCDRP
390 400 410 420 430 440
>>CCDS4426.1 DOK3 gene_id:79930|Hs108|chr5 (496 aa)
initn: 566 init1: 262 opt: 677 Z-score: 435.9 bits: 89.9 E(32554): 5.8e-18
Smith-Waterman score: 729; 39.0% identity (57.2% similar) in 423 aa overlap (6-406:64-449)
10 20 30
pF1KE0 MGDGAVKQGFLYLQQQQTFGKK-WRRFGASLYGGS
.:.:.:: ::. :::: ::. : ::.:.
CCDS44 EFPSSLSSVSPGLEAAALLLAVTMDPLETPIKDGILY-QQHVKFGKKCWRKVWALLYAGG
40 50 60 70 80 90
40 50 60 70 80
pF1KE0 DCALARLELQE------GPEKPRRC----EAARKVIRLSDCLRVAEAGGEASSPRDTSAF
..:::: : : : .. :.::::.::. : : :: : ::::.::
CCDS44 PSGVARLESWEVRDGGLGAAGDRSAGPGRRGERRVIRLADCVSVLPADGE-SCPRDTGAF
100 110 120 130 140 150
90 100 110 120 130
pF1KE0 FLETKERLYLLAAPAAERGDWVQAICLLAFPGQRKELSGPEGKQSRPC-----MEENELY
.: : :: .:::: .: :. :: ::::: . :: :: : :::: .:
CCDS44 LLTTTERSHLLAAQ--HRQAWMGPICQLAFPGTGEASSGSTDAQS-PKRGLVPMEENSIY
160 170 180 190 200
140 150 160 170 180 190
pF1KE0 SSAVTVGPHKEFAVTMRPTEASERCHLRGSYTLRAGESALELWGGPEPGTQ-LYDWPYRF
:: :: :: :... :::. ::.:.: : : .:..: ::: ::.:::.:
CCDS44 SSWQEVG---EFPVVVQRTEAATRCQLKGPALLVLGPDAIQL--REAKGTQALYSWPYHF
210 220 230 240 250 260
200 210 220 230 240 250
pF1KE0 LRRFGRDKVTFSFEAGRRCVSGEGNFEFETRQGNEIFLALEEAISAQKNAAPATPQPQPA
::.:: :: .::::::::: :::: : : : . .. :. ::. :.. : .:::
CCDS44 LRKFGSDKGVFSFEAGRRCHSGEGLFAFSTPCAPDLCRAVAGAIARQRERLPELTRPQPC
270 280 290 300 310 320
260 270 280 290 300 310
pF1KE0 TIP--ASLPRPDSPYS-RPHDSLPPPSPTTPVPAPRPRGQEGEYAVPFDAVARSLGKNFR
.: .::: :.: : ::.: :. .. : ..:.
CCDS44 PLPRATSLPSLDTPGELREM----PPGPEPPTSRKMHLAEPGPQSLPL------------
330 340 350 360
320 330 340 350 360 370
pF1KE0 GILAVPPQLLADPLYDSI--EETLPPRPDHIYDEPEGVAALSLYDSPQEPRGEAWRRQAT
.:. :. ::. :: :. . . :: .:.: :.. .: :: :: ...
CCDS44 -LLGPEPNDLASGLYASVCKRASGPPGNEHLY---ENLCVLEA--SPTLHGGEPEPHEGP
370 380 390 400 410 420
380 390 400 410
pF1KE0 ADRDPAGLQHVQPAGQDFSASGWQPGTEYDNVVLKKGPK
..:.:. . . :::.: : :: :...
CCDS44 GSRSPT-TSPIYHNGQDLS---W-PGPANDSTLEAQYRRLLELDQVEGTGRPDPQAGFKA
430 440 450 460 470
CCDS44 KLVTLLSRERRKGPAPCDRP
480 490
>>CCDS1954.1 DOK1 gene_id:1796|Hs108|chr2 (481 aa)
initn: 654 init1: 265 opt: 590 Z-score: 382.5 bits: 80.0 E(32554): 5.5e-15
Smith-Waterman score: 724; 39.1% identity (59.5% similar) in 425 aa overlap (3-392:2-410)
10 20 30 40 50
pF1KE0 MGDGAVKQGFLYLQQQQTFG-KKWRRFGASLYGGSDCALARLELQE--------GPEKPR
:::: .: :.::.:. :: :.::. : :: .: ..::::. . : . :
CCDS19 MDGAVMEGPLFLQSQR-FGTKRWRKTWAVLYPASPHGVARLEFFDHKGSSSGGGRGSSR
10 20 30 40 50
60 70 80 90 100 110
pF1KE0 RCEAARKVIRLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQAICL
: . :::::..:. :: . :. ..:: :.: .: .:::: : . :::..:
CCDS19 RLDC--KVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAADAPSSAAWVQTLCR
60 70 80 90 100 110
120 130 140 150 160
pF1KE0 LAFPGQRKELSGPEG--KQSRPCMEENELYSSAVTVGPHKEFAVTMRPTEASERCHLRGS
::: :. .. : : : :: ::: . : ..: ::.. :::.::: :.::
CCDS19 NAFPKGSWTLAPTDNPPKLSALEMLENSLYSPTWE-G--SQFWVTVQRTEAAERCGLHGS
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE0 YTLRAGESALELWG-GPEPGT--QLYDWPYRFLRRFGRDKVTFSFEAGRRCVSGEGNFEF
:.::. : : : . : .::: .:::.::::: ::::::::: :: :.: :
CCDS19 YVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSFEAGRRCPSGPGTFTF
180 190 200 210 220 230
230 240 250 260 270
pF1KE0 ETRQGNEIFLALEEAISAQKNAAPA--------TPQPQPATIPASLPRPDSPYSRPHDSL
.: :::.:: :.: :: :: . : . . . . ..:: : .: .. ::
CCDS19 QTAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAEGKLPSPPGP-QELLDS-
240 250 260 270 280 290
280 290 300 310 320 330
pF1KE0 PPPSPTTPVP----APRPRGQEGEYAVPFDAVARSLGKNFRGILAVPPQLLADPLYDSIE
:: . :. :: : .:.. :. :.:... . :. :. : : : ::. .
CCDS19 PPALYAEPLDSLRIAPCP-SQDSLYSDPLDSTSAQAGE---GVQRKKP-LYWD-LYEHAQ
300 310 320 330 340
340 350 360 370 380
pF1KE0 ETL------PPRPDHIYDEPEGVAAL---SLYDSPQEPRGEAWRRQATADRDPAGLQHVQ
. : :. : :::::::.: . .::: :.::. .:: :: . .. : . .
CCDS19 QQLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPK-DAWWCQARVKEEGYELPY-N
350 360 370 380 390 400
390 400 410
pF1KE0 PAGQDFSASGWQPGTEYDNVVLKKGPK
:: .:..
CCDS19 PATDDYAVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYSQVQKSGASGSWDC
410 420 430 440 450 460
>>CCDS47350.1 DOK3 gene_id:79930|Hs108|chr5 (330 aa)
initn: 444 init1: 102 opt: 485 Z-score: 320.2 bits: 67.9 E(32554): 1.6e-11
Smith-Waterman score: 485; 46.1% identity (61.6% similar) in 219 aa overlap (6-207:8-216)
10 20 30 40 50
pF1KE0 MGDGAVKQGFLYLQQQQTFGKK-WRRFGASLYGGSDCALARLELQE------GPEKPR
.:.:.:: ::. :::: ::. : ::.:. ..:::: : : :
CCDS47 MDPLETPIKDGILY-QQHVKFGKKCWRKVWALLYAGGPSGVARLESWEVRDGGLGAAGDR
10 20 30 40 50
60 70 80 90 100
pF1KE0 RC----EAARKVIRLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQ
.. :.::::.::. : : :: : ::::.::.: : :: .:::: .: :.
CCDS47 SAGPGRRGERRVIRLADCVSVLPADGE-SCPRDTGAFLLTTTERSHLLAA--QHRQAWMG
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE0 AICLLAFPGQRKELSGPEGKQSR-----PCMEENELYSSAVTVGPHKEFAVTMRPTEASE
:: ::::: . :: :: : :::: .::: :: :: :... :::.
CCDS47 PICQLAFPGTGEASSGSTDAQSPKRGLVP-MEENSIYSSWQEVG---EFPVVVQRTEAAT
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE0 RCHLRGSYTLRAGESALELWGGPEPGTQ-LYDWPYRFLRRFGRDKVTFSFEAGRRCVSGE
::.:.: : : .:..: ::: ::.:::.:::.:: ::.
CCDS47 RCQLKGPALLVLGPDAIQLR--EAKGTQALYSWPYHFLRKFGSDKILLGTPGVSLLICKG
180 190 200 210 220 230
230 240 250 260 270 280
pF1KE0 GNFEFETRQGNEIFLALEEAISAQKNAAPATPQPQPATIPASLPRPDSPYSRPHDSLPPP
CCDS47 ERTDDVSGIILDESLLRAYSVPGAGGHSRVQDSLGPVLREPTFQGERSFLKTSMLRSLLC
240 250 260 270 280 290
412 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 02:56:52 2016 done: Sat Nov 5 02:56:53 2016
Total Scan time: 3.130 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]