FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8139, 326 aa
1>>>pF1KB8139 326 - 326 aa - 326 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4300+/-0.00088; mu= 14.8925+/- 0.053
mean_var=66.8158+/-13.080, 0's: 0 Z-trim(105.9): 30 B-trim: 0 in 0/51
Lambda= 0.156904
statistics sampled from 8637 (8658) to 8637 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.655), E-opt: 0.2 (0.266), width: 16
Scan time: 2.060
The best scores are: opt bits E(32554)
CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 ( 326) 2214 510.1 9.4e-145
CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 ( 365) 1978 456.7 1.3e-128
CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 ( 331) 1422 330.9 8.9e-91
CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 ( 306) 1242 290.1 1.5e-78
CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 ( 198) 756 180.0 1.4e-45
>>CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 (326 aa)
initn: 2214 init1: 2214 opt: 2214 Z-score: 2712.2 bits: 510.1 E(32554): 9.4e-145
Smith-Waterman score: 2214; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:1-326)
10 20 30 40 50 60
pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET
250 260 270 280 290 300
310 320
pF1KB8 DLLNRFILLKPKPSQGDSSEAKTPSQ
::::::::::::::::::::::::::
CCDS10 DLLNRFILLKPKPSQGDSSEAKTPSQ
310 320
>>CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 (365 aa)
initn: 1978 init1: 1978 opt: 1978 Z-score: 2422.8 bits: 456.7 E(32554): 1.3e-128
Smith-Waterman score: 2082; 89.1% identity (89.1% similar) in 358 aa overlap (1-319:1-358)
10 20 30 40 50 60
pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
190 200 210 220 230 240
250 260 270 280
pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGE-----------
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGESLPCPTPTCQE
250 260 270 280 290 300
290 300 310 320
pF1KB8 ----------------------------GYGAAQASSETDLLNRFILLKPKPSQGDSSEA
::::::::::::::::::::::::::::::
CCDS81 ALWRMRPIGQGSFDLALSSEPASVPTGEGYGAAQASSETDLLNRFILLKPKPSQGDSSEA
310 320 330 340 350 360
pF1KB8 KTPSQ
CCDS81 KTPSQ
>>CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 (331 aa)
initn: 1423 init1: 1308 opt: 1422 Z-score: 1743.2 bits: 330.9 E(32554): 8.9e-91
Smith-Waterman score: 1423; 63.9% identity (84.9% similar) in 324 aa overlap (1-324:1-314)
10 20 30 40 50 60
pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
::.::.:::::::::..::::::.:::::::.:.:::::.::::.::::.. .:. :::
CCDS32 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
:. :.: .::::.:::..:::::: :....::.:.:::::::: : : .::::.::::::
CCDS32 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
:::::::: ::: ::..::::.:.: ::::.:::: .:::::::::::::: .:::: ::
CCDS32 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
: ::::::::.: ::::.:::::.::::.::::.:::.:::.::::::::::::.:..::
CCDS32 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET
::...:: .. :: .. . . ::::::::::: .....: : :.:.:... :
CCDS32 MEQKARLQTSLTEPMTL--SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQ
250 260 270 280 290
310 320
pF1KB8 DLLNRFILLKPKPSQGDSSEAKTPSQ
. :. . .: ::. :
CCDS32 TF--------PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ
300 310 320 330
>>CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 (306 aa)
initn: 1238 init1: 1176 opt: 1242 Z-score: 1523.5 bits: 290.1 E(32554): 1.5e-78
Smith-Waterman score: 1242; 64.7% identity (87.6% similar) in 275 aa overlap (1-275:1-273)
10 20 30 40 50 60
pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
::.::.::::::::...::.::::.::::::.:.:::::.::::. ::... .: :::
CCDS13 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
:..::: :.:::: ::..:..: :.::...::.:.:.:::.:: :.:..::.:.:.::::
CCDS13 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
:::::::: ::. ::..::::.:.: :::::.::: ::::.: : :::..::::::.:::
CCDS13 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
: .:::::::.: :::::::::..::::. :::..:: :::.::::.:::::::.: ::.
CCDS13 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHER-LLQ
190 200 210 220 230
250 260 270 280 290 300
pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET
:: : : .:. . . . :::::::.:::
CCDS13 SVKNSMLQMKMSER-AASLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTET
240 250 260 270 280 290
310 320
pF1KB8 DLLNRFILLKPKPSQGDSSEAKTPSQ
CCDS13 FPAYRSEH
300
>>CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 (198 aa)
initn: 751 init1: 689 opt: 756 Z-score: 931.9 bits: 180.0 E(32554): 1.4e-45
Smith-Waterman score: 756; 66.9% identity (85.2% similar) in 169 aa overlap (109-275:1-165)
80 90 100 110 120 130
pF1KB8 AVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDISLGEPDLLAPGVQCEQTDR
.::.:.:.:::::::::::: ::. ::..:
CCDS13 MECVGTRINDISLGEPDLLATGVEREQSER
10 20 30
140 150 160 170 180 190
pF1KB8 FNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWPLCSLRRYGRDATRFTFEA
:::.:.: :::::.::: ::::.: : :::..::::::.:::: .:::::::.: :::::
CCDS13 FNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFTFEA
40 50 60 70 80 90
200 210 220 230 240 250
pF1KB8 GRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLEMEKNVRLLNKGTEHYSYP
::::..::::. :::..:: :::.::::.:::::::.: ::. :: : : .:. .
CCDS13 GRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHER-LLQSVKNSMLQMKMSERAA--
100 110 120 130 140
260 270 280 290 300 310
pF1KB8 CTPTTM--LPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSETDLLNRFILLKPKPSQG
. .:: :::::::.:::
CCDS13 -SLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTETFPAYRSEH
150 160 170 180 190
326 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 09:55:46 2016 done: Fri Nov 4 09:55:47 2016
Total Scan time: 2.060 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]