FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0759, 331 aa 1>>>pF1KE0759 331 - 331 aa - 331 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6808+/-0.00091; mu= 13.7842+/- 0.054 mean_var=63.9089+/-13.072, 0's: 0 Z-trim(105.7): 32 B-trim: 488 in 2/47 Lambda= 0.160433 statistics sampled from 8534 (8555) to 8534 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.633), E-opt: 0.2 (0.263), width: 16 Scan time: 2.310 The best scores are: opt bits E(32554) CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 ( 331) 2226 523.9 6.9e-149 CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 ( 306) 1513 358.9 3.1e-99 CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 ( 326) 1422 337.8 7.2e-93 CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 ( 365) 1395 331.6 6e-91 CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 ( 198) 914 220.2 1.1e-57 >>CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 (331 aa) initn: 2226 init1: 2226 opt: 2226 Z-score: 2786.4 bits: 523.9 E(32554): 6.9e-149 Smith-Waterman score: 2226; 100.0% identity (100.0% similar) in 331 aa overlap (1-331:1-331) 10 20 30 40 50 60 pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 MEQKARLQTSLTEPMTLSKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQTF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MEQKARLQTSLTEPMTLSKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQTF 250 260 270 280 290 300 310 320 330 pF1KE0 PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ ::::::::::::::::::::::::::::::: CCDS32 PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ 310 320 330 >>CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 (306 aa) initn: 1469 init1: 1469 opt: 1513 Z-score: 1895.1 bits: 358.9 E(32554): 3.1e-99 Smith-Waterman score: 1513; 70.1% identity (89.6% similar) in 308 aa overlap (1-307:1-306) 10 20 30 40 50 60 pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT ::::::::::::::.::::.:::..::::::::::::::.::::: ::.::::: .:::: CCDS13 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS ::.:.::..:::. :::::..: :.:.:::::::::.:::.:::: : :::.:::.:::: CCDS13 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP ::::::::.::.:::.:::::::::.::::..:::..:::.: : :::..: .:::. :: CCDS13 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE ::.:::::::.::::::.::::.:::::: ::::.:: ::::::::.:::::::::: :. CCDS13 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERL-LQ 190 200 210 220 230 250 260 270 280 290 pF1KE0 MEQKARLQTSLTE-PMTLSKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQT ... :: ...: .:: . :::::::.:::::.:.:..: :: . . :. :..: CCDS13 SVKNSMLQMKMSERAASLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVS-SPLKLHRTET 240 250 260 270 280 290 300 310 320 330 pF1KE0 FPSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ ::.: :. CCDS13 FPAYRSEH 300 >>CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 (326 aa) initn: 1423 init1: 1308 opt: 1422 Z-score: 1780.8 bits: 337.8 E(32554): 7.2e-93 Smith-Waterman score: 1423; 63.9% identity (84.9% similar) in 324 aa overlap (1-314:1-324) 10 20 30 40 50 60 pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT ::.::.:::::::::..::::::.:::::::.:.:::::.::::.::::.. .:. ::: CCDS10 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS :. :.: .::::.:::..:::::: :....::.:.:::::::: : : .::::.:::::: CCDS10 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP :::::::: ::: ::..::::.:.: ::::.:::: .:::::::::::::: .:::: :: CCDS10 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE : ::::::::.: ::::.:::::.::::.::::.:::.:::.::::::::::::.:..:: CCDS10 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE 190 200 210 220 230 240 250 260 270 280 290 pF1KE0 MEQKARLQTSLTEPMTL--SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQ ::...:: .. :: .. . . ::::::::::: .....: : :.:.:... : CCDS10 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET 250 260 270 280 290 300 300 310 320 330 pF1KE0 TF--------PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ . :. . .: ::. : CCDS10 DLLNRFILLKPKPSQGDSSEAKTPSQ 310 320 >>CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 (365 aa) initn: 1396 init1: 1308 opt: 1395 Z-score: 1746.2 bits: 331.6 E(32554): 6e-91 Smith-Waterman score: 1395; 68.4% identity (89.3% similar) in 291 aa overlap (1-289:1-291) 10 20 30 40 50 60 pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT ::.::.:::::::::..::::::.:::::::.:.:::::.::::.::::.. .:. ::: CCDS81 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS :. :.: .::::.:::..:::::: :....::.:.:::::::: : : .::::.:::::: CCDS81 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP :::::::: ::: ::..::::.:.: ::::.:::: .:::::::::::::: .:::: :: CCDS81 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE : ::::::::.: ::::.:::::.::::.::::.:::.:::.::::::::::::.:..:: CCDS81 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE 190 200 210 220 230 240 250 260 270 280 290 pF1KE0 MEQKARLQTSLTEPMTL--SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQ ::...:: .. :: .. . . ::::::::::: .....: : :... CCDS81 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGESLPCPTPTCQE 250 260 270 280 290 300 300 310 320 330 pF1KE0 TFPSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ CCDS81 ALWRMRPIGQGSFDLALSSEPASVPTGEGYGAAQASSETDLLNRFILLKPKPSQGDSSEA 310 320 330 340 350 360 >>CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 (198 aa) initn: 870 init1: 870 opt: 914 Z-score: 1148.9 bits: 220.2 E(32554): 1.1e-57 Smith-Waterman score: 914; 66.0% identity (87.0% similar) in 200 aa overlap (109-307:1-198) 80 90 100 110 120 130 pF1KE0 AVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDISLGEPDLLAAGVQREQNER :::.:::.::::::::::::.::.:::.:: CCDS13 MECVGTRINDISLGEPDLLATGVEREQSER 10 20 30 140 150 160 170 180 190 pF1KE0 FNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWPLSSLRRYGRDSTWFTFES :::::::.::::..:::..:::.: : :::..: .:::. ::::.:::::::.::::::. CCDS13 FNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFTFEA 40 50 60 70 80 90 200 210 220 230 240 250 pF1KE0 GRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLEMEQKARLQTSLTE-PMTL ::::.:::::: ::::.:: ::::::::.:::::::::: :. ... :: ...: .: CCDS13 GRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERL-LQSVKNSMLQMKMSERAASL 100 110 120 130 140 260 270 280 290 300 310 pF1KE0 SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQTFPSYAPEQSEEAQQPLSR : . :::::::.:::::.:.:..: :: . . :. :..:::.: :. CCDS13 STMVPLPRSAYWQHITRQHSTGQLYRLQDVS-SPLKLHRTETFPAYRSEH 150 160 170 180 190 320 330 pF1KE0 SSSYGFSYSSSLIQ 331 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 03:04:49 2016 done: Sat Nov 5 03:04:49 2016 Total Scan time: 2.310 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]