FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6407, 306 aa
  1>>>pF1KB6407 306 - 306 aa - 306 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4890+/-0.000989; mu= 14.2189+/- 0.059
 mean_var=61.4346+/-12.307, 0's: 0 Z-trim(103.2): 27 B-trim: 0 in 0/48
 Lambda= 0.163632
 statistics sampled from 7304 (7321) to 7304 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.602), E-opt: 0.2 (0.225), width: 16
 Scan time: 2.320

The best scores are:                                 opt  bits E(32554)
CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20   ( 306) 2068 496.9 7.9e-141
CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18  ( 331) 1513 365.9 2.3e-101
CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20   ( 198) 1336 324.1 5.6e-89
CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16   ( 365) 1243 302.2 3.9e-82
CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16   ( 326) 1242 301.9 4.2e-82

>>CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 (306 aa)
 initn: 2068 init1: 2068 opt: 2068 Z-score: 2641.8 bits: 496.9 E(32554): 7.9e-141
Smith-Waterman score: 2068; 100.0% identity (100.0% similar) in 306 aa overlap (1-306:1-306)

               10        20        30        40        50        60
pF1KB6 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP
              130       140       150       160       170       180

              190       200       210       220       230
240 pF1KB6 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERLLQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERLLQS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB6 VKNSMLQMKMSERAASLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTETFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VKNSMLQMKMSERAASLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTETFP 250 260 270 280 290 300 pF1KB6 AYRSEH :::::: CCDS13 AYRSEH >>CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 (331 aa) initn: 1469 init1: 1469 opt: 1513 Z-score: 1933.2 bits: 365.9 E(32554): 2.3e-101 Smith-Waterman score: 1513; 70.1% identity (89.6% similar) in 308 aa overlap (1-306:1-307) 10 20 30 40 50 60 pF1KB6 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT ::::::::::::::.::::.:::..::::::::::::::.::::: ::.::::: .:::: CCDS32 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS ::.:.::..:::. :::::..: :.:.:::::::::.:::.:::: : :::.:::.:::: CCDS32 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP ::::::::.::.:::.:::::::::.::::..:::..:::.: : :::..: .:::. :: CCDS32 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP 130 140 150 160 170 180 190 200 210 220 230 pF1KB6 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERL-LQ ::.:::::::.::::::.::::.:::::: ::::.:: ::::::::.:::::::::: :. CCDS32 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB6 SVKNSMLQMKMSERAASLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVS-SPLKLHRTET ... :: ...: .:: . :::::::.:::::.:.:..: :: . . :. :..: CCDS32 MEQKARLQTSLTE-PMTLSKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQT 250 260 270 280 290 300 pF1KB6 FPAYRSEH ::.: :. 
CCDS32 FPSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ 300 310 320 330 >>CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 (198 aa) initn: 1336 init1: 1336 opt: 1336 Z-score: 1711.0 bits: 324.1 E(32554): 5.6e-89 Smith-Waterman score: 1336; 100.0% identity (100.0% similar) in 198 aa overlap (109-306:1-198) 80 90 100 110 120 130 pF1KB6 AIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDISLGEPDLLATGVEREQSER :::::::::::::::::::::::::::::: CCDS13 MECVGTRINDISLGEPDLLATGVEREQSER 10 20 30 140 150 160 170 180 190 pF1KB6 FNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFTFEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 FNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFTFEA 40 50 60 70 80 90 200 210 220 230 240 250 pF1KB6 GRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERLLQSVKNSMLQMKMSERAASLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERLLQSVKNSMLQMKMSERAASLS 100 110 120 130 140 150 260 270 280 290 300 pF1KB6 TMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTETFPAYRSEH :::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTETFPAYRSEH 160 170 180 190 >>CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 (365 aa) initn: 1238 init1: 1176 opt: 1243 Z-score: 1588.0 bits: 302.2 E(32554): 3.9e-82 Smith-Waterman score: 1243; 62.0% identity (84.9% similar) in 292 aa overlap (1-290:1-292) 10 20 30 40 50 60 pF1KB6 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT ::.::.::::::::...::.::::.::::::.:.:::::.::::. ::... .: ::: CCDS81 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS :..::: :.:::: ::..:..: :.::...::.:.:.:::.:: :.:..::.:.:.:::: CCDS81 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP :::::::: ::. 
::..::::.:.: :::::.::: ::::.: : :::..::::::.::: CCDS81 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP 130 140 150 160 170 180 190 200 210 220 230 pF1KB6 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHER-LLQ : .:::::::.: :::::::::..::::. :::..:: :::.::::.:::::::.: ::. CCDS81 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB6 SVKNSMLQMKMSERAASLSTMVP-LPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTET :: : : .:. . : . :::::::.::: ... .. : : CCDS81 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGESLPCPTPTCQE 250 260 270 280 290 300 300 pF1KB6 FPAYRSEH CCDS81 ALWRMRPIGQGSFDLALSSEPASVPTGEGYGAAQASSETDLLNRFILLKPKPSQGDSSEA 310 320 330 340 350 360 >>CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 (326 aa) initn: 1238 init1: 1176 opt: 1242 Z-score: 1587.6 bits: 301.9 E(32554): 4.2e-82 Smith-Waterman score: 1242; 65.1% identity (87.6% similar) in 275 aa overlap (1-273:1-275) 10 20 30 40 50 60 pF1KB6 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT ::.::.::::::::...::.::::.::::::.:.:::::.::::. ::... .: ::: CCDS10 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS :..::: :.:::: ::..:..: :.::...::.:.:.:::.:: :.:..::.:.:.:::: CCDS10 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP :::::::: ::. ::..::::.:.: :::::.::: ::::.: : :::..::::::.::: CCDS10 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP 130 140 150 160 170 180 190 200 210 220 230 pF1KB6 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHER-LLQ : .:::::::.: :::::::::..::::. :::..:: :::.::::.:::::::.: ::. 
CCDS10 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB6 SVKNSMLQMKMSERAASLSTMVP-LPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTET :: : : .:. . : . :::::::.::: CCDS10 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET 250 260 270 280 290 300 300 pF1KB6 FPAYRSEH CCDS10 DLLNRFILLKPKPSQGDSSEAKTPSQ 310 320 306 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 18:44:28 2016 done: Sat Nov 5 18:44:29 2016 Total Scan time: 2.320 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]