FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8139, 326 aa 1>>>pF1KB8139 326 - 326 aa - 326 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4300+/-0.00088; mu= 14.8925+/- 0.053 mean_var=66.8158+/-13.080, 0's: 0 Z-trim(105.9): 30 B-trim: 0 in 0/51 Lambda= 0.156904 statistics sampled from 8637 (8658) to 8637 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.655), E-opt: 0.2 (0.266), width: 16 Scan time: 2.060 The best scores are: opt bits E(32554) CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 ( 326) 2214 510.1 9.4e-145 CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 ( 365) 1978 456.7 1.3e-128 CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 ( 331) 1422 330.9 8.9e-91 CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 ( 306) 1242 290.1 1.5e-78 CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 ( 198) 756 180.0 1.4e-45 >>CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 (326 aa) initn: 2214 init1: 2214 opt: 2214 Z-score: 2712.2 bits: 510.1 E(32554): 9.4e-145 Smith-Waterman score: 2214; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:1-326) 10 20 30 40 50 60 pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET 250 260 270 280 290 300 310 320 pF1KB8 DLLNRFILLKPKPSQGDSSEAKTPSQ :::::::::::::::::::::::::: CCDS10 DLLNRFILLKPKPSQGDSSEAKTPSQ 310 320 >>CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 (365 aa) initn: 1978 init1: 1978 opt: 1978 Z-score: 2422.8 bits: 456.7 E(32554): 1.3e-128 Smith-Waterman score: 2082; 89.1% identity (89.1% similar) in 358 aa overlap (1-319:1-358) 10 20 30 40 50 60 pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE 190 200 210 220 230 240 250 260 270 280 pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGE----------- ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGESLPCPTPTCQE 250 260 270 280 290 300 290 300 310 320 pF1KB8 ----------------------------GYGAAQASSETDLLNRFILLKPKPSQGDSSEA :::::::::::::::::::::::::::::: CCDS81 ALWRMRPIGQGSFDLALSSEPASVPTGEGYGAAQASSETDLLNRFILLKPKPSQGDSSEA 310 320 330 340 350 360 pF1KB8 KTPSQ CCDS81 KTPSQ >>CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 (331 aa) initn: 1423 init1: 1308 opt: 1422 Z-score: 1743.2 bits: 330.9 E(32554): 8.9e-91 Smith-Waterman score: 1423; 63.9% identity (84.9% similar) in 324 aa overlap (1-324:1-314) 10 20 30 40 50 60 pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT ::.::.:::::::::..::::::.:::::::.:.:::::.::::.::::.. .:. ::: CCDS32 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS :. :.: .::::.:::..:::::: :....::.:.:::::::: : : .::::.:::::: CCDS32 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP :::::::: ::: ::..::::.:.: ::::.:::: .:::::::::::::: .:::: :: CCDS32 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE : ::::::::.: ::::.:::::.::::.::::.:::.:::.::::::::::::.:..:: CCDS32 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET ::...:: .. :: .. . . ::::::::::: .....: : :.:.:... : CCDS32 MEQKARLQTSLTEPMTL--SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQ 250 260 270 280 290 310 320 pF1KB8 DLLNRFILLKPKPSQGDSSEAKTPSQ . :. . .: ::. : CCDS32 TF--------PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ 300 310 320 330 >>CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 (306 aa) initn: 1238 init1: 1176 opt: 1242 Z-score: 1523.5 bits: 290.1 E(32554): 1.5e-78 Smith-Waterman score: 1242; 64.7% identity (87.6% similar) in 275 aa overlap (1-275:1-273) 10 20 30 40 50 60 pF1KB8 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT ::.::.::::::::...::.::::.::::::.:.:::::.::::. ::... .: ::: CCDS13 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS :..::: :.:::: ::..:..: :.::...::.:.:.:::.:: :.:..::.:.:.:::: CCDS13 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP :::::::: ::. ::..::::.:.: :::::.::: ::::.: : :::..::::::.::: CCDS13 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE : .:::::::.: :::::::::..::::. :::..:: :::.::::.:::::::.: ::. CCDS13 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHER-LLQ 190 200 210 220 230 250 260 270 280 290 300 pF1KB8 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET :: : : .:. . . . :::::::.::: CCDS13 SVKNSMLQMKMSER-AASLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTET 240 250 260 270 280 290 310 320 pF1KB8 DLLNRFILLKPKPSQGDSSEAKTPSQ CCDS13 FPAYRSEH 300 >>CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 (198 aa) initn: 751 init1: 689 opt: 756 Z-score: 931.9 bits: 180.0 E(32554): 1.4e-45 Smith-Waterman score: 756; 66.9% identity (85.2% similar) in 169 aa overlap (109-275:1-165) 80 90 100 110 120 130 pF1KB8 AVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDISLGEPDLLAPGVQCEQTDR .::.:.:.:::::::::::: ::. ::..: CCDS13 MECVGTRINDISLGEPDLLATGVEREQSER 10 20 30 140 150 160 170 180 190 pF1KB8 FNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWPLCSLRRYGRDATRFTFEA :::.:.: :::::.::: ::::.: : :::..::::::.:::: .:::::::.: ::::: CCDS13 FNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFTFEA 40 50 60 70 80 90 200 210 220 230 240 250 pF1KB8 GRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLEMEKNVRLLNKGTEHYSYP ::::..::::. :::..:: :::.::::.:::::::.: ::. :: : : .:. . CCDS13 GRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHER-LLQSVKNSMLQMKMSERAA-- 100 110 120 130 140 260 270 280 290 300 310 pF1KB8 CTPTTM--LPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSETDLLNRFILLKPKPSQG . .:: :::::::.::: CCDS13 -SLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTETFPAYRSEH 150 160 170 180 190 326 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 09:55:46 2016 done: Fri Nov 4 09:55:47 2016 Total Scan time: 2.060 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]