FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5858, 490 aa
  1>>>pF1KB5858 490 - 490 aa - 490 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.4193+/-0.00107; mu= 12.2921+/- 0.064
 mean_var=83.1162+/-16.974, 0's: 0 Z-trim(104.9): 35 B-trim: 140 in 1/49
 Lambda= 0.140680
 statistics sampled from 8129 (8151) to 8129 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.625), E-opt: 0.2 (0.25), width: 16
 Scan time: 2.660

The best scores are:                                      opt bits E(32554)
CCDS33240.1 SMYD1 gene_id:150572|Hs108|chr2       ( 490) 3359 691.9 4.2e-199
CCDS82480.1 SMYD1 gene_id:150572|Hs108|chr2       ( 477) 1716 358.4 9.7e-99
CCDS31022.1 SMYD2 gene_id:56950|Hs108|chr1        ( 433)  464 104.3 2.8e-22
CCDS53486.1 SMYD3 gene_id:64754|Hs108|chr1        ( 428)  453 102.1 1.3e-21

>>CCDS33240.1 SMYD1 gene_id:150572|Hs108|chr2 (490 aa)
 initn: 3359 init1: 3359 opt: 3359 Z-score: 3688.1 bits: 691.9 E(32554): 4.2e-199
Smith-Waterman score: 3359; 100.0% identity (100.0% similar) in 490 aa overlap (1-490:1-490)

               10        20        30        40        50        60
pF1KB5 MTIGRMENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MTIGRMENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 KLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKVPNENIRLAARIMWRVEREGTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 KLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKVPNENIRLAARIMWRVEREGTG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 LTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 LTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 GFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 KISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFLGVKDNPKPSQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 KISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFLGVKDNPKPSQE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 VVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 VVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEV
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 LSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 LSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 AILLVTHGPSHPITKDLEAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQVMAEPSNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 AILLVTHGPSHPITKDLEAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQVMAEPSNE
              430       440       450       460       470       480

              490
pF1KB5 PSPALFHKKQ
       ::::::::::
CCDS33 PSPALFHKKQ
              490

>>CCDS82480.1 SMYD1 gene_id:150572|Hs108|chr2 (477 aa)
 initn: 1716 init1: 1716 opt: 1716 Z-score: 1886.1 bits: 358.4 E(32554): 9.7e-99
Smith-Waterman score: 3231; 97.1% identity (97.3% similar) in 490 aa overlap (1-490:1-477)

               10        20        30        40        50        60
pF1KB5 MTIGRMENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 MTIGRMENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 KLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKVPNENIRLAARIMWRVEREGTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 KLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKVPNENIRLAARIMWRVEREGTG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 LTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 LTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 GFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALG
       :::::::::::::::::::::::::::::::::::::::             .:::::::
CCDS82 GFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNG-------------KIELRALG
              190       200       210                    220

              250       260       270       280       290       300
pF1KB5 KISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFLGVKDNPKPSQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 KISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFLGVKDNPKPSQE
       230       240       250       260       270       280

              310       320       330       340       350       360
pF1KB5 VVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 VVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEV
       290       300       310       320       330       340

              370       380       390       400       410       420
pF1KB5 LSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 LSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY
       350       360       370       380       390       400

              430       440       450       460       470       480
pF1KB5 AILLVTHGPSHPITKDLEAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQVMAEPSNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 AILLVTHGPSHPITKDLEAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQVMAEPSNE
       410       420       430       440       450       460

              490
pF1KB5 PSPALFHKKQ
       ::::::::::
CCDS82 PSPALFHKKQ
       470

>>CCDS31022.1 SMYD2 gene_id:56950|Hs108|chr1 (433 aa)
 initn: 665 init1: 254 opt: 464 Z-score: 513.5 bits: 104.3 E(32554): 2.8e-22
Smith-Waterman score: 808; 31.4% identity (63.8% similar) in 437 aa overlap (9-438:9-427)

10 20 30 40 50 60 pF1KB5 MTIGRMENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQE .: : . ::::::.: . : ..:..:. ::. :. . . :. 
:: :.: CCDS31 MRAEGLGGLERFCSPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKE 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 KLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKV--PNENIRLAARIMWRVEREG : .::.:: : ::. :::. : :: ::: . .:. :.:..::.:::. . . . CCDS31 GLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHP 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB5 TGLTEGCLVSVDDLQNHVEHFGEEEQKDL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVI :..: ....:.... ..:.::: . :. .. ... . . . . .:. . CCDS31 ERTPSEKLLAVKEFESHLDKL-DNEKKDLIQSDIAALHHFYSKHLGFPDNDSLVVLFAQV 130 140 150 160 170 180 190 200 210 220 230 pF1KB5 NCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELR ::::::. :.. :. .: .:::...:.::.: :: : ... :.: CCDS31 NCNGFTIEDEE-LSHLGSAIFPDVALMNHSCCPNVIVTYKG-------------TLAEVR 180 190 200 210 220 240 250 260 270 280 290 pF1KB5 ALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFLGVKD-NPK :. .:. :::. .::::.: .:.:. .:. .:.: : :..: : :: . .. . CCDS31 AVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKDKDKAKVEIRKLSDP 230 240 250 260 270 280 300 310 320 330 340 350 pF1KB5 PSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSI :. :....:........:.. .:. :....:. ::. :: :.:.:::.:. CCDS31 PKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQ 290 300 310 320 330 340 360 370 380 390 400 410 pF1KB5 VSEVLSYLQAFEEASFYARRMVDGYMK---LYHPNNAQLGMAVMRAGLTNWHAGNIEVGH . : :.: .: : :..... : : :: : :.. . . : . : . .:. CCDS31 AMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKA---AGE 350 360 370 380 390 400 420 430 440 450 460 470 pF1KB5 GMICKAYAILLVTHGPSHPITKDLEAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQV . :: ::. :.:: .:: .... CCDS31 KALKKAIAIMEVAHGKDHPYISEIKQEIESH 410 420 430 >>CCDS53486.1 SMYD3 gene_id:64754|Hs108|chr1 (428 aa) initn: 708 init1: 239 opt: 453 Z-score: 501.5 bits: 102.1 E(32554): 1.3e-21 Smith-Waterman score: 787; 31.0% identity (62.3% similar) in 448 aa overlap (9-449:6-426) 10 20 30 40 50 60 pF1KB5 MTIGRMENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQE :: :.. .: ::.:. . ....: . .: . . :: :. 
.: CCDS53 MEPLKVEKFATAKRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKE 10 20 30 40 50 70 80 90 100 110 pF1KB5 KLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKR-YGKVPNENIRLAARIMWRVEREGT :: ::.::. :.::. ::: :: .:: ::. .: . : ...:: .:..... .:. CCDS53 KLMRCSQCRVAKYCSAKCQKKAWPDHKRECKCLKSCKPRYPPDSVRLLGRVVFKLM-DGA 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB5 GLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQY----ISHIFG : : ::....... :.... :: : :: .. . :. :. . . :. CCDS53 PSESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB5 VINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIE . ::.::. . . .: ::::..:...:.::.: :::...:: : : . CCDS53 KVICNSFTICNAE-MQEVGVGLYPSISLLNHSCDPNCSIVFN-GPH------------LL 180 190 200 210 220 240 250 260 270 280 290 pF1KB5 LRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKD-DLFLGVKDN :::. : :::::. :.:.: .::::..::. :: :.: : .:: . :: :.. : CCDS53 LRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQDKDADMLTG---- 230 240 250 260 270 300 310 320 330 340 350 pF1KB5 PKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRML ...: ::. ...:.::.. ... ...:. .:. . .. . : :::.:..: CCDS53 ---DEQVWKEV----QESLKKIEELKAHWKWEQVLAMCQAIISSNSERLPDINIYQLKVL 280 290 300 310 320 330 360 370 380 390 400 410 pF1KB5 SIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQL-GMAVMRAGLTNWHAGNIEVGH . . .. : .::: ::. : .. : ... :.. . :. ::..: . : : . . CCDS53 DCAMDACINLGLLEEALFYGTRTMEPY-RIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 340 350 360 370 380 390 420 430 440 450 460 470 pF1KB5 GMICKAYAILLVTHGPSHPITKDLEAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQV . :. :. :::: : . .:: . . . ..: CCDS53 KNLRLAFDIMRVTHGREHSLIEDLILLLEECDANIRAS 400 410 420 480 490 pF1KB5 MAEPSNEPSPALFHKKQ 490 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 16:30:50 2016 done: Sat Nov 5 16:30:50 2016 Total Scan time: 2.660 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]