FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0431, 336 aa
  1>>>pF1KE0431 336 - 336 aa - 336 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9521+/-0.00107; mu= 12.5143+/- 0.064
 mean_var=64.1435+/-13.045, 0's: 0 Z-trim(102.7): 25 B-trim: 129 in 2/46
 Lambda= 0.160139
 statistics sampled from 7083 (7090) to 7083 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.581), E-opt: 0.2 (0.218), width: 16
 Scan time: 2.490

The best scores are:                                  opt  bits E(32554)
CCDS34616.1 NT5C3A gene_id:51251|Hs108|chr7   ( 336) 2175 511.4 4.1e-145
CCDS34617.1 NT5C3A gene_id:51251|Hs108|chr7   ( 297) 1878 442.8 1.6e-124
CCDS55101.1 NT5C3A gene_id:51251|Hs108|chr7   ( 285) 1866 440.0 1.1e-123
CCDS11410.2 NT5C3B gene_id:115024|Hs108|chr17 ( 300) 1174 280.2 1.5e-75


>>CCDS34616.1 NT5C3A gene_id:51251|Hs108|chr7 (336 aa)
 initn: 2175 init1: 2175 opt: 2175 Z-score: 2718.7 bits: 511.4 E(32554): 4.1e-145
Smith-Waterman score: 2175; 100.0% identity (100.0% similar) in 336 aa overlap (1-336:1-336)

               10        20        30        40        50        60
pF1KE0 MRAPSMDRAAVARVGAVASASVCALVAGVVLAQYIFTLKRKTGRKTKIIEMMPEFQKSSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MRAPSMDRAAVARVGAVASASVCALVAGVVLAQYIFTLKRKTGRKTKIIEMMPEFQKSSV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 RIKNPTRVEEIICGLIKGGAAKLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDEC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RIKNPTRVEEIICGLIKGGAAKLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDEC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 RKKLLQLKEKYYAIEVDPVLTVEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RKKLLQLKEKYYAIEVDPVLTVEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVM
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 LKEGYENFFDKLQQHSIPVFIFSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 LKEGYENFFDKLQQHSIPVFIFSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 GFKGELIHVFNKHDGALRNTEYFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GFKGELIHVFNKHDGALRNTEYFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYL
              250       260       270       280       290       300

              310       320       330
pF1KE0 NDRVDELLEKYMDSYDIVLVQDESLEVANSILQKIL
       ::::::::::::::::::::::::::::::::::::
CCDS34 NDRVDELLEKYMDSYDIVLVQDESLEVANSILQKIL
              310       320       330

>>CCDS34617.1 NT5C3A gene_id:51251|Hs108|chr7 (297 aa)
 initn: 1878 init1: 1878 opt: 1878 Z-score: 2348.8 bits: 442.8 E(32554): 1.6e-124
Smith-Waterman score: 1878; 99.3% identity (100.0% similar) in 288 aa overlap (49-336:10-297)

       20        30        40        50        60        70
pF1KE0 SASVCALVAGVVLAQYIFTLKRKTGRKTKIIEMMPEFQKSSVRIKNPTRVEEIICGLIKG
                                     ..::::::::::::::::::::::::::::
CCDS34                      MTNQESAVHVKMMPEFQKSSVRIKNPTRVEEIICGLIKG
                                    10        20        30

       80        90       100       110       120       130
pF1KE0 GAAKLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDECRKKLLQLKEKYYAIEVDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GAAKLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDECRKKLLQLKEKYYAIEVDP
      40        50        60        70        80        90

      140       150       160       170       180       190
pF1KE0 VLTVEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVMLKEGYENFFDKLQQHSIP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 VLTVEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVMLKEGYENFFDKLQQHSIP
     100       110       120       130       140       150

      200       210       220       230       240       250
pF1KE0 VFIFSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLKGFKGELIHVFNKHDGALR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 VFIFSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLKGFKGELIHVFNKHDGALR
     160       170       180       190       200       210

      260       270       280       290       300       310
pF1KE0 NTEYFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYLNDRVDELLEKYMDSYDIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 NTEYFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYLNDRVDELLEKYMDSYDIV
     220       230       240       250       260       270

      320       330
pF1KE0 LVQDESLEVANSILQKIL
       ::::::::::::::::::
CCDS34 LVQDESLEVANSILQKIL
     280       290

>>CCDS55101.1 NT5C3A gene_id:51251|Hs108|chr7 (285 aa)
 initn: 1866 init1: 1866 opt: 1866 Z-score: 2334.1 bits: 440.0 E(32554): 1.1e-123
Smith-Waterman score: 1866; 100.0% identity (100.0% similar) in 285 aa overlap (52-336:1-285)

              30        40        50        60        70        80
pF1KE0 VCALVAGVVLAQYIFTLKRKTGRKTKIIEMMPEFQKSSVRIKNPTRVEEIICGLIKGGAA
                                     ::::::::::::::::::::::::::::::
CCDS55                               MPEFQKSSVRIKNPTRVEEIICGLIKGGAA
                                             10        20        30

              90       100       110       120       130       140
pF1KE0 KLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDECRKKLLQLKEKYYAIEVDPVLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 KLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDECRKKLLQLKEKYYAIEVDPVLT
               40        50        60        70        80        90

             150       160       170       180       190       200
pF1KE0 VEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVMLKEGYENFFDKLQQHSIPVFI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 VEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVMLKEGYENFFDKLQQHSIPVFI
              100       110       120       130       140       150

             210       220       230       240       250       260
pF1KE0 FSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLKGFKGELIHVFNKHDGALRNTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 FSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLKGFKGELIHVFNKHDGALRNTE
              160       170       180       190       200       210

             270       280       290       300       310       320
pF1KE0 YFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYLNDRVDELLEKYMDSYDIVLVQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 YFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYLNDRVDELLEKYMDSYDIVLVQ
              220       230       240       250       260       270

             330
pF1KE0 DESLEVANSILQKIL
       :::::::::::::::
CCDS55 DESLEVANSILQKIL
              280

>>CCDS11410.2 NT5C3B gene_id:115024|Hs108|chr17 (300 aa)
 initn: 1174 init1: 1174 opt: 1174 Z-score: 1469.7 bits: 280.2 E(32554): 1.5e-75
Smith-Waterman score: 1174; 57.1% identity (86.5% similar) in 289 aa overlap (48-336:1-289)

        20        30        40        50        60        70
pF1KE0 ASASVCALVAGVVLAQYIFTLKRKTGRKTKIIEMMPEFQKSSVRIKNPTRVEEIICGLIK
                                     . : .  ..:..: ...: ::.::. .: :
CCDS11                               MAEEVSTLMKATVLMRQPGRVQEIVGALRK
                                             10        20        30

        80        90       100       110       120       130
pF1KE0 GGAAKLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDECRKKLLQLKEKYYAIEVD
       ::. .::.:.:::::::::.:.:::::. .::.:: :....::::.:  : ..:: ::.:
CCDS11 GGGDRLQVISDFDMTLSRFAYNGKRCPSSYNILDNSKIISEECRKELTALLHHYYPIEID
               40        50        60        70        80        90

       140       150       160       170       180       190
pF1KE0 PVLTVEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVMLKEGYENFFDKLQQHSI
       :  ::.:: :.::::.::.:.:: :: . : .. ..: ::..::.:::..::. : ...:
CCDS11 PHRTVKEKLPHMVEWWTKAHNLLCQQKIQKFQIAQVVRESNAMLREGYKTFFNTLYHNNI
              100       110       120       130       140       150

       200       210       220       230       240       250
pF1KE0 PVFIFSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLKGFKGELIHVFNKHDGAL
       :.:::::::::.:::.:::  :.:::...:::.:::.: : :.::::.:::..::...:
CCDS11 PLFIFSAGIGDILEEIIRQMKVFHPNIHIVSNYMDFNEDGFLQGFKGQLIHTYNKNSSAC
              160       170       180       190       200       210

       260       270       280       290       300       310
pF1KE0 RNTEYFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYLNDRVDELLEKYMDSYDI
       .:. ::.::. ..:.:::::: ::: ::::: .:..:::::.:::.:.:  :.:::::::
CCDS11 ENSGYFQQLEGKTNVILLGDSIGDLTMADGVPGVQNILKIGFLNDKVEERRERYMDSYDI
              220       230       240       250       260       270

       320       330
pF1KE0 VLVQDESLEVANSILQKIL
       :: .::.:.:.:..::.::
CCDS11 VLEKDETLDVVNGLLQHILCQGVQLEMQGP
              280       290       300


336 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 09:39:14 2016 done: Thu Nov 3 09:39:14 2016
 Total Scan time: 2.490 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]