FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0493, 395 aa
  1>>>pF1KE0493 395 - 395 aa - 395 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4812+/-0.000798; mu= 16.3748+/- 0.048
 mean_var=72.9493+/-14.762, 0's: 0 Z-trim(108.7): 13 B-trim: 176 in 1/49
 Lambda= 0.150163
 statistics sampled from 10400 (10413) to 10400 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.701), E-opt: 0.2 (0.32), width: 16
 Scan time: 3.060

The best scores are:                                 opt  bits  E(32554)
CCDS14099.1 CHKB gene_id:1120|Hs108|chr22    ( 395) 2680 589.7 1.6e-168
CCDS8179.1 CHKA gene_id:1119|Hs108|chr11     ( 439) 1511 336.5 2.9e-92
CCDS8178.1 CHKA gene_id:1119|Hs108|chr11     ( 457) 1227 274.9 1e-73
CCDS8698.1 ETNK1 gene_id:55500|Hs108|chr12   ( 452)  419  99.9 5e-21
CCDS73006.1 ETNK2 gene_id:55224|Hs108|chr1   ( 394)  349  84.7 1.6e-16

>>CCDS14099.1 CHKB gene_id:1120|Hs108|chr22 (395 aa)
 initn: 2680 init1: 2680 opt: 2680 Z-score: 3139.1 bits: 589.7 E(32554): 1.6e-168
Smith-Waterman score: 2680; 100.0% identity (100.0% similar) in 395 aa overlap (1-395:1-395)

               10        20        30        40        50        60
pF1KE0 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCREYLGGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCREYLGGA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 WRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQGVDSLVLES
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 WRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQGVDSLVLES
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 VMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVLSAAIATKMAQFHGMEMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVLSAAIATKMAQFHGMEMP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0
FTKEPHWLFGTMERYLKQIQDLPPTGLPEMNLLEMYSLKDEMGNLRKLLESTPSPVVFCH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 FTKEPHWLFGTMERYLKQIQDLPPTGLPEMNLLEMYSLKDEMGNLRKLLESTPSPVVFCH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 NDIQEGNILLLSEPENADSLMLVDFEYSSYNYRGFDIGNHFCEWVYDYTHEEWPFYKARP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 NDIQEGNILLLSEPENADSLMLVDFEYSSYNYRGFDIGNHFCEWVYDYTHEEWPFYKARP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 TDYPTQEQQLHFIRHYLAEAKKGETLSQEEQRKLEEDLLVEVSRYALASHFFWGLWSILQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 TDYPTQEQQLHFIRHYLAEAKKGETLSQEEQRKLEEDLLVEVSRYALASHFFWGLWSILQ 310 320 330 340 350 360 370 380 390 pF1KE0 ASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHSSS ::::::::::::::::::::::::::::::::::: CCDS14 ASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHSSS 370 380 390 >>CCDS8179.1 CHKA gene_id:1119|Hs108|chr11 (439 aa) initn: 1327 init1: 448 opt: 1511 Z-score: 1769.8 bits: 336.5 E(32554): 2.9e-92 Smith-Waterman score: 1511; 60.8% identity (81.5% similar) in 367 aa overlap (29-388:74-437) 10 20 30 40 50 pF1KE0 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCREYLG : : .. .: :::: ::.:.: CCDS81 ESKQLGGQQPPLALPPPPPLPLPLPLPQPPPPQPPADEQPEPRTR---RRAYLWCKEFLP 50 60 70 80 90 100 60 70 80 90 100 110 pF1KE0 GAWRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQ-GVDSLV :::: .. .:... . :::::.::.::::: ..:.:::.::::::::::: :....: CCDS81 GAWRGLREDEFHISVIRGGLSNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMGAEAMV 110 120 130 140 150 160 120 130 140 150 160 170 pF1KE0 LESVMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVLSAAIATKMAQFHGM ::::::::::::::::.:::.::.:::::.:::: : :.:: : .:: :: ::: :::: CCDS81 LESVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKMATFHGM 170 180 190 200 210 220 180 190 200 210 220 230 pF1KE0 EMPFTKEPHWLFGTMERYLKQIQDLPPTG---LPEMNLLEMYSLKDEMGNLRKLLESTPS .:::.:::.:::::::.:::.. . : . ... : :.: :. 
:::.::::::: CCDS81 KMPFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPS 230 240 250 260 270 280 240 250 260 270 280 290 pF1KE0 PVVFCHNDIQEGNILLLSEPENADS--LMLVDFEYSSYNYRGFDIGNHFCEWVYDYTHEE :::::::: :::::::: ::... :::.:::::::::::::::::::::.:::..:. CCDS81 PVVFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEK 290 300 310 320 330 340 300 310 320 330 340 350 pF1KE0 WPFYKARPTDYPTQEQQLHFIRHYLAEAKKG-ETLSQEEQRKLEEDLLVEVSRYALASHF .::..: :::..:::::: :: .. :.:: ::. ..:..:.::.:.:::::: CCDS81 YPFFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRFALASHF 350 360 370 380 390 400 360 370 380 390 pF1KE0 FWGLWSILQASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHSSS .::::::.::..:.:::::.::::.::. ::.:: .: CCDS81 LWGLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV 410 420 430 >>CCDS8178.1 CHKA gene_id:1119|Hs108|chr11 (457 aa) initn: 1047 init1: 456 opt: 1227 Z-score: 1437.0 bits: 274.9 E(32554): 1e-73 Smith-Waterman score: 1475; 57.9% identity (77.7% similar) in 385 aa overlap (29-388:74-455) 10 20 30 40 50 pF1KE0 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCREYLG : : .. .: :::: ::.:.: CCDS81 ESKQLGGQQPPLALPPPPPLPLPLPLPQPPPPQPPADEQPEPRTR---RRAYLWCKEFLP 50 60 70 80 90 100 60 70 80 90 100 110 pF1KE0 GAWRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQ------- :::: .. .:... . :::::.::.::::: ..:.:::.::::::::::: CCDS81 GAWRGLREDEFHISVIRGGLSNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKE 110 120 130 140 150 160 120 130 140 150 pF1KE0 ------------GVDSLVLESVMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELR :....:::::::::::::::::.:::.::.:::::.:::: : :.:: CCDS81 GSEQAQKENEFQGAEAMVLESVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELS 170 180 190 200 210 220 160 170 180 190 200 210 pF1KE0 EPVLSAAIATKMAQFHGMEMPFTKEPHWLFGTMERYLKQIQDLPPTG---LPEMNLLEMY : .:: :: ::: ::::.:::.:::.:::::::.:::.. . : . ... : : CCDS81 LPDISAEIAEKMATFHGMKMPFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSY 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE0 SLKDEMGNLRKLLESTPSPVVFCHNDIQEGNILLLSEPENADS--LMLVDFEYSSYNYRG .: :. :::.::::::::::::::: :::::::: ::... 
:::.::::::::::: CCDS81 NLPLELENLRSLLESTPSPVVFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRG 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE0 FDIGNHFCEWVYDYTHEEWPFYKARPTDYPTQEQQLHFIRHYLAEAKKG-ETLSQEEQRK ::::::::::.:::..:..::..: :::..:::::: :: .. :.:: ::. CCDS81 FDIGNHFCEWMYDYSYEKYPFFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSI 350 360 370 380 390 400 340 350 360 370 380 390 pF1KE0 LEEDLLVEVSRYALASHFFWGLWSILQASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHS ..:..:.::.:.::::::.::::::.::..:.:::::.::::.::. ::.:: .: CCDS81 IKEEMLLEVNRFALASHFLWGLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV 410 420 430 440 450 pF1KE0 SS >>CCDS8698.1 ETNK1 gene_id:55500|Hs108|chr12 (452 aa) initn: 527 init1: 266 opt: 419 Z-score: 491.1 bits: 99.9 E(32554): 5e-21 Smith-Waterman score: 555; 30.5% identity (59.0% similar) in 383 aa overlap (29-391:97-448) 10 20 30 40 50 pF1KE0 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCRE--- : .:. . . .: :. . ::: CCDS86 SAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEE---HRCREGAL 70 80 90 100 110 120 60 70 80 90 100 110 pF1KE0 ----YLGGAWRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQ .: : .:.:. . . :..: :. : . . . .: ::.:.:: . CCDS86 SLLQHLRPHW---DPQEVTLQLFTDGITNKLIGCYVGNTMEDV------VLVRIYGNKTE 130 140 150 160 170 120 130 140 150 160 170 pF1KE0 GVDSLVLESVMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVLSAAIATKM . . : : .: .. .:::: .: .: ..: .. : ... .:.. :: .. CCDS86 LLVDRDEEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQL 180 190 200 210 220 230 180 190 200 210 220 pF1KE0 AQFHGMEMP---FTKEPHWLFGTMERYLKQIQDLPPTGLPEMNLLEMYS--------LKD :..:... . : :: : .:.. : :::. . .. . . :.. CCDS86 AKIHAIHAHNGWIPKSNLWL--KMGKYFSLI----PTGFADEDINKRFLSDIPSSQILQE 240 250 260 270 280 230 240 250 260 270 280 pF1KE0 EMGNLRKLLESTPSPVVFCHNDIQEGNILLLSEPENADSLMLVDFEYSSYNYRGFDIGNH :: ....: . ::::.::::. ::. :. .....:.:::.::: ..::::: CCDS86 EMTWMKEILSNLGSPVVLCHNDLLCKNIIY---NEKQGDVQFIDYEYSGYNYLAYDIGNH 290 300 310 320 330 340 290 300 310 320 330 pF1KE0 FCEW--VYDYTHEEWPFYKARPTDYPTQEQQLHFIRHYLAEAKKGETLSQEEQRKLEEDL : :. 
: : . . :: .: : ...: :: :. . .. : .: : : CCDS86 FNEFAGVSDVDY----------SLYPDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEIL 350 360 370 380 390 340 350 360 370 380 390 pF1KE0 LVEVSRYALASHFFWGLWSILQASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHSSS ...:...:::::::::::...::..::::: .: :: ::. ::..: ..:.. CCDS86 FIQVNQFALASHFFWGLWALIQAKYSTIEFDFLGYAIVRFNQYFKMKPEVTALKVPE 400 410 420 430 440 450 >>CCDS73006.1 ETNK2 gene_id:55224|Hs108|chr1 (394 aa) initn: 365 init1: 125 opt: 349 Z-score: 410.0 bits: 84.7 E(32554): 1.6e-16 Smith-Waterman score: 392; 29.6% identity (56.9% similar) in 304 aa overlap (55-347:62-338) 30 40 50 60 70 pF1KE0 QSKCPDTTPKRRRASSLSRDAERRAYQWCREYLGGAWRRVQ-------PEELRVYPVSGG . : :: : .: ::..:. . : CCDS73 KAAASASCREPPGPPRAAAVAYFGISVDPDDILPGALRLIQELRPHWKPEQVRTKRFTDG 40 50 60 70 80 90 80 90 100 110 120 130 pF1KE0 LSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQGVDSLVLESVMFAILAERSLGPQLYG ..: : : . . . . ::.:.:: . . . : : .: .: .:.:: CCDS73 ITNKLVACYVEEDMQDC------VLVRVYGERTELLVDRENEVRNFQLLRAHSCAPKLYC 100 110 120 130 140 140 150 160 170 180 190 pF1KE0 VFPEGRLEQYIPSRPLKTQELREPVLSAAIATKMAQFHGMEMPFTKEPHWLFGTMERYLK .: .: .:. . :. ...::: : :: .::..: .. . :. :. :. CCDS73 TFQNGLCYEYMQGVALEPEHIREPRLFRLIALEMAKIHTIHANGSLPKPILWHKMHNYFT 150 160 170 180 190 200 200 210 220 230 240 250 pF1KE0 QIQD-LPPTGLPEMNLLEMYSLKDEMGNLRKLLESTPSPVVFCHNDIQEGNILLLSEPEN ... . :. .. .:. :. :.. :.. : . :::::::::. ::. : CCDS73 LVKNEINPSLSADVPKVEV--LERELAWLKEHLSQLESPVVFCHNDLLCKNIIYDS---I 210 220 230 240 250 260 260 270 280 290 300 310 pF1KE0 ADSLMLVDFEYSSYNYRGFDIGNHFCEWVYDYTHEEWPFYKARPTDY---PTQEQQLHFI . ..:.::..:::..::::::: : : . .:: :..: ::... CCDS73 KGHVRFIDYEYAGYNYQAFDIGNHFNE-----------FAGVNEVDYCLYPARETQLQWL 270 280 290 300 320 330 340 350 360 370 pF1KE0 RHYLAEAKKGETLSQEEQRKLEEDLLVEVSRYALASHFFWGLWSILQASMSTIEFGYLDY :: .:.:: ... .: .. 
: :.:...:: CCDS73 -HYYLQAQKGMAVTPREVQR----LYVQVNKFALGPSCVSSTMTASLQCCRVGNRHGEIA 310 320 330 340 350 360 380 390 pF1KE0 AQSRFQFYFQQKGQLTSVHSSS CCDS73 RLTLSGLFPGVSLLLGSLGPHPEPVLHHRL 370 380 390

395 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 04:23:36 2016 done: Thu Nov 3 04:23:37 2016
 Total Scan time: 3.060 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]