FASTA searches a protein or DNA sequence data bank
36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0115, 228 aa
1>>>pF1KE0115 228 - 228 aa - 228 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7180+/-0.000812; mu= 12.3463+/- 0.049
mean_var=75.0082+/-14.861, 0's: 0 Z-trim(108.1): 31 B-trim: 67 in 1/49
Lambda= 0.148088
statistics sampled from 9976 (9999) to 9976 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.307), width: 16
Scan time: 2.220

The best scores are:                                  opt  bits E(32554)
CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12   ( 228)  1515 332.6 1.3e-91
CCDS86.1 THAP3 gene_id:90326|Hs108|chr1      ( 175)   280  68.7 2.7e-12
CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8    ( 213)   277  68.1   5e-12
CCDS55572.1 THAP3 gene_id:90326|Hs108|chr1   ( 239)   275  67.7 7.4e-12
CCDS55573.1 THAP3 gene_id:90326|Hs108|chr1   ( 238)   270  66.6 1.6e-11

>>CCDS9001.1 THAP2 gene_id:83591|Hs108|chr12 (228 aa)
initn: 1515 init1: 1515 opt: 1515 Z-score: 1758.4 bits: 332.6 E(32554): 1.3e-91
Smith-Waterman score: 1515; 100.0% identity (100.0% similar) in 228 aa overlap (1-228:1-228)

10 20 30 40 50 60
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE0 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
70 80 90 100 110 120

130 140 150 160 170 180
pF1KE0 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
130 140 150 160 170 180

190 200 210 220
pF1KE0 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
190 200 210 220

>>CCDS86.1 THAP3 gene_id:90326|Hs108|chr1 (175 aa)
initn: 257 init1: 185 opt: 280 Z-score: 334.1 bits: 68.7 E(32554): 2.7e-12
Smith-Waterman score: 280; 34.6% identity (63.9% similar) in 133 aa overlap (1-131:1-130)

10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
CCDS86 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. : .. :.. .
CCDS86 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSS---QKEKTSPCRS
70 80 90 100 110

120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
::: : . . .:
CCDS86 QVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
120 130 140 150 160 170

>>CCDS6136.1 THAP1 gene_id:55145|Hs108|chr8 (213 aa)
initn: 339 init1: 171 opt: 277 Z-score: 329.4 bits: 68.1 E(32554): 5e-12
Smith-Waterman score: 366; 35.3% identity (63.2% similar) in 201 aa overlap (1-185:1-197)

10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFEA
: .:.: :: . :.: .:::.::: :. ::: :::::: : :.. .::.::
CCDS61 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE0 SCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAG-P---SNLKSNI
.:: ... :: .:::::: .::. .. :...::. ... : : :.. . :
CCDS61 DCFKRECNNKLLKENAVPTIF-LCTEPHD---KKEDLLEPQEQLPPPPLPPPVSQVDAAI
70 80 90 100 110

120 130 140 150 160
pF1KE0 S----------SQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRR
. . .:. .:.:. .. :. .::: .::... .::.:.:: :. :: :.
CCDS61 GLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQ
120 130 140 150 160 170

170 180 190 200 210 220
pF1KE0 WIKATCLVK-NLEANSVLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTK
: .:. . : ..: .:
CCDS61 LEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA
180 190 200 210

>>CCDS55572.1 THAP3 gene_id:90326|Hs108|chr1 (239 aa)
initn: 309 init1: 185 opt: 275 Z-score: 326.3 bits: 67.7 E(32554): 7.4e-12
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)

10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
CCDS55 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. :
CCDS55 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120

120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
CCDS55 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQPSDHSY
130 140 150 160 170 180

>>CCDS55573.1 THAP3 gene_id:90326|Hs108|chr1 (238 aa)
initn: 309 init1: 185 opt: 270 Z-score: 320.6 bits: 66.6 E(32554): 1.6e-11
Smith-Waterman score: 270; 45.2% identity (70.2% similar) in 84 aa overlap (1-82:1-84)

10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
CCDS55 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: :
CCDS55 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
70 80 90 100 110 120

228 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 02:28:16 2016 done: Fri Nov 4 02:28:16 2016
Total Scan time: 2.220 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]