FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0115, 228 aa
  1>>>pF1KE0115 228 - 228 aa - 228 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6906+/-0.00036; mu= 12.7359+/- 0.022
 mean_var=77.8623+/-16.105, 0's: 0 Z-trim(115.1): 62 B-trim: 1433 in 2/51
 Lambda= 0.145349
 statistics sampled from 25277 (25341) to 25277 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.679), E-opt: 0.2 (0.297), width: 16
 Scan time: 6.600

The best scores are:                                       opt bits E(85289)
NP_113623 (OMIM: 612531) THAP domain-containing pr ( 228) 1515 326.8 1.9e-89
NP_612359 (OMIM: 612532) THAP domain-containing pr ( 175)  280  67.7 1.4e-11
XP_011540703 (OMIM: 612532) PREDICTED: THAP domain ( 148)  275  66.7 2.5e-11
NP_060575 (OMIM: 602629,609520) THAP domain-contai ( 213)  277  67.2 2.5e-11
XP_016858250 (OMIM: 612532) PREDICTED: THAP domain ( 168)  275  66.7 2.7e-11
NP_001182682 (OMIM: 612532) THAP domain-containing ( 239)  275  66.8 3.7e-11
XP_005263589 (OMIM: 612532) PREDICTED: THAP domain ( 238)  270  65.7 7.6e-11
NP_001182681 (OMIM: 612532) THAP domain-containing ( 238)  270  65.7 7.6e-11
NP_057047 (OMIM: 612533) THAP domain-containing pr ( 577)  247  61.1 4.5e-09
XP_005247073 (OMIM: 612533) PREDICTED: THAP domain ( 711)  247  61.2 5.3e-09
NP_001123947 (OMIM: 612534) THAP domain-containing ( 395)  230  57.5 3.9e-08
NP_001318031 (OMIM: 612536) THAP domain-containing ( 233)  225  56.3 5.2e-08
NP_689871 (OMIM: 612536) THAP domain-containing pr ( 274)  225  56.3 5.9e-08
NP_004696 (OMIM: 607374) 52 kDa repressor of the i ( 761)  229  57.4 7.7e-08
XP_011529968 (OMIM: 612535) PREDICTED: THAP domain ( 222)  220  55.2 1e-07
XP_011529969 (OMIM: 612535) PREDICTED: THAP domain ( 222)  220  55.2 1e-07
NP_653322 (OMIM: 612535) THAP domain-containing pr ( 222)  220  55.2 1e-07
XP_016859745 (OMIM: 612533) PREDICTED: THAP domain ( 601)  197  50.7 6.6e-06
XP_011509593 (OMIM: 612533) PREDICTED: THAP domain ( 628)  197  50.7 6.9e-06
NP_078948 (OMIM: 612537) DNA transposase THAP9 iso ( 903)  199  51.2 6.9e-06
XP_005262831 (OMIM: 612535) PREDICTED: THAP domain ( 160)  185  47.8 1.3e-05
NP_085050 (OMIM: 609518) THAP domain-containing pr ( 309)  179  46.7 5.2e-05
NP_001008695 (OMIM: 609518) THAP domain-containing ( 309)  179  46.7 5.2e-05
XP_016863290 (OMIM: 612535) PREDICTED: THAP domain ( 147)  169  44.4 0.00012
XP_006714172 (OMIM: 612535) PREDICTED: THAP domain ( 172)  169  44.5 0.00014
NP_001304720 (OMIM: 612535) THAP domain-containing ( 180)  169  44.5 0.00014
XP_016863289 (OMIM: 612535) PREDICTED: THAP domain ( 180)  169  44.5 0.00014
XP_005262829 (OMIM: 612535) PREDICTED: THAP domain ( 181)  169  44.5 0.00014
XP_016864092 (OMIM: 612537) PREDICTED: DNA transpo ( 916)  178  46.8 0.00015
NP_064532 (OMIM: 612538) THAP domain-containing pr ( 257)  153  41.2 0.002
NP_001318032 (OMIM: 612536) THAP domain-containing ( 231)  148  40.1 0.0037
NP_001318033 (OMIM: 612536) THAP domain-containing ( 231)  148  40.1 0.0037

>>NP_113623 (OMIM: 612531) THAP domain-containing protei (228 aa)
 initn: 1515 init1: 1515 opt: 1515 Z-score: 1726.9 bits: 326.8 E(85289): 1.9e-89
Smith-Waterman score: 1515; 100.0% identity (100.0% similar) in 228 aa overlap (1-228:1-228)

               10        20        30        40        50        60
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
              130       140       150       160       170       180

              190       200       210       220
pF1KE0 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
       ::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
              190       200       210       220

>>NP_612359 (OMIM: 612532) THAP domain-containing protei (175 aa)
 initn: 257 init1: 185 opt: 280 Z-score: 329.0 bits: 67.7 E(85289): 1.4e-11
Smith-Waterman score: 280; 34.6% identity (63.9% similar) in 133 aa overlap (1-131:1-130)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
NP_612 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
       ::. :. . :: .::::.: : .... .. ...:. : .. :.. .
NP_612 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSS---QKEKTSPCRS
       70 80 90 100 110

       120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
       ::: : . . .:
NP_612 QVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
       120 130 140 150 160 170

>>XP_011540703 (OMIM: 612532) PREDICTED: THAP domain-con (148 aa)
 initn: 257 init1: 185 opt: 275 Z-score: 324.5 bits: 66.7 E(85289): 2.5e-11
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
XP_011 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
       ::. :. . :: .::::.: : .... .. ...:. :
XP_011 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
       70 80 90 100 110 120

       120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
XP_011 DSPGRNMDTALEELQLPPNAEGHVKQIP
       130 140

>>NP_060575 (OMIM: 602629,609520) THAP domain-containing (213 aa)
 initn: 339 init1: 171 opt: 277 Z-score: 324.4 bits: 67.2 E(85289): 2.5e-11
Smith-Waterman score: 366; 35.3% identity (63.2% similar) in 201 aa overlap (1-185:1-197)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFEA
       : .:.: :: . :.: .:::.::: :. ::: :::::: : :.. .::.::
NP_060 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 SCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAG-P---SNLKSNI
       .:: ... :: .:::::: .::. .. :...::. ... : : :.. . :
NP_060 DCFKRECNNKLLKENAVPTIF-LCTEPHD---KKEDLLEPQEQLPPPPLPPPVSQVDAAI
       70 80 90 100 110

       120 130 140 150 160
pF1KE0 S----------SQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRR
       . . .:. .:.:. .. :. .::: .::... .::.:.:: :. :: :.
NP_060 GLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQ
       120 130 140 150 160 170

       170 180 190 200 210 220
pF1KE0 WIKATCLVK-NLEANSVLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTK
       : .:. . : ..: .:
NP_060 LEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA
       180 190 200 210

>>XP_016858250 (OMIM: 612532) PREDICTED: THAP domain-con (168 aa)
 initn: 257 init1: 185 opt: 275 Z-score: 323.6 bits: 66.7 E(85289): 2.7e-11
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
XP_016 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
       ::. :. . :: .::::.: : .... .. ...:. :
XP_016 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
       70 80 90 100 110 120

       120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
XP_016 DSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
       130 140 150 160

>>NP_001182682 (OMIM: 612532) THAP domain-containing pro (239 aa)
 initn: 309 init1: 185 opt: 275 Z-score: 321.4 bits: 66.8 E(85289): 3.7e-11
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
NP_001 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
       ::. :. . :: .::::.: : .... .. ...:. :
NP_001 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
       70 80 90 100 110 120

       120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
NP_001 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQPSDHSY
       130 140 150 160 170 180

>>XP_005263589 (OMIM: 612532) PREDICTED: THAP domain-con (238 aa)
 initn: 309 init1: 185 opt: 270 Z-score: 315.7 bits: 65.7 E(85289): 7.6e-11
Smith-Waterman score: 270; 45.2% identity (70.2% similar) in 84 aa overlap (1-82:1-84)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
XP_005 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
       ::. :. . :: .::::.: :
XP_005 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
       70 80 90 100 110 120

>>NP_001182681 (OMIM: 612532) THAP domain-containing pro (238 aa)
 initn: 309 init1: 185 opt: 270 Z-score: 315.7 bits: 65.7 E(85289): 7.6e-11
Smith-Waterman score: 270; 45.2% identity (70.2% similar) in 84 aa overlap (1-82:1-84)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
       :: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
NP_001 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
       ::. :. . :: .::::.: :
NP_001 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
       70 80 90 100 110 120

>>NP_057047 (OMIM: 612533) THAP domain-containing protei (577 aa)
 initn: 186 init1: 126 opt: 247 Z-score: 283.9 bits: 61.1 E(85289): 4.5e-09
Smith-Waterman score: 247; 34.5% identity (61.5% similar) in 148 aa overlap (1-143:1-146)

       10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYNK--HINISFHRFPL-DPKRRKEWVRLVRRKNFVPGKHTFLCSKHF
       : :::..:.. .: . .::::::: : :: .:.. :.: :..: :..::::.::
NP_057 MVICCAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHF
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE0 EASCFD--LTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNI
       . :. : : : :: :::.:: . . .. ..:. .:. : . .: . .:
NP_057 TKDSFSKRLEDQHRLLKPTAVPSIFHLTEKKRGAGGHGRTR-RKDASKATGGVRGHSSAA
       70 80 90 100 110

       120 130 140 150 160 170
pF1KE0 SSQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKN
       .:. . . ::: :: . .:..
NP_057 TSRGAAGWSPSSSGNPM-AKPESRRLKQAALQGEATPRAAQEAASQEQAQQALERTPGDG
       120 130 140 150 160 170

>>XP_005247073 (OMIM: 612533) PREDICTED: THAP domain-con (711 aa)
 initn: 166 init1: 126 opt: 247 Z-score: 282.6 bits: 61.2 E(85289): 5.3e-09
Smith-Waterman score: 247; 34.5% identity (61.5% similar) in 148 aa overlap (1-143:108-253)

       10 20
pF1KE0 MPTNCAAAGCATTYNK--HINISFHRFPL-
       : :::..:.. .: . .:::::::
XP_005 SPPRSLPRGGPRAGGRLGPGPGCAAGPRPAMVICCAAVNCSNRQGKGEKRAVSFHRFPLK
       80 90 100 110 120 130

       30 40 50 60 70 80
pF1KE0 DPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEASCFD--LTGQTRRLKMDAVPTIFDFCTH
       : :: .:.. :.: :..: :..::::.:: . :. : : : :: :::.:: . .
XP_005 DSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDSFSKRLEDQHRLLKPTAVPSIFHLTEK
       140 150 160 170 180 190

       90 100 110 120 130 140
pF1KE0 IKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQVLLEHSYAFRNPMEAKKRIIKLEKEI
       .. ..:. .:. : . .: . .: .:. . . ::: :: . .:..
XP_005 KRGAGGHGRTR-RKDASKATGGVRGHSSAATSRGAAGWSPSSSGNPM-AKPESRRLKQAA
       200 210 220 230 240 250

       150 160 170 180 190 200
pF1KE0 ASLRRKMKTCLQKERRATRRWIKATCLVKNLEANSVLPKGTSEHMLPTALSSLPLEDFKI
XP_005 LQGEATPRAAQEAASQEQAQQALERTPGDGLATMVAGSQGKAEASATDAGDESATSSIEG
       260 270 280 290 300 310

228 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 02:28:17 2016 done: Fri Nov 4 02:28:18 2016
 Total Scan time: 6.600 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]