FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0464, 222 aa
  1>>>pF1KE0464 222 - 222 aa - 222 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2543+/-0.000325; mu= 9.3700+/- 0.020
 mean_var=83.8383+/-16.548, 0's: 0 Z-trim(116.3): 32 B-trim: 211 in 2/51
 Lambda= 0.140073
 statistics sampled from 27336 (27368) to 27336 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.696), E-opt: 0.2 (0.321), width: 16
 Scan time: 6.300

The best scores are:                                       opt  bits E(85289)
XP_011529968 (OMIM: 612535) PREDICTED: THAP domain ( 222) 1533 319.1 3.6e-87
XP_011529969 (OMIM: 612535) PREDICTED: THAP domain ( 222) 1533 319.1 3.6e-87
NP_653322 (OMIM: 612535) THAP domain-containing pr ( 222) 1533 319.1 3.6e-87
XP_005262829 (OMIM: 612535) PREDICTED: THAP domain ( 181) 1240 259.9   2e-69
XP_005262831 (OMIM: 612535) PREDICTED: THAP domain ( 160)  982 207.7 8.9e-54
XP_016863290 (OMIM: 612535) PREDICTED: THAP domain ( 147)  970 205.3 4.4e-53
XP_006714172 (OMIM: 612535) PREDICTED: THAP domain ( 172)  967 204.7 7.8e-53
NP_001304720 (OMIM: 612535) THAP domain-containing ( 180)  679 146.5 2.7e-35
XP_016863289 (OMIM: 612535) PREDICTED: THAP domain ( 180)  679 146.5 2.7e-35
NP_078948 (OMIM: 612537) DNA transposase THAP9 iso ( 903)  310  72.2 3.1e-12
NP_113623 (OMIM: 612531) THAP domain-containing pr ( 228)  220  53.8 2.7e-07
NP_085050 (OMIM: 609518) THAP domain-containing pr ( 309)  220  53.9 3.6e-07
NP_001008695 (OMIM: 609518) THAP domain-containing ( 309)  220  53.9 3.6e-07
NP_060575 (OMIM: 602629,609520) THAP domain-contai ( 213)  211  52.0 9.1e-07
XP_016864092 (OMIM: 612537) PREDICTED: DNA transpo ( 916)  217  53.4 1.4e-06
NP_057047 (OMIM: 612533) THAP domain-containing pr ( 577)  200  49.9   1e-05
XP_005247073 (OMIM: 612533) PREDICTED: THAP domain ( 711)  200  50.0 1.2e-05
NP_001123947 (OMIM: 612534) THAP domain-containing ( 395)  188  47.4 3.9e-05
XP_011540703 (OMIM: 612532) PREDICTED: THAP domain ( 148)  176  44.8 8.9e-05
XP_016858250 (OMIM: 612532) PREDICTED: THAP domain ( 168)  176  44.9  0.0001
NP_612359 (OMIM: 612532) THAP domain-containing pr ( 175)  176  44.9  0.0001
NP_001182682 (OMIM: 612532) THAP domain-containing ( 239)  176  44.9 0.00014
NP_001182681 (OMIM: 612532) THAP domain-containing ( 238)  174  44.5 0.00018
XP_005263589 (OMIM: 612532) PREDICTED: THAP domain ( 238)  174  44.5 0.00018
NP_004696 (OMIM: 607374) 52 kDa repressor of the i ( 761)  155  40.9  0.0071

>>XP_011529968 (OMIM: 612535) PREDICTED: THAP domain-con (222 aa)
 initn: 1533 init1: 1533 opt: 1533 Z-score: 1686.0 bits: 319.1 E(85289): 3.6e-87
Smith-Waterman score: 1533; 100.0% identity (100.0% similar) in 222 aa overlap (1-222:1-222)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
              130       140       150       160       170       180

              190       200       210       220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
       ::::::::::::::::::::::::::::::::::::::::::
XP_011 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
              190       200       210       220

>>XP_011529969 (OMIM: 612535) PREDICTED: THAP domain-con (222 aa)
 initn: 1533 init1: 1533 opt: 1533 Z-score: 1686.0 bits: 319.1 E(85289): 3.6e-87
Smith-Waterman score: 1533; 100.0% identity (100.0% similar) in 222 aa overlap (1-222:1-222)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
              130       140       150       160       170       180

              190       200       210       220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
       ::::::::::::::::::::::::::::::::::::::::::
XP_011 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
              190       200       210       220

>>NP_653322 (OMIM: 612535) THAP domain-containing protei (222 aa)
 initn: 1533 init1: 1533 opt: 1533 Z-score: 1686.0 bits: 319.1 E(85289): 3.6e-87
Smith-Waterman score: 1533; 100.0% identity (100.0% similar) in 222 aa overlap (1-222:1-222)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_653 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_653 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_653 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
              130       140       150       160       170       180

              190       200       210       220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
       ::::::::::::::::::::::::::::::::::::::::::
NP_653 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
              190       200       210       220

>>XP_005262829 (OMIM: 612535) PREDICTED: THAP domain-con (181 aa)
 initn: 1240 init1: 1240 opt: 1240 Z-score: 1367.4 bits: 259.9 E(85289): 2e-69
Smith-Waterman score: 1240; 100.0% identity (100.0% similar) in 181 aa overlap (42-222:1-181)

              20        30        40        50        60        70
pF1KE0 SRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDVLCSRHFKKTDF
                                     ::::::::::::::::::::::::::::::
XP_005                               MKRLDVNAAGIWEPKKGDVLCSRHFKKTDF
                                             10        20        30

              80        90       100       110       120       130
pF1KE0 DRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNHHLVGASSCIEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 DRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNHHLVGASSCIEE
               40        50        60        70        80        90

             140       150       160       170       180       190
pF1KE0 FQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQKSLRKTIRELK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 FQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQKSLRKTIRELK
              100       110       120       130       140       150

             200       210       220
pF1KE0 DECLISQETANRLDTFCWDCCQESIEQDYIS
       :::::::::::::::::::::::::::::::
XP_005 DECLISQETANRLDTFCWDCCQESIEQDYIS
              160       170       180

>>XP_005262831 (OMIM: 612535) PREDICTED: THAP domain-con (160 aa)
 initn: 999 init1: 978 opt: 982 Z-score: 1086.4 bits: 207.7 E(85289): 8.9e-54
Smith-Waterman score: 982; 88.7% identity (94.3% similar) in 159 aa overlap (1-159:1-159)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
       ::::::::::::::::::.:    ... .. : . .. :
XP_005 HLVGASSCIEEFQSQFIFKHRKRKQEQEEEQKPRREKCIS
              130       140       150       160

              190       200       210       220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS

>>XP_016863290 (OMIM: 612535) PREDICTED: THAP domain-con (147 aa)
 initn: 991 init1: 970 opt: 970 Z-score: 1073.9 bits: 205.3 E(85289): 4.4e-53
Smith-Waterman score: 970; 97.9% identity (99.3% similar) in 141 aa overlap (1-141:1-141)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
       :::::::::::::::::: ..
XP_016 HLVGASSCIEEFQSQFIFTYTSARSLL
              130       140

>>XP_006714172 (OMIM: 612535) PREDICTED: THAP domain-con (172 aa)
 initn: 967 init1: 967 opt: 967 Z-score: 1069.6 bits: 204.7 E(85289): 7.8e-53
Smith-Waterman score: 967; 100.0% identity (100.0% similar) in 138 aa overlap (1-138:1-138)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
       ::::::::::::::::::
XP_006 HLVGASSCIEEFQSQFIFISMFKRKCLFKAKNYFSVTIIANIYKVPIFIQST
              130       140       150       160       170

>>NP_001304720 (OMIM: 612535) THAP domain-containing pro (180 aa)
 initn: 1231 init1: 679 opt: 679 Z-score: 754.7 bits: 146.5 E(85289): 2.7e-35
Smith-Waterman score: 1151; 81.1% identity (81.1% similar) in 222 aa overlap (1-222:1-180)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::
NP_001 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQ------------------------
               70        80        90

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
                         ::::::::::::::::::::::::::::::::::::::::::
NP_001 ------------------EHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
                          100       110       120       130

              190       200       210       220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
       ::::::::::::::::::::::::::::::::::::::::::
NP_001 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
      140       150       160       170       180

>>XP_016863289 (OMIM: 612535) PREDICTED: THAP domain-con (180 aa)
 initn: 1231 init1: 679 opt: 679 Z-score: 754.7 bits: 146.5 E(85289): 2.7e-35
Smith-Waterman score: 1151; 81.1% identity (81.1% similar) in 222 aa overlap (1-222:1-180)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
       ::::::::::::::::::::::::::::::::::::
XP_016 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQ------------------------
               70        80        90

              130       140       150       160       170       180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
                         ::::::::::::::::::::::::::::::::::::::::::
XP_016 ------------------EHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
                          100       110       120       130

              190       200       210       220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
       ::::::::::::::::::::::::::::::::::::::::::
XP_016 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
      140       150       160       170       180

>>NP_078948 (OMIM: 612537) DNA transposase THAP9 isoform (903 aa)
 initn: 284 init1: 284 opt: 310 Z-score: 340.7 bits: 72.2 E(85289): 3.1e-12
Smith-Waterman score: 333; 32.3% identity (59.9% similar) in 217 aa overlap (1-210:1-198)

               10        20        30        40        50        60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
       :.. :::.::..:    :. .::.:: ::::   . ::. :..:.:  .  :: :  : .
NP_078 MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSKWIRAVNRVDPRSKKIWIPGPGAI
               10        20        30        40        50        60

               70        80        90        100       110
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHL-QGKREKLHCRKNFTLKTVPATNYN
       :::.::...::.  .   ::: :..::.  : :.. :: . : . :...  . .: ..
NP_078 LCSKHFQESDFESYGIRRKLKKGAVPSV--SLYKIPQGVHLKGKARQKILKQPLPDNS--
               70        80          90       100       110

     120       130       140       150        160       170
pF1KE0 HHLVGASSCIEEFQSQFIFEHSYSVMDSPKKL-KHKLDHVIGELEDTKESLRNVLD-REK
                 .:  ..   .:.:: . .:  .  .:: .:   :. .:. : .: . :
NP_078 ----------QEVATE---DHNYS-LKTPLTIGAEKLAEVQQMLQVSKKRLISVKNYRMI
                  120           130       140       150       160

       180       190       200           210       220
pF1KE0 RFQKSLRKTIRELKDECLISQETA----NRLDTFCWDCCQESIEQDYIS
       . .:.::  :  : .: :.:.::      ... : :.
NP_078 KKRKGLR-LIDALVEEKLLSEETECLLRAQFSDFKWELYNWRETDEYSAEMKQFACTLYL
             170       180       190       200       210       220

NP_078 CSSKVYDYVRKILKLPHSSILRTWLSKCQPSPGFNSNIFSFLQRRVENGDQLYQYCSLLI
             230       240       250       260       270       280


222 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 06:53:20 2016 done: Thu Nov 3 06:53:21 2016
 Total Scan time: 6.300 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]