FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7930, 213 aa
1>>>pF1KB7930 213 - 213 aa - 213 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.5442+/-0.000265; mu= 6.4511+/- 0.017
mean_var=151.8000+/-30.355, 0's: 0 Z-trim(124.9): 29 B-trim: 727 in 1/56
Lambda= 0.104097
statistics sampled from 47521 (47553) to 47521 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.843), E-opt: 0.2 (0.558), width: 16
Scan time: 7.480
The best scores are:                                       opt  bits E(85289)
NP_060575 (OMIM: 602629,609520) THAP domain-contai ( 213) 1486 233.3 2.3e-61
XP_011540703 (OMIM: 612532) PREDICTED: THAP domain ( 148)  325  58.8 5.4e-09
XP_016858250 (OMIM: 612532) PREDICTED: THAP domain ( 168)  325  58.8 5.9e-09
NP_612359 (OMIM: 612532) THAP domain-containing pr ( 175)  325  58.8 6.1e-09
NP_001182681 (OMIM: 612532) THAP domain-containing ( 238)  325  58.9 7.8e-09
XP_005263589 (OMIM: 612532) PREDICTED: THAP domain ( 238)  325  58.9 7.8e-09
NP_001182682 (OMIM: 612532) THAP domain-containing ( 239)  325  58.9 7.8e-09
NP_113623 (OMIM: 612531) THAP domain-containing pr ( 228)  277  51.7 1.1e-06
NP_057047 (OMIM: 612533) THAP domain-containing pr ( 577)  268  50.7 5.9e-06
XP_005247073 (OMIM: 612533) PREDICTED: THAP domain ( 711)  268  50.7 6.9e-06
NP_001318031 (OMIM: 612536) THAP domain-containing ( 233)  248  47.4 2.3e-05
NP_689871 (OMIM: 612536) THAP domain-containing pr ( 274)  248  47.4 2.6e-05
NP_001123947 (OMIM: 612534) THAP domain-containing ( 395)  232  45.1 0.00019
XP_011529968 (OMIM: 612535) PREDICTED: THAP domain ( 222)  211  41.8 0.0011
XP_011529969 (OMIM: 612535) PREDICTED: THAP domain ( 222)  211  41.8 0.0011
NP_653322 (OMIM: 612535) THAP domain-containing pr ( 222)  211  41.8 0.0011
>>NP_060575 (OMIM: 602629,609520) THAP domain-containing (213 aa)
initn: 1486 init1: 1486 opt: 1486 Z-score: 1222.6 bits: 233.3 E(85289): 2.3e-61
Smith-Waterman score: 1486; 100.0% identity (100.0% similar) in 213 aa overlap (1-213:1-213)
10 20 30 40 50 60
pF1KB7 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 DCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLLM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 DCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLLM
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 PPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 PPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKL
130 140 150 160 170 180
190 200 210
pF1KB7 KEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA
:::::::::::::::::::::::::::::::::
NP_060 KEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA
190 200 210
>>XP_011540703 (OMIM: 612532) PREDICTED: THAP domain-con (148 aa)
initn: 316 init1: 256 opt: 325 Z-score: 282.5 bits: 58.8 E(85289): 5.4e-09
Smith-Waterman score: 325; 46.4% identity (70.1% similar) in 97 aa overlap (1-96:1-97)
10 20 30 40 50
pF1KB7 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT
: .::.: : :::. . : ..::.::..:: : ::: . : :::: ... ::::::
XP_011 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 PDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLL
:.::. : : ::.:::::.: .: .. .. .:
XP_011 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 MPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEK
XP_011 DSPGRNMDTALEELQLPPNAEGHVKQIP
130 140
>>XP_016858250 (OMIM: 612532) PREDICTED: THAP domain-con (168 aa)
initn: 316 init1: 256 opt: 325 Z-score: 281.7 bits: 58.8 E(85289): 5.9e-09
Smith-Waterman score: 325; 46.4% identity (70.1% similar) in 97 aa overlap (1-96:1-97)
10 20 30 40 50
pF1KB7 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT
: .::.: : :::. . : ..::.::..:: : ::: . : :::: ... ::::::
XP_016 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 PDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLL
:.::. : : ::.:::::.: .: .. .. .:
XP_016 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 MPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEK
XP_016 DSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
130 140 150 160
>>NP_612359 (OMIM: 612532) THAP domain-containing protei (175 aa)
initn: 316 init1: 256 opt: 325 Z-score: 281.5 bits: 58.8 E(85289): 6.1e-09
Smith-Waterman score: 325; 46.4% identity (70.1% similar) in 97 aa overlap (1-96:1-97)
10 20 30 40 50
pF1KB7 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT
: .::.: : :::. . : ..::.::..:: : ::: . : :::: ... ::::::
NP_612 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 PDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLL
:.::. : : ::.:::::.: .: .. .. .:
NP_612 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKTSPCRSQVL
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 MPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEK
NP_612 PEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
130 140 150 160 170
>>NP_001182681 (OMIM: 612532) THAP domain-containing pro (238 aa)
initn: 369 init1: 256 opt: 325 Z-score: 279.6 bits: 58.9 E(85289): 7.8e-09
Smith-Waterman score: 344; 32.9% identity (51.6% similar) in 225 aa overlap (1-181:1-223)
10 20 30 40 50
pF1KB7 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT
: .::.: : :::. . : ..::.::..:: : ::: . : :::: ... ::::::
NP_001 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90
pF1KB7 PDCFKRECNNKLLKENAVPTIFLCTEP-----------------HDKKEDLLE-------
:.::. : : ::.:::::.: .: ..:: .:
NP_001 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
70 80 90 100 110 120
100 110 120 130
pF1KB7 -P---------QEQLPP------PPLPPPVSQVDAAIGLLMPPL---QTPVNLSVFCDHN
: . :::: . : :. :.: : .:: . ::.
NP_001 SPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQP--SDHS
130 140 150 160 170
140 150 160 170 180 190
pF1KB7 YTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSER
:.. : .:.. .. :::::.:.. . ::. .:. :
NP_001 YALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQGLQARLGPEQQS
180 190 200 210 220 230
200 210
pF1KB7 GYVILPNDYFEIVEVPA
>>XP_005263589 (OMIM: 612532) PREDICTED: THAP domain-con (238 aa)
initn: 369 init1: 256 opt: 325 Z-score: 279.6 bits: 58.9 E(85289): 7.8e-09
Smith-Waterman score: 344; 32.9% identity (51.6% similar) in 225 aa overlap (1-181:1-223)
10 20 30 40 50
pF1KB7 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT
: .::.: : :::. . : ..::.::..:: : ::: . : :::: ... ::::::
XP_005 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90
pF1KB7 PDCFKRECNNKLLKENAVPTIFLCTEP-----------------HDKKEDLLE-------
:.::. : : ::.:::::.: .: ..:: .:
XP_005 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
70 80 90 100 110 120
100 110 120 130
pF1KB7 -P---------QEQLPP------PPLPPPVSQVDAAIGLLMPPL---QTPVNLSVFCDHN
: . :::: . : :. :.: : .:: . ::.
XP_005 SPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQP--SDHS
130 140 150 160 170
140 150 160 170 180 190
pF1KB7 YTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSER
:.. : .:.. .. :::::.:.. . ::. .:. :
XP_005 YALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQGLQARLGPEQQS
180 190 200 210 220 230
200 210
pF1KB7 GYVILPNDYFEIVEVPA
>>NP_001182682 (OMIM: 612532) THAP domain-containing pro (239 aa)
initn: 369 init1: 256 opt: 325 Z-score: 279.6 bits: 58.9 E(85289): 7.8e-09
Smith-Waterman score: 342; 32.7% identity (51.3% similar) in 226 aa overlap (1-181:1-224)
10 20 30 40 50
pF1KB7 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT
: .::.: : :::. . : ..::.::..:: : ::: . : :::: ... ::::::
NP_001 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90
pF1KB7 PDCFKRECNNKLLKENAVPTIFLCTEP------------------HDKKEDLLE------
:.::. : : ::.:::::.: .: ..:: .:
NP_001 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120
100 110 120 130
pF1KB7 --P---------QEQLPP------PPLPPPVSQVDAAIGLLMPPL---QTPVNLSVFCDH
: . :::: . : :. :.: : .:: . ::
NP_001 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQP--SDH
130 140 150 160 170
140 150 160 170 180 190
pF1KB7 NYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSE
.:.. : .:.. .. :::::.:.. . ::. .:. :
NP_001 SYALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQGLQARLGPEQQ
180 190 200 210 220 230
200 210
pF1KB7 RGYVILPNDYFEIVEVPA
NP_001 S
>>NP_113623 (OMIM: 612531) THAP domain-containing protei (228 aa)
initn: 339 init1: 171 opt: 277 Z-score: 240.9 bits: 51.7 E(85289): 1.1e-06
Smith-Waterman score: 366; 35.3% identity (63.2% similar) in 201 aa overlap (1-197:1-185)
10 20 30 40 50 60
pF1KB7 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
: .:.: :: . :.: .:::.::: :. ::: :::::: : :.. .::.::
NP_113 MPTNCAAAGCATTYNKHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFEA
10 20 30 40 50
70 80 90 100 110
pF1KB7 DCFKRECNNKLLKENAVPTIF-LCTEPHD---KKEDLLEPQEQLPPPPLPPPVSQVDAAI
.:: ... :: .:::::: .::. .. :...::. ... : : :.. . :
NP_113 SCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAG-P---SNLKSNI
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB7 GLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQ
. . .:. .:.:. .. :. .::: .::... .::.:.:: :. :: :.
NP_113 S----------SQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRR
120 130 140 150 160
180 190 200 210
pF1KB7 LEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA
: .:. . : ..: .:
NP_113 WIKATCLVK-NLEANSVLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTK
170 180 190 200 210 220
>>NP_057047 (OMIM: 612533) THAP domain-containing protei (577 aa)
initn: 267 init1: 178 opt: 268 Z-score: 227.9 bits: 50.7 E(85289): 5.9e-06
Smith-Waterman score: 268; 49.4% identity (68.5% similar) in 89 aa overlap (1-85:1-89)
10 20 30 40 50
pF1KB7 MVQSCSAYGCKNRYDKD--KPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHF
:: :.: .:.:: : . ::::.::: . .: ::.: :. ::::: .:::::
NP_057 MVICCAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHF
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 TPDCFKR--ECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAI
: : :.. : ...::: .:::.:: ::
NP_057 TKDSFSKRLEDQHRLLKPTAVPSIFHLTEKKRGAGGHGRTRRKDASKATGGVRGHSSAAT
70 80 90 100 110 120
>>XP_005247073 (OMIM: 612533) PREDICTED: THAP domain-con (711 aa)
initn: 267 init1: 178 opt: 268 Z-score: 226.6 bits: 50.7 E(85289): 6.9e-06
Smith-Waterman score: 268; 49.4% identity (68.5% similar) in 89 aa overlap (1-85:108-196)
10 20
pF1KB7 MVQSCSAYGCKNRYDKD--KPVSFHKFPLT
:: :.: .:.:: : . ::::.:::
XP_005 SPPRSLPRGGPRAGGRLGPGPGCAAGPRPAMVICCAAVNCSNRQGKGEKRAVSFHRFPLK
80 90 100 110 120 130
30 40 50 60 70 80
pF1KB7 RPSLCKEWEAAVRRKNFKPTKYSSICSEHFTPDCFKR--ECNNKLLKENAVPTIFLCTEP
. .: ::.: :. ::::: .:::::: : :.. : ...::: .:::.:: ::
XP_005 DSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDSFSKRLEDQHRLLKPTAVPSIFHLTEK
140 150 160 170 180 190
90 100 110 120 130 140
pF1KB7 HDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLLMPPLQTPVNLSVFCDHNYTVEDTMHQR
XP_005 KRGAGGHGRTRRKDASKATGGVRGHSSAATSRGAAGWSPSSSGNPMAKPESRRLKQAALQ
200 210 220 230 240 250
213 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 10:11:44 2016 done: Sat Nov 5 10:11:45 2016
Total Scan time: 7.480 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]