FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9646, 204 aa 1>>>pF1KB9646 204 - 204 aa - 204 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8648+/-0.000317; mu= 10.9256+/- 0.020 mean_var=83.2284+/-17.149, 0's: 0 Z-trim(117.4): 167 B-trim: 623 in 1/56 Lambda= 0.140585 statistics sampled from 29271 (29447) to 29271 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.729), E-opt: 0.2 (0.345), width: 16 Scan time: 5.540 The best scores are: opt bits E(85289) NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204) 1412 295.6 3.8e-80 NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446) 439 98.4 1.8e-20 NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317) 424 95.3 1.1e-19 NP_005977 (OMIM: 602148) transcription factor SOX- ( 391) 425 95.5 1.2e-19 NP_009015 (OMIM: 604974) transcription factor SOX- ( 276) 409 92.2 8.4e-19 NP_004180 (OMIM: 604747) transcription factor SOX- ( 240) 402 90.7 2e-18 NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233) 381 86.5 3.8e-17 NP_113627 (OMIM: 612202) transcription factor SOX- ( 388) 352 80.7 3.4e-15 NP_071899 (OMIM: 610928,613674) transcription fact ( 414) 346 79.5 8.3e-15 NP_003099 (OMIM: 600898,615866) transcription fact ( 441) 342 78.7 1.5e-14 NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384) 329 76.1 8.5e-14 NP_008874 (OMIM: 601947) transcription factor SOX- ( 315) 322 74.6 1.9e-13 NP_003098 (OMIM: 184430) transcription factor SOX- ( 474) 321 74.5 3.1e-13 NP_821078 (OMIM: 604975,616803) transcription fact ( 377) 317 73.6 4.5e-13 XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415) 317 73.6 4.9e-13 NP_001248343 (OMIM: 604975,616803) transcription f ( 642) 317 73.8 7e-13 XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715) 317 73.8 7.7e-13 XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715) 317 73.8 7.7e-13 XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715) 317 73.8 7.7e-13 XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715) 317 73.8 7.7e-13 XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716) 317 73.8 7.7e-13 NP_001317714 (OMIM: 604975,616803) transcription f ( 728) 317 73.8 7.8e-13 XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729) 317 73.8 7.8e-13 XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750) 317 73.8 8e-13 NP_694534 (OMIM: 604975,616803) transcription fact ( 750) 317 73.8 8e-13 XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751) 317 73.8 8e-13 NP_001248344 (OMIM: 604975,616803) transcription f ( 753) 317 73.8 8e-13 XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754) 317 73.8 8e-13 NP_008871 (OMIM: 604975,616803) transcription fact ( 763) 317 73.8 8.1e-13 XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764) 317 73.8 8.1e-13 XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792) 317 73.8 8.3e-13 XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793) 317 73.8 8.3e-13 NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509) 314 73.1 8.8e-13 NP_055402 (OMIM: 605923) transcription factor SOX- ( 446) 312 72.6 1e-12 NP_001139283 (OMIM: 607257) transcription factor S ( 801) 314 73.2 1.3e-12 NP_059978 (OMIM: 607257) transcription factor SOX- ( 804) 314 73.2 1.3e-12 NP_201583 (OMIM: 607257) transcription factor SOX- ( 808) 314 73.2 1.3e-12 NP_001139291 (OMIM: 607257) transcription factor S ( 841) 314 73.2 1.3e-12 NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466) 309 72.1 1.7e-12 XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621) 305 71.3 3.7e-12 NP_005677 (OMIM: 604748) transcription factor SOX- ( 622) 305 71.3 3.7e-12 NP_001295094 (OMIM: 606698) transcription factor S ( 448) 294 69.0 1.3e-11 >>NP_003131 (OMIM: 400044,400045,480000) sex-determining (204 aa) initn: 1412 init1: 1412 opt: 1412 Z-score: 1559.9 bits: 295.6 E(85289): 3.8e-80 Smith-Waterman score: 1412; 100.0% identity (100.0% similar) in 204 aa overlap (1-204:1-204) 10 20 30 40 50 60 pF1KB9 MQSYASAMLSVFNSDDYSPAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 MQSYASAMLSVFNSDDYSPAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 KRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 KRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 REKYPNYKYRPRRKAKMLPKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 REKYPNYKYRPRRKAKMLPKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQL 130 140 150 160 170 180 190 200 pF1KB9 GHLPPINAASSPQQRDRYSHWTKL :::::::::::::::::::::::: NP_003 GHLPPINAASSPQQRDRYSHWTKL 190 200 >>NP_005625 (OMIM: 300123,312000,313430) transcription f (446 aa) initn: 449 init1: 430 opt: 439 Z-score: 488.3 bits: 98.4 E(85289): 1.8e-20 Smith-Waterman score: 452; 49.1% identity (72.3% similar) in 159 aa overlap (54-192:133-291) 30 40 50 60 70 80 pF1KB9 NIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKMALENP :. :::::::::::.:::: :::::::::: NP_005 PGGAGKSSANAAGGANSGGGSSGGASGGGGGTDQDRVKRPMNAFMVWSRGQRRKMALENP 110 120 130 140 150 160 90 100 110 120 130 140 pF1KB9 RMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKMLPKN-- .:.::::::.:: .::.::.::: ::..::..:.:.: ..::.::::::::.: : :. NP_005 KMHNSEISKRLGADWKLLTDAEKRPFIDEAKRLRAVHMKEYPDYKYRPRRKTKTLLKKDK 170 180 190 200 210 220 150 160 170 180 pF1KB9 ----CSLLP----------ADPASVLCSEV----QLDNRLYRDDCTKATHSRMEHQLGHL .::: : :.. : : .::. . . .....: ...:::. NP_005 YSLPSGLLPPGAAAAAAAAAAAAAAASSPVGVGQRLDTYTHVNGWANGAYSLVQEQLGYA 230 240 250 260 270 280 190 200 pF1KB9 PPINAASSPQQRDRYSHWTKL : . .: : NP_005 QPPSMSSPPPPPALPPMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASGYGGMAPSATA 290 300 310 320 330 340 >>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa) initn: 434 init1: 416 opt: 424 Z-score: 474.1 bits: 95.3 E(85289): 1.1e-19 Smith-Waterman score: 439; 45.6% identity (69.6% similar) in 171 aa overlap (49-198:31-200) 20 30 40 50 60 70 pF1KB9 PAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKM : :.: : ::::::::::.:::: ::::: NP_003 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQK-NSPDRVKRPMNAFMVWSRGQRRKM 10 20 30 40 50 80 90 100 110 120 130 pF1KB9 ALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAK-M : :::.:.::::::.:: .::.:.:.:: ::..::..:.:.: ...:.::::::::.: . NP_003 AQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTL 60 70 80 90 100 110 140 150 160 170 180 pF1KB9 LPKNCSLLP----ADPASVLCSEV------------QLDNRLYRDDCTKATHSRMEHQLG . :. :: : .. . : : ..:. . . .....: :. ::: NP_003 MKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLG 120 130 140 150 160 170 190 200 pF1KB9 H--LPPINA--ASSPQQRDRYSHWTKL . : .:: :.. : :: NP_003 YPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGS 180 190 200 210 220 230 >>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa) initn: 418 init1: 418 opt: 425 Z-score: 473.8 bits: 95.5 E(85289): 1.2e-19 Smith-Waterman score: 425; 66.3% identity (90.2% similar) in 92 aa overlap (49-140:41-131) 20 30 40 50 60 70 pF1KB9 PAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKM : ..:.: :::::::::::.:::: ::::: NP_005 HSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKAN-QDRVKRPMNAFMVWSRGQRRKM 20 30 40 50 60 80 90 100 110 120 130 pF1KB9 ALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKML : :::.:.::::::.:: .::...:::: ::..::..:.:.: ...:.::::::::.: : NP_005 AQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTL 70 80 90 100 110 120 140 150 160 170 180 190 pF1KB9 PKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRY : NP_005 LKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGGYAHVNGWAN 130 140 150 160 170 180 >>NP_009015 (OMIM: 604974) transcription factor SOX-21 [ (276 aa) initn: 419 init1: 401 opt: 409 Z-score: 458.5 bits: 92.2 E(85289): 8.4e-19 Smith-Waterman score: 409; 69.9% identity (90.4% similar) in 83 aa overlap (58-140:6-88) 30 40 50 60 70 80 pF1KB9 LRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKMALENPRMRN :.::::::::.:::: :::::: :::.:.: NP_009 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN 10 20 30 90 100 110 120 130 140 pF1KB9 SEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKMLPKNCSLLPA :::::.:: .::.:::.:: ::..::..:.::: ...:.:::::::: : : : NP_009 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP 40 50 60 70 80 90 150 160 170 180 190 200 pF1KB9 DPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRYSHWTKL NP_009 VPYGLGGVADAEHPALKAGAGLHAGAGGGLVPESLLANPEKAAAAAAAAAARVFFPQSAA 100 110 120 130 140 150 >>NP_004180 (OMIM: 604747) transcription factor SOX-14 [ (240 aa) initn: 404 init1: 386 opt: 402 Z-score: 451.7 bits: 90.7 E(85289): 2e-18 Smith-Waterman score: 402; 62.2% identity (88.9% similar) in 90 aa overlap (58-146:6-95) 30 40 50 60 70 80 pF1KB9 LRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKMALENPRMRN :..:::::::.:::: :::::: :::.:.: NP_004 MSKPSDHIKRPMNAFMVWSRGQRRKMAQENPKMHN 10 20 30 90 100 110 120 130 140 pF1KB9 SEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAK-MLPKNCSLLP :::::.:: .::.:.:::: :...::..:.:.: ...:.:::::::: : .: :. ..: NP_004 SEISKRLGAEWKLLSEAEKRPYIDEAKRLRAQHMKEHPDYKYRPRRKPKNLLKKDRYVFP 40 50 60 70 80 90 150 160 170 180 190 200 pF1KB9 ADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRYSHWTKL NP_004 LPYLGDTDPLKAAGLPVGASDGLLSAPEKARAFLPPASAPYSLLDPAQFSSSAIQKMGEV 100 110 120 130 140 150 >>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa) initn: 411 init1: 380 opt: 381 Z-score: 428.9 bits: 86.5 E(85289): 3.8e-17 Smith-Waterman score: 381; 62.5% identity (80.7% similar) in 88 aa overlap (58-142:47-134) 30 40 50 60 70 80 pF1KB9 LRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKMALENPRMRN ..::::::::.::: :::.:: .::.:.: NP_008 PAATAAASSSSGPQEREGAGSPAAPGTLPLEKVKRPMNAFMVWSSAQRRQMAQQNPKMHN 20 30 40 50 60 70 90 100 110 120 130 140 pF1KB9 SEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKML---PKNCSL :::::.:: :::.: : :: :: .::..:.: : . ::.:::::::::: :. : NP_008 SEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRDYPDYKYRPRRKAKSSGAGPSRCGQ 80 90 100 110 120 130 150 160 170 180 190 200 pF1KB9 LPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRYSHWTKL NP_008 GRGNLASGGPLWGPGYATTQPSRGFGYRPPSYSTAYLPGSYGSSHCKLEAPSPCSLPQSD 140 150 160 170 180 190 >>NP_113627 (OMIM: 612202) transcription factor SOX-7 [H (388 aa) initn: 352 init1: 320 opt: 352 Z-score: 393.8 bits: 80.7 E(85289): 3.4e-15 Smith-Waterman score: 352; 45.9% identity (79.3% similar) in 111 aa overlap (53-163:39-145) 30 40 50 60 70 80 pF1KB9 ENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKMALEN ::. ..:..::::::.::..:.:...:..: NP_113 PWPEGLECPALDAELSDGQSPPAVPRPPGDKGS-ESRIRRPMNAFMVWAKDERKRLAVQN 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 PRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKMLPKNC : ..:.:.::.:: .:: :: ..: :. .::..:. .: . ::::::::::: :. . : NP_113 PDLHNAELSKMLGKSWKALTLSQKRPYVDEAERLRLQHMQDYPNYKYRPRRK-KQAKRLC 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB9 SLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRYSHWT . . ::. .: : . .: : NP_113 KRV--DPGFLLSSLSRDQNALPEKRSGSRGALGEKEDRGEYSPGTALPSLRGCYHEGPAG 130 140 150 160 170 180 >>NP_071899 (OMIM: 610928,613674) transcription factor S (414 aa) initn: 380 init1: 346 opt: 346 Z-score: 386.8 bits: 79.5 E(85289): 8.3e-15 Smith-Waterman score: 346; 48.3% identity (85.4% similar) in 89 aa overlap (49-137:57-145) 20 30 40 50 60 70 pF1KB9 PAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKM : .... ..:..::::::.::..:.:... NP_071 LGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGESRIRRPMNAFMVWAKDERKRL 30 40 50 60 70 80 80 90 100 110 120 130 pF1KB9 ALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKML : .:: ..:.:.::.:: .:: :: ::: :: .::..:...: . .:::::::::. .. NP_071 AQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVK 90 100 110 120 130 140 140 150 160 170 180 190 pF1KB9 PKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRY NP_071 RLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLGLQFPEQGFPAGPPLLPPHMGGHYRDC 150 160 170 180 190 200 >>NP_003099 (OMIM: 600898,615866) transcription factor S (441 aa) initn: 366 init1: 337 opt: 342 Z-score: 382.0 bits: 78.7 E(85289): 1.5e-14 Smith-Waterman score: 342; 47.5% identity (75.2% similar) in 101 aa overlap (39-139:28-128) 10 20 30 40 50 60 pF1KB9 LSVFNSDDYSPAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFI .:. :. . ... ..:::::::. NP_003 MVQQAESLEAESNLPREALDTEEGEFMACSPVALDESDPDWCKTASGHIKRPMNAFM 10 20 30 40 50 70 80 90 100 110 120 pF1KB9 VWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYK :::. .:::. ..: :.:.::::.:: .:::: ..:: ::..::..:. : ::.:: NP_003 VWSKIERRKIMEQSPDMHNAEISKRLGKRWKMLKDSEKIPFIREAERLRLKHMADYPDYK 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB9 YRPRRKAKMLPKNCSLLPADPASVLCSEVQLDNRLYRDDCTKATHSRMEHQLGHLPPINA ::::.: :: : NP_003 YRPRKKPKMDPSAKPSASQSPEKSAAGGGGGSAGGGAGGAKTSKGSSKKCGKLKAPAAAG 120 130 140 150 160 170 204 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:56:47 2016 done: Fri Nov 4 17:56:48 2016 Total Scan time: 5.540 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]