FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9723, 414 aa
  1>>>pF1KB9723 414 - 414 aa - 414 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 10.5939+/-0.000389; mu= -3.7470+/- 0.024
 mean_var=459.3176+/-96.847, 0's: 0 Z-trim(125.1): 107 B-trim: 773 in 1/58
 Lambda= 0.059844
 statistics sampled from 47946 (48095) to 47946 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.835), E-opt: 0.2 (0.564), width: 16
 Scan time: 7.360

The best scores are:                                       opt  bits E(85289)
NP_071899 (OMIM: 610928,613674) transcription fact ( 414) 2980 271.1 3.5e-72
NP_113627 (OMIM: 612202) transcription factor SOX- ( 388)  847  87.0 9.2e-17
NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384)  683  72.8 1.7e-12
NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509)  534  60.1 1.5e-08
NP_055402 (OMIM: 605923) transcription factor SOX- ( 446)  482  55.5 3.1e-07
NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446)  482  55.5 3.1e-07
NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317)  466  53.9 6.4e-07
NP_005977 (OMIM: 602148) transcription factor SOX- ( 391)  451  52.8 1.8e-06
NP_004180 (OMIM: 604747) transcription factor SOX- ( 240)  435  51.1 3.4e-06
NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233)  430  50.7 4.6e-06
NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466)  436  51.6 4.9e-06
NP_009015 (OMIM: 604974) transcription factor SOX- ( 276)  402  48.4 2.7e-05
NP_008874 (OMIM: 601947) transcription factor SOX- ( 315)  397  48.0   4e-05
NP_821078 (OMIM: 604975,616803) transcription fact ( 377)  385  47.0 9.1e-05
XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415)  385  47.1 9.7e-05
XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621)  389  47.7 9.9e-05
NP_005677 (OMIM: 604748) transcription factor SOX- ( 622)  389  47.7 9.9e-05
NP_001248343 (OMIM: 604975,616803) transcription f ( 642)  385  47.3 0.00013
XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715)  385  47.4 0.00014
XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715)  385  47.4 0.00014
XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715)  385  47.4 0.00014
XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715)  385  47.4 0.00014
XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716)  385  47.4 0.00014
NP_001317714 (OMIM: 604975,616803) transcription f ( 728)  385  47.4 0.00014
XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729)  385  47.4 0.00014
NP_694534 (OMIM: 604975,616803) transcription fact ( 750)  385  47.4 0.00014
XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750)  385  47.4 0.00014
XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751)  385  47.4 0.00014
NP_001248344 (OMIM: 604975,616803) transcription f ( 753)  385  47.4 0.00014
XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754)  385  47.4 0.00014
NP_008871 (OMIM: 604975,616803) transcription fact ( 763)  385  47.4 0.00014
XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764)  385  47.4 0.00014
XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792)  385  47.4 0.00015
XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793)  385  47.4 0.00015
NP_003099 (OMIM: 600898,615866) transcription fact ( 441)  377  46.4 0.00016
NP_003098 (OMIM: 184430) transcription factor SOX- ( 474)  376  46.4 0.00018
NP_001139283 (OMIM: 607257) transcription factor S ( 801)  379  46.9 0.00021
NP_059978 (OMIM: 607257) transcription factor SOX- ( 804)  379  46.9 0.00021
NP_201583 (OMIM: 607257) transcription factor SOX- ( 808)  379  46.9 0.00021
NP_001139291 (OMIM: 607257) transcription factor S ( 841)  379  47.0 0.00022
NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204)  346  43.4 0.00064

>>NP_071899 (OMIM: 610928,613674) transcription factor S (414 aa)
 initn: 2980 init1: 2980 opt: 2980 Z-score: 1416.9 bits: 271.1 E(85289): 3.5e-72
Smith-Waterman score: 2980; 100.0% identity (100.0% similar) in 414 aa overlap (1-414:1-414)

               10        20        30        40        50        60
pF1KB9 MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 RAKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 RAKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 AERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 AERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 LQFPEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPDPAFFAAP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 LQFPEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPDPAFFAAP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 MPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLLAPPSALHVYYGAM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 MPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLLAPPSALHVYYGAM
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 GSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPSQPAELLGEVDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 GSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPSQPAELLGEVDR
              310       320       330       340       350       360

              370       380       390       400       410
pF1KB9 TEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYCNYPDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 TEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYCNYPDV
              370       380       390       400       410

>>NP_113627 (OMIM: 612202) transcription factor SOX-7 [H (388 aa)
 initn: 775 init1: 525 opt: 847 Z-score: 422.0 bits: 87.0 E(85289): 9.2e-17
Smith-Waterman score: 847; 43.3% identity (61.9% similar) in 404 aa overlap (27-411:5-385)

10 20 30 40 50 pF1KB9 MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLS-PIGDMKVK-GEAPANSGAPAGA :: :: :.: : : ... :..: : : NP_113 MASLLGAYPWPEGLECPALDAELSDGQSPPAVPRPPGD 10 20 30 60 70 80 90 100 110 pF1KB9 AGRAKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFV : .::::::::::::::::::::::: :::::::::::::::::::::::..:::.: NP_113 KG---SESRIRRPMNAFMVWAKDERKRLAVQNPDLHNAELSKMLGKSWKALTLSQKRPYV 40 50 60 70 80 90 120 130 140 150 160 170 pF1KB9 EEAERLRVQHMQDHPNYKYRPRRRKQVKRL-KRVEGGFL-HGLAEPQAAALGPEGGRVAM .::::::.:::::.:::::::::.::.::: :::. ::: .:.. : : :: . NP_113 DEAERLRLQHMQDYPNYKYRPRRKKQAKRLCKRVDPGFLLSSLSRDQNAL--PEKRSGSR 100 110 120 130 140 150 180 190 200 210 220 pF1KB9 DGLGLQFPEQGFPAGPPLLPPHMGGHYRDCQSLGA---PP--LDGYP--LPTP-DTSPLD .:: . . . : : : . : :.. . :. : .: :: :::: . :::: NP_113 GALGEKEDRGEYSPGTAL--PSLRGCYHEGPAGGGGGGTPSSVDTYPYGLPTPPEMSPLD 160 170 180 190 200 210 230 240 250 260 270 280 pF1KB9 GVDPDPAFFAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIP-GLL ..:. .::..: : . :. :. :. :. .: : : : : : NP_113 VLEPEQTFFSSP----CQEEHGHPRRI-------PHLPGHPYSPEYAPSPLHCSHPLGSL 220 230 240 250 260 290 300 310 320 330 340 pF1KB9 APPSALHVYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTD : .. : .: :: : . . .. :.. :: ::::: : :. : NP_113 ALGQSPGV---SMMSPVPGCPPSPAYYSPATYHPLHSNLQAHLGQLSPPPEH-PGFDALD 270 280 290 300 310 350 360 370 380 390 400 pF1KB9 PSQPAELLGEVDRTEFEQYLHFVCKPEMG---LPYQGHD--SGVN-LPDSHGAISSVVSD . .::::..::.::.:::. .:. . . .:: : :. .. .. 
::..: NP_113 QLSQVELLGDMDRNEFDQYLNTPGHPDSATGAMALSGHVPVSQVTPTGPTETSLISVLAD 320 330 340 350 360 370 410 pF1KB9 ASSAVYYCNYPDV :. :.:: .: NP_113 AT-ATYYNSYSVS 380 >>NP_060889 (OMIM: 137940,601618,607823) transcription f (384 aa) initn: 849 init1: 541 opt: 683 Z-score: 345.5 bits: 72.8 E(85289): 1.7e-12 Smith-Waterman score: 859; 44.0% identity (59.6% similar) in 423 aa overlap (3-408:23-378) 10 20 30 40 pF1KB9 MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIG .: : :.: .. .: ::..: .: : :: NP_060 MQRSPPGYGAQDDPPARRDCAWAPGHGAAAD--TRGLAAGPAALA--APAAPASPPSP-Q 10 20 30 40 50 50 60 70 80 90 pF1KB9 DMKVKGEAPANSG-APAGAAGR-AKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAEL .. :. : .::: . : : :::::::::::::::::::::::::::::::: : NP_060 RSPPRSPEPGRYGLSPAGRGERQAADESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVL 60 70 80 90 100 110 100 110 120 130 140 150 pF1KB9 SKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFL-H ::::::.:: :. :::::::::::::::::..:::::::::::.::... .:.: :.: NP_060 SKMLGKAWKELNAAEKRPFVEEAERLRVQHLRDHPNYKYRPRRKKQARKARRLEPGLLLP 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB9 GLAEPQAAALGPEGGRVAMDGLGLQFPEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGY ::: :: : : . :::. . .:. ::: .:: NP_060 GLAPPQ-----P--------------PPEPFPAASG-----SARAFRELPPLGAE-FDGL 180 190 200 210 220 230 240 250 260 270 pF1KB9 PLPTPDTSPLDGVDP-DPAFFAAPM-PGDC---PAAGTYSYAQVS-DYAGPPEPPAGPMH ::::. :::::..: . ::: : : :: : . :. ...: : .: : . NP_060 GLPTPERSPLDGLEPGEAAFFPPPAAPEDCALRPFRAPYAPTELSRDPGGCYGAPLAEAL 220 230 240 250 260 270 280 290 300 310 320 330 pF1KB9 PRLGPEPAGPSIPGLLAPPSALHVYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPG : .: ::.: . :: :::..:.:: : :: NP_060 -RTAP-PAAP-LAGL---------YYGTLGTPG-----------------------PYPG 280 290 340 350 360 370 380 pF1KB9 QPSPPPEALPCRDGTDPSQPA-ELLGEVDRTEFEQYLHFV-CKPEM-GLPY-----QGHD :::::: : ....: :: .: ..:: :::.:::. .:. :::: . NP_060 PLSPPPEAPPL-ESAEPLGPAADLWADVDLTEFDQYLNCSRTRPDAPGLPYHVALAKLGP 300 310 320 330 340 350 390 400 410 pF1KB9 SGVNLPDSHGAISSVVSDASSAVYYCNYPDV ... :. . ::.. 
::::::::: NP_060 RAMSCPEESSLISAL-SDASSAVYYSACISG 360 370 380 >>NP_000337 (OMIM: 114290,608160,616425) transcription f (509 aa) initn: 513 init1: 429 opt: 534 Z-score: 274.6 bits: 60.1 E(85289): 1.5e-08 Smith-Waterman score: 534; 34.8% identity (55.4% similar) in 345 aa overlap (55-389:92-426) 30 40 50 60 70 80 pF1KB9 AGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGESRIRRPMNAFMVWAKDERK :. . : .:.. ...::::::::::. :. NP_000 KESEEDKFPVCIREAVSQVLKGYDWTLVPMPVRVNGSSKNKPHVKRPMNAFMVWAQAARR 70 80 90 100 110 120 90 100 110 120 130 140 pF1KB9 RLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQ .::.: : :::::::: ::: :. :. .:::::::::::::::: .:::.:::.:::::. NP_000 KLADQYPHLHNAELSKTLGKLWRLLNESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKS 130 140 150 160 170 180 150 160 170 180 190 pF1KB9 VKRLK-RVEGGFLHGLAEPQAA--ALGPEGGRVAMDGLGLQFP--EQGFPAGPPLLPPHM :: . ..: . . :.: :: .. . . .. : ..: ::: :: NP_000 VKNGQAEAEEATEQTHISPNAIFKALQADSPHSSSGMSEVHSPGEHSGQSQGPPT-PPTT 190 200 210 220 230 240 200 210 220 230 240 250 pF1KB9 GGHYRDCQSLGAPPL--DGYPLPTPDTSP-LDGVDPDPAFFAAPMPGDCPAAGTYSYAQV : : : : .: ::: .: .: : : . ... . .. :.. . NP_000 P--KTDVQP-GKADLKREGRPLPEGGRQPPIDFRDVDIGELSSDVISNIE---TFDVNEF 250 260 270 280 290 260 270 280 290 300 310 pF1KB9 SDYAGPPEPPAGPMHPRLGPEPAGPSIPGLLAPP-SALHVYYGAMGSPGAGGGRGFQMQP ..: : :. : .. .: . : : :: ::... . .: . : : NP_000 DQYLPPNGHPGVPATHGQVTYTGSYGISSTAATPASAGHVWMSKQQAPPPPPQQPPQAPP 300 310 320 330 340 350 320 330 340 350 360 370 pF1KB9 QHQHQHQHQHHPPGPGQPSPPPEALPCRDGTD-PSQPAELLGEVDRTEFEQYLHFVCKPE : : : :: ::. ::. . : :.:.. .:: . :. . . NP_000 APQAPPQPQAAPPQ--QPAAPPQQPQAHTLTTLSSEPGQSQRTHIKTEQLSPSHYSEQQQ 360 370 380 390 400 410 380 390 400 410 pF1KB9 MGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYCNYPDV . 
: : : ::: NP_000 HS-PQQIAYSPFNLPHYSPSYPPITRSQYDYTDHQNSSSYYSHAAGQGTGLYSTFTYMNP 420 430 440 450 460 470 >>NP_055402 (OMIM: 605923) transcription factor SOX-8 [H (446 aa) initn: 496 init1: 424 opt: 482 Z-score: 251.0 bits: 55.5 E(85289): 3.1e-07 Smith-Waterman score: 509; 35.2% identity (54.9% similar) in 364 aa overlap (5-352:55-373) 10 20 30 pF1KB9 MSSPDAGYASDDQ--SQTQSALPAVMAGLGPCPW : . :.:.. . ..:. :. : : NP_055 VEDSDSDAPPSPAGSEGLGRAGVAVGGARGDPAEAADERFPACIRDAVSQVLKGY---DW 30 40 50 60 70 80 40 50 60 70 80 90 pF1KB9 AESLSPIGDMKVKGEAPANSGAPAGAAGRAKGESRIRRPMNAFMVWAKDERKRLAQQNPD :: : : :.: :..: :.. ...::::::::::. :..::.: : NP_055 --SLVP---MPVRG----------GGGGALKAKPHVKRPMNAFMVWAQAARRKLADQYPH 90 100 110 120 100 110 120 130 140 150 pF1KB9 LHNAELSKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVE :::::::: ::: :. :. .:::::::::::::::: .:::.:::.:::::..: NP_055 LHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSAK------ 130 140 150 160 170 180 160 170 180 190 200 pF1KB9 GGFLHGLAEPQAAALGPE--GGRV--AMDGLG---LQFPEQGFPAGPPLLPPHMGGHYRD .: :. .. ..: :::. :: : : ::: . . : ::: :: . NP_055 AG--HSDSD-SGAELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTHGPPT-PPTTPK--TE 190 200 210 220 230 210 220 230 240 250 260 pF1KB9 CQSLGAPP---LDGY-PLPTPDTSPLDGVDPDPAFFAAPMPGDCPAAGTYSYAQVSDYAG :. :: : :.: :. . . .: . : . ... . : : .. . : : NP_055 LQQAGAKPELKLEGRRPVDS-GRQNIDFSNVDISELSSEVMGTMDAFDVHEFDQ---YL- 240 250 260 270 280 270 280 290 300 310 320 pF1KB9 PPEPPAGPMHPRLGPEPAGPSIPGLLAPPSALHVYYGAMGSPGAGGGRGFQMQPQHQHQH : .:: :. : .: . . .: : . .: .:: : : . : . NP_055 ---PLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETG-------PPRPHIK 290 300 310 320 330 330 340 350 360 370 pF1KB9 QHQHHPPGPG-QPSPPPEALPC--RDGTDPSQPAELLGEVDRTEFEQYLHFVCKPEMGLP .: : : :: :. : .... :. 
:: NP_055 TEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYA 340 350 360 370 380 390 380 390 400 410 pF1KB9 YQGHDSGVNLPDSHGAISSVVSDASSAVYYCNYPDV NP_055 PGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP 400 410 420 430 440 >>NP_005625 (OMIM: 300123,312000,313430) transcription f (446 aa) initn: 431 init1: 372 opt: 482 Z-score: 251.0 bits: 55.5 E(85289): 3.1e-07 Smith-Waterman score: 483; 33.4% identity (58.2% similar) in 299 aa overlap (21-302:91-372) 10 20 30 40 50 pF1KB9 MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPA :. :: : : : . . .. .. : : . NP_005 PSPPATLAHLLPAPAMYSLLETELKNPVGTPTQAAGTGG-PAAPGGAGKSSANAAGGANS 70 80 90 100 110 60 70 80 90 100 pF1KB9 NSGAPAGAAGRAKG--ESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKA ..:. .::.: . : ..:..:::::::::.. .:...: .:: .::.:.:: :: .:: NP_005 GGGSSGGASGGGGGTDQDRVKRPMNAFMVWSRGQRRKMALENPKMHNSEISKRLGADWKL 120 130 140 150 160 170 110 120 130 140 150 160 pF1KB9 LTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALG :: ::::::..::.:::. ::...:.:::::::. .. ::. . .. :: : ::: . NP_005 LTDAEKRPFIDEAKRLRAVHMKEYPDYKYRPRRKTKTL-LKKDKYSLPSGLLPPGAAAAA 180 190 200 210 220 230 170 180 190 200 210 pF1KB9 PEGGRVAMD-----GLGLQFPE----QGFPAGP-PLLPPHMGGHYRDCQSLGAPPLDGYP .. .: :.: .. .:. : :. ..: : . :...:: NP_005 AAAAAAAAAASSPVGVGQRLDTYTHVNGWANGAYSLVQEQLG--YAQPPSMSSPP----- 240 250 260 270 280 290 220 230 240 250 260 270 pF1KB9 LPTPDTSPLDGVDPDPAFFAAPMPGDCPAAGTY-----SYAQVSDYAGPPEPPAGPMHPR : : :. : .. :: :.: .: . : .: :.: .. NP_005 -PPPALPPMHRYDMAGLQYSPMMP---PGAQSYMNVAAAAAAASGYGGMAPSATAAAAAA 300 310 320 330 340 280 290 300 310 320 330 pF1KB9 LGPEPAGPSIPGLLAPPSALHVYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQP : .:: . : .: . 
: ::: NP_005 YGQQPATAAA----AAAAAAAMSLGPMGSVVKSEPSSPPPAIASHSQRACLGDLRDMISM 350 360 370 380 390 400 >>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa) initn: 467 init1: 398 opt: 466 Z-score: 245.2 bits: 53.9 E(85289): 6.4e-07 Smith-Waterman score: 468; 31.3% identity (57.1% similar) in 310 aa overlap (36-331:9-305) 10 20 30 40 50 60 pF1KB9 AGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGE :.: : ....: . .:: : :..... .. NP_003 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSP 10 20 30 70 80 90 100 110 120 pF1KB9 SRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLR .:..:::::::::.. .:...::.:: .::.:.:: :: :: :. .:::::..::.::: NP_003 DRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLR 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB9 VQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLGLQFPE . ::..::.:::::::. .. .:. . . :: : . ... : : : :.. NP_003 ALHMKEHPDYKYRPRRKTKTL-MKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRM 100 110 120 130 140 150 190 200 210 220 230 pF1KB9 QGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPL-P------TPDTSPLDGVDPDPAFFA ... ::.: :. : ::: : . . .:. : . . NP_003 DSYA--------HMNGWSNGSYSMMQDQL-GYPQHPGLNAHGAAQMQPMHRYDVSALQYN 160 170 180 190 200 240 250 260 270 280 290 pF1KB9 APMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLL------APPSA . .. :. .:.. . : : : : . : :. : : . :: .: NP_003 SMTSSQTYMNGSPTYSMSYSQQGTPGMALGSMGSVVKSE-ASSSPPVVTSSSHSRAPCQA 210 220 230 240 250 260 300 310 320 330 340 350 pF1KB9 LHVY-YGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPSQP . . .: ::: . :.. :. :: . 
: :: NP_003 GDLRDMISMYLPGAEVPE--PAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM 270 280 290 300 310 360 370 380 390 400 410 pF1KB9 AELLGEVDRTEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYCNY >>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa) initn: 479 init1: 405 opt: 451 Z-score: 237.1 bits: 52.8 E(85289): 1.8e-06 Smith-Waterman score: 471; 33.4% identity (59.5% similar) in 299 aa overlap (41-312:9-293) 20 30 40 50 60 pF1KB9 DDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKG--EAPANSGAPAGAAGRAKG---- :.. : .::.: ..::::.: . : NP_005 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGG 10 20 30 70 80 90 100 110 pF1KB9 ---------ESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKR ..:..:::::::::.. .:...::.:: .::.:.:: :: ::... :::: NP_005 GGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKR 40 50 60 70 80 90 120 130 140 150 160 170 pF1KB9 PFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVA ::..::.:::. ::..::.:::::::. .. ::. . .. :: ::. : :. :: NP_005 PFIDEAKRLRALHMKEHPDYKYRPRRKTKTL-LKKDKYSLAGGLL---AAGAGGGGAAVA 100 110 120 130 140 150 180 190 200 210 220 230 pF1KB9 MDGLGLQFPEQGFPAGPPLLPPH--MGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPD : :.:. . .: : : :: : .. . .:: . .. .. . NP_005 M-GVGVGVGAAA--VGQRLESPGGAAGGGYAHVNGWAN---GAYPGSVAAAAAAAAMMQE 160 170 180 190 200 240 250 260 270 280 pF1KB9 PAFFAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPR------LGP---EPAGPSIP . . :: :.:.. .:. . . : .: : : .:. .: : . : NP_005 AQLAYGQHPG---AGGAHPHAHPA-HPHPHHPHAHPHNPQPMHRYDMGALQYSPISNSQG 210 220 230 240 250 260 290 300 310 320 330 340 pF1KB9 GLLAPPSALH-VYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCR . : ::. . ::: .. .:..: . 
: NP_005 YMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSG 270 280 290 300 310 320 >>NP_004180 (OMIM: 604747) transcription factor SOX-14 [ (240 aa) initn: 409 init1: 388 opt: 435 Z-score: 232.1 bits: 51.1 E(85289): 3.4e-06 Smith-Waterman score: 435; 35.7% identity (59.2% similar) in 255 aa overlap (62-304:2-228) 40 50 60 70 80 90 pF1KB9 WAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGESRIRRPMNAFMVWAKDERKRLAQQNP .: ..:.:::::::::.. .:...::.:: NP_004 MSKPSDHIKRPMNAFMVWSRGQRRKMAQENP 10 20 30 100 110 120 130 140 pF1KB9 DLHNAELSKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQ--VKRLK .::.:.:: :: :: :. :::::...::.:::.:::..::.:::::::. . .:. . NP_004 KMHNSEISKRLGAEWKLLSEAEKRPYIDEAKRLRAQHMKEHPDYKYRPRRKPKNLLKKDR 40 50 60 70 80 90 150 160 170 180 190 200 pF1KB9 RVEGGFLHGLAEPQAAALGPEGGRVAMDGLGLQFPEQGFPAGPPLLPPHMGGHYRDCQSL : : ..: :: : : : ::: :. ::.. :: :. :: NP_004 YVFPLPYLGDTDPLKAAGLPVG---ASDGL-LSAPEKARAFLPPASAPY---------SL 100 110 120 130 210 220 230 240 250 260 pF1KB9 GAPPLDGYPLPTPDTSPLDGVDPDPAFFAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGP :: . ..: .. . : .:. : : :.: .: : . ... : NP_004 ----LDPAQF---SSSAIQKMGEVPHTLAT---GALPYASTLGY-QNGAFGSLSCPS--- 140 150 160 170 180 270 280 290 300 310 pF1KB9 MHPRLGPEPAGPS--IP--------GLLAPPSALHVYYGAMGSPGAGGGRGFQMQPQHQH .: . : :..:. .: . : :: : .. . .: . : NP_004 QHTHTHPSPTNPGYVVPCNCTAWSASTLQPPVA-YILFPGMTKTGIDPYSSAHATAM 190 200 210 220 230 240 320 330 340 350 360 370 pF1KB9 QHQHQHHPPGPGQPSPPPEALPCRDGTDPSQPAELLGEVDRTEFEQYLHFVCKPEMGLPY >>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa) initn: 449 init1: 378 opt: 430 Z-score: 229.9 bits: 50.7 E(85289): 4.6e-06 Smith-Waterman score: 433; 38.1% identity (59.3% similar) in 231 aa overlap (29-249:17-225) 10 20 30 40 50 60 pF1KB9 MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAG : : . : : .. .: .:.:: : : NP_008 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREG-----AGSPA-APG 10 20 30 40 70 80 90 100 110 120 pF1KB9 RAKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEE : ...:::::::::.. 
.:...::::: .::.:.:: :: .:: : :::::::: NP_008 TLPLE-KVKRPMNAFMVWSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEE 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB9 AERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLG :.:::..:..:.:.:::::::. . . : : .. . :. :: : : . NP_008 AKRLRARHLRDYPDYKYRPRRKAKSSG----AGPSRCGQGRGNLASGGPLWG----PGYA 110 120 130 140 150 190 200 210 220 230 pF1KB9 LQFPEQGFPAGPP-----LLPPHMGGHYRDCQSLGAPPLDGYPLPTPDTSP-LDG-VDPD : .:: :: :: .:. . :. : :: : :...: :.: . : NP_008 ATTQPSRGFGYRPPSYSTAYLPGSYGSSH--CK-LEAPS----PCSLPQSDPRLQGELLPT 160 170 180 190 200 240 250 260 270 280 290 pF1KB9 PAFF---AAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLLAPP . . ..: : . : :: NP_008 YTHYLPPGSPTPYNPPLAGAPMPLTHL 210 220 230

414 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 18:32:10 2016 done: Fri Nov 4 18:32:11 2016
 Total Scan time: 7.360 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]