FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7755, 446 aa
  1>>>pF1KB7755 446 - 446 aa - 446 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.0835+/-0.00033; mu= 2.1678+/- 0.021
 mean_var=280.8083+/-58.044, 0's: 0 Z-trim(124.4): 150 B-trim: 45 in 1/59
 Lambda= 0.076537
 statistics sampled from 45812 (45986) to 45812 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.817), E-opt: 0.2 (0.539), width: 16
 Scan time: 9.980

The best scores are:                                       opt  bits E(85289)
NP_055402 (OMIM: 605923) transcription factor SOX- ( 446) 3155 361.3 2.9e-99
NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509) 1178 143.1 1.6e-33
NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466)  869 108.9 2.8e-23
NP_071899 (OMIM: 610928,613674) transcription fact ( 414)  482  66.2 1.9e-10
NP_005977 (OMIM: 602148) transcription factor SOX- ( 391)  448  62.4 2.5e-09
NP_003098 (OMIM: 184430) transcription factor SOX- ( 474)  442  61.8 4.5e-09
NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384)  435  60.9 6.6e-09
NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317)  433  60.6 6.8e-09
NP_009015 (OMIM: 604974) transcription factor SOX- ( 276)  428  60.0 9e-09
NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233)  413  58.3 2.5e-08
NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446)  413  58.6 4e-08
NP_004180 (OMIM: 604747) transcription factor SOX- ( 240)  404  57.3 5.1e-08
NP_113627 (OMIM: 612202) transcription factor SOX- ( 388)  408  58.0 5.3e-08
NP_008874 (OMIM: 601947) transcription factor SOX- ( 315)  400  57.0 8.4e-08
NP_003099 (OMIM: 600898,615866) transcription fact ( 441)  400  57.1 1.1e-07
NP_821078 (OMIM: 604975,616803) transcription fact ( 377)  327  49.0 2.5e-05
NP_001248343 (OMIM: 604975,616803) transcription f ( 642)  331  49.7 2.7e-05
XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415)  327  49.0 2.7e-05
XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715)  327  49.3 4e-05
XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715)  327  49.3 4e-05
XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715)  327  49.3 4e-05
XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715)  327  49.3 4e-05
XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716)  327  49.3 4e-05
NP_001317714 (OMIM: 604975,616803) transcription f ( 728)  327  49.3 4e-05
XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729)  327  49.3 4e-05
NP_694534 (OMIM: 604975,616803) transcription fact ( 750)  327  49.3 4.1e-05
XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750)  327  49.3 4.1e-05
XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751)  327  49.3 4.1e-05
NP_001248344 (OMIM: 604975,616803) transcription f ( 753)  327  49.3 4.1e-05
XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754)  327  49.3 4.1e-05
NP_008871 (OMIM: 604975,616803) transcription fact ( 763)  327  49.3 4.2e-05
XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764)  327  49.3 4.2e-05
XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792)  327  49.3 4.3e-05
XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793)  327  49.3 4.3e-05
NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204)  312  47.1 5.2e-05
NP_001139283 (OMIM: 607257) transcription factor S ( 801)  317  48.2 9.2e-05
NP_059978 (OMIM: 607257) transcription factor SOX- ( 804)  317  48.2 9.3e-05
NP_201583 (OMIM: 607257) transcription factor SOX- ( 808)  317  48.2 9.3e-05
NP_001139291 (OMIM: 607257) transcription factor S ( 841)  317  48.2 9.6e-05
XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621)  314  47.8 9.7e-05
NP_005677 (OMIM: 604748) transcription factor SOX- ( 622)  314  47.8 9.8e-05
NP_001295094 (OMIM: 606698) transcription factor S ( 448)  270  42.8 0.0023

>>NP_055402 (OMIM: 605923) transcription factor SOX-8 [H (446 aa)
 initn: 3155 init1: 3155 opt: 3155 Z-score: 1903.2 bits: 361.3 E(85289): 2.9e-99
Smith-Waterman score: 3155; 100.0% identity (100.0% similar) in 446 aa overlap (1-446:1-446)

               10        20        30        40        50        60
pF1KB7 MLDMSEARSQPPCSPSGTASSMSHVEDSDSDAPPSPAGSEGLGRAGVAVGGARGDPAEAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 MLDMSEARSQPPCSPSGTASSMSHVEDSDSDAPPSPAGSEGLGRAGVAVGGARGDPAEAA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 DERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQAARRKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 DERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQAARRKL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 ADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 ADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSAK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 AGHSDSDSGAELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTHGPPTPPTTPKTELQQAGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 AGHSDSDSGAELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTHGPPTPPTTPKTELQQAGA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 KPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEFDQYLPLGGPAPPEPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 KPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEFDQYLPLGGPAPPEPG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 QAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIKTEQPSPGHYGDQPRGSPDYGS
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 QAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIKTEQPSPGHYGDQPRGSPDYGS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 CSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYAPGLYQYPCFHSPRRPYASPLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 CSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYAPGLYQYPCFHSPRRPYASPLL 370 380 390 400 410 420 430 440 pF1KB7 NGLALPPAHSPTSHWDQPVYTTLTRP :::::::::::::::::::::::::: NP_055 NGLALPPAHSPTSHWDQPVYTTLTRP 430 440 >>NP_000337 (OMIM: 114290,608160,616425) transcription f (509 aa) initn: 1081 init1: 586 opt: 1178 Z-score: 722.7 bits: 143.1 E(85289): 1.6e-33 Smith-Waterman score: 1243; 48.5% identity (67.0% similar) in 470 aa overlap (16-419:19-480) 10 20 30 40 50 pF1KB7 MLDMSEARSQPPCSPSGTASSMSHVEDSDSDAPPSPAGSEGLGRAGVAVGGARGDP- :: : : . ::: .. :: .::. . .:.: NP_000 MNLLDPFMKMTDEQEKGLSG-APSPTMSEDSAGSPCPSGSGSDTENTRPQENTFPKGEPD 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 --AEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQ :. ...::.:::.:::::::::::.::::::: .:.. : :::::::::::::::: NP_000 LKKESEEDKFPVCIREAVSQVLKGYDWTLVPMPVRVNGSS--KNKPHVKRPMNAFMVWAQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB7 AARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPR :::::::::::::::::::::::::::::.:::::::::::::::::::::::::::::: NP_000 AARRKLADQYPHLHNAELSKTLGKLWRLLNESEKRPFVEEAERLRVQHKKDHPDYKYQPR 120 130 140 150 160 170 180 190 200 210 220 pF1KB7 RRKSAKAGHSDSDSGAELGPHPGGGAVYKA--------EAGLGDGHHHGDHTGQTHGPPT ::::.: :..... ..: : . .:..:: .:... : :.:.::..:::: NP_000 RRKSVKNGQAEAEEATEQ-THISPNAIFKALQADSPHSSSGMSEVHSPGEHSGQSQGPPT 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB7 PPTTPKTELQQAGAKPELKLEGRRPVDSGRQN-IDFSNVDISELSSEVMGTMDAFDVHEF :::::::..: . : .:: ::: ..::: ::: .:::.::::.:......:::.:: NP_000 PPTTPKTDVQPG--KADLKREGRPLPEGGRQPPIDFRDVDIGELSSDVISNIETFDVNEF 240 250 260 270 280 290 290 300 310 320 pF1KB7 DQYLPLGG-PA-PPEPGQA-YGGAY-------FHAGASPVWAHKS-APSA------SASP ::::: .: :. : ::. 
: :.: :.:. :: :. :: .: : NP_000 DQYLPPNGHPGVPATHGQVTYTGSYGISSTAATPASAGHVWMSKQQAPPPPPQQPPQAPP 300 310 320 330 340 350 330 340 350 pF1KB7 TETGPPRP---------------------------------HIKTEQPSPGHYGDQPRGS . .::.: :::::: ::.::..: . : NP_000 APQAPPQPQAAPPQQPAAPPQQPQAHTLTTLSSEPGQSQRTHIKTEQLSPSHYSEQQQHS 360 370 380 390 400 410 360 370 380 390 400 410 pF1KB7 PDYGSCS--GQSSATPAAPAGPFAGSQGDYGDLQ-ASSYYGAYPGYAPGLYQYPCFHSP- :. . : . .:. : :.. :: :: : : .::::. : . :::. . .: NP_000 PQQIAYSPFNLPHYSPSYP--PITRSQYDYTDHQNSSSYYSHAAGQGTGLYSTFTYMNPA 420 430 440 450 460 470 420 430 440 pF1KB7 RRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP .::. .:. NP_000 QRPMYTPIADTSGVPSIPQTHSPQHWEQPVYTQLTRP 480 490 500 >>NP_008872 (OMIM: 602229,609136,611584,613266) transcri (466 aa) initn: 1010 init1: 579 opt: 869 Z-score: 538.8 bits: 108.9 E(85289): 2.8e-23 Smith-Waterman score: 1281; 50.3% identity (68.4% similar) in 475 aa overlap (2-446:10-466) 10 20 30 40 50 pF1KB7 MLDMSEARSQPP-CSPSGTASSMSHVEDSDSDAPPSPAGSEGLGRAGVAVGG ...: . :. : : :.: :.. :. . . : : : :. : : NP_008 MAEEQDLSEVELSPVGSEEPRCLSPGSAPSLG--PDGGGGGSGLRA-SPGPGELG-KVKK 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 ARGDPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMV . : .:: :..::.:::.::::::.::::.::::::: .: : :.::::::::::::: NP_008 EQQD-GEADDDKFPVCIREAVSQVLSGYDWTLVPMPVRVNG--ASKSKPHVKRPMNAFMV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB7 WAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKY ::::::::::::::::::::::::::::::::.::.::::.:::::::.::::::::::: NP_008 WAQAARRKLADQYPHLHNAELSKTLGKLWRLLNESDKRPFIEEAERLRMQHKKDHPDYKY 120 130 140 150 160 170 180 190 200 210 pF1KB7 QPRRRKSAKA--GHSDSDSG-AELGPHPGGGAVYKA------EAGLGDGHHHGD--H-TG ::::::..:: :... .: :: : . : ::. . : :. :. : .: NP_008 QPRRRKNGKAAQGEAECPGGEAEQGGTAAIQAHYKSAHLDHRHPGEGSPMSDGNPEHPSG 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB7 QTHGPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDA :.:::::::::::::::.. : : : .:: ..:. .:::.::::.:.: :::..:.. 
NP_008 QSHGPPTPPTTPKTELQSGKADP--KRDGRSMGEGGKPHIDFGNVDIGEISHEVMSNMET 240 250 260 270 280 290 280 290 300 310 320 330 pF1KB7 FDVHEFDQYLPLGG-PAP----PEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPP ::: :.::::: .: :. : . :.: :.. .: : : . : :: ..:: NP_008 FDVAELDQYLPPNGHPGHVSSYSAAGYGLGSALAVASGHSAWISK--PPGVALPT-VSPP 300 310 320 330 340 340 350 360 370 380 pF1KB7 ----RPHIKTEQ--PS-PGHYGDQPRGSP-DYGSCS--GQSSATPAAPAGPFAGSQGDYG . ..::: :. : :: ::: : : : : .:: :. : ::. NP_008 GVDAKAQVKTETAGPQGPPHYTDQPSTSQIAYTSLSLPHYGSAFPSISRPQF-----DYS 350 360 370 380 390 400 390 400 410 420 430 440 pF1KB7 DLQASSYYGAYPGYAPGLYQYPCFHSP-RRPYASPLLN-GLALPPAHSPTSHWDQPVYTT : : :. : .. : : :::. . .: .:: . . . . . : .:::: ::.:::::: NP_008 DHQPSGPYYGHSGQASGLYSAFSYMGPSQRPLYTAISDPSPSGPQSHSPT-HWEQPVYTT 410 420 430 440 450 460 pF1KB7 LTRP :.:: NP_008 LSRP >>NP_071899 (OMIM: 610928,613674) transcription factor S (414 aa) initn: 496 init1: 424 opt: 482 Z-score: 308.5 bits: 66.2 E(85289): 1.9e-10 Smith-Waterman score: 509; 34.6% identity (54.4% similar) in 364 aa overlap (55-373:5-352) 30 40 50 60 70 80 pF1KB7 VEDSDSDAPPSPAGSEGLGRAGVAVGGARGDPAEAADERFPACIRDAVSQVLKGYD---W : . :.:.. . ..:. :. : : NP_071 MSSPDAGYASDDQ--SQTQSALPAVMAGLGPCPW 10 20 30 90 100 110 120 pF1KB7 --SLVP---MPVRG----------GGGGALKAKPHVKRPMNAFMVWAQAARRKLADQYPH :: : : :.: :..: :.. ...::::::::::. :..::.: : NP_071 AESLSPIGDMKVKGEAPANSGAPAGAAGRAKGESRIRRPMNAFMVWAKDERKRLAQQNPD 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB7 LHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSAK------ :::::::: ::: :. :. .:::::::::::::::: .:::.:::.:::::..: NP_071 LHNAELSKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVE 100 110 120 130 140 150 190 200 210 220 230 pF1KB7 AG--HSDSD-SGAELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTHGPPT-PPTTPK--TE .: :. .. ..: :::. :: : : ::: . . : ::: :: . NP_071 GGFLHGLAEPQAAALGPE--GGRV--AMDGLG---LQFPEQGFPAGPPLLPPHMGGHYRD 160 170 180 190 200 240 250 260 270 280 290 pF1KB7 LQQAGAKPELKLEGRRPVDSGRQN-IDFSNVDISELSSEVMGTMDAFDVHEFDQYLPLGG :. :: : :.: :. . . .: . : . 
... . : : .. . : .: NP_071 CQSLGAPP---LDGY-PLPTPDTSPLDGVDPDPAFFAAPMPGDCPAAGTYSYAQVSDYAG 210 220 230 240 250 260 300 310 320 330 pF1KB7 PAPPEPGQAY-------GGAYFHAGASPVWAHKSAPSASASPTETG-------PPRPHIK : : : . .: . . .: : . .: .:: : : . : . NP_071 PPEPPAGPMHPRLGPEPAGPSIPGLLAPPSALHVYYGAMGSPGAGGGRGFQMQPQHQHQH 270 280 290 300 310 320 340 350 360 370 380 390 pF1KB7 TEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYA .: : : :: :. : .... :. :: NP_071 QHQHHPPGPG-QPSPPPEALPC--RDGTDPSQPAELLGEVDRTEFEQYLHFVCKPEMGLP 330 340 350 360 370 400 410 420 430 440 pF1KB7 PGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP NP_071 YQGHDSGVNLPDSHGAISSVVSDASSAVYYCNYPDV 380 390 400 410 >>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa) initn: 466 init1: 410 opt: 448 Z-score: 288.5 bits: 62.4 E(85289): 2.5e-09 Smith-Waterman score: 478; 32.9% identity (54.0% similar) in 350 aa overlap (90-428:39-357) 60 70 80 90 100 110 pF1KB7 ADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQAARRK :::::: . .:::::::::::... ::: NP_005 DLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRK 10 20 30 40 50 60 120 130 140 150 160 170 pF1KB7 LADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRR-KS .:.. :..::.:.:: :: :...::.:::::..::.:::. : :.::::::.:::. :. NP_005 MAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKT 70 80 90 100 110 120 180 190 200 210 220 230 pF1KB7 AKAGHSDSDSGAELGPHPGGG-AVYKAEAGLGDGHHHGDHTGQTHGPPTPPTTPKTELQQ . : .:. :. ::: :. .:.: : . .:: : NP_005 LLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG---AAAVGQRLESPGG---------A 130 140 150 160 170 240 250 260 270 280 290 pF1KB7 AGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMD-AFDVHEFDQYLPLGGPAP ::. : :.. .. ..: . .. .: . :. : : .: : NP_005 AGG-------GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQH------PGAGGAH 180 190 200 210 220 300 310 320 330 340 350 pF1KB7 PEPGQAYGGAYFHAGASPVWAHKSAP--SASASPTETGPPRPHIKTEQPSPGHYGDQPRG :. :. . : : : :. : . . . .: . ::. 
:: : : NP_005 PHAHPAHPHPH-HPHAHP---HNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYG 230 240 250 260 270 360 370 380 390 400 pF1KB7 SPDYGSCSG----QSSATPAAPAGPFA--GSQGDYGDLQASSYYGAYPGYAPGLYQYPCF . .. .. :.::. :: :. : :. : :.: : :. : ::. . :: NP_005 AAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPP--APAHSRAPCP 280 290 300 310 320 330 410 420 430 440 pF1KB7 HSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP . :. . : : . :: NP_005 GDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI 340 350 360 370 380 390 >>NP_003098 (OMIM: 184430) transcription factor SOX-4 [H (474 aa) initn: 500 init1: 403 opt: 442 Z-score: 283.9 bits: 61.8 E(85289): 4.5e-09 Smith-Waterman score: 448; 34.3% identity (59.6% similar) in 329 aa overlap (101-400:58-375) 80 90 100 110 120 130 pF1KB7 AVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQAARRKLADQYPHLHNA :.::::::::::.: :::. .: : .::: NP_003 LGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDMHNA 30 40 50 60 70 80 140 150 160 170 180 190 pF1KB7 ELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSAKAGHSDSDSGA :.:: ::: :.::..:.: ::..::::::..: :.:::::.: ::..:.:...:.:.: NP_003 EISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADYPDYKYRP--RKKVKSGNANSSSSA 90 100 110 120 130 140 200 210 220 230 pF1KB7 ELGPHPG---------GGAVYKAEAGLGDGHHHGDHTGQTHGPP-TPPTTPKTELQQ-AG . .:: ::. . . .: :... : : . : . :. :. .. :: NP_003 AASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGGGGGASGGGANSKPAQKKSCGSKVAG 150 160 170 180 190 200 240 250 260 270 280 290 pF1KB7 ------AKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEF-DQYLPLG .::. :: . .: . . . . ...: :. . . :.. NP_003 GAGGGVSKPHAKL----ILAGGGGGGKAAAAAAASFAAEQAGAAALLPLGAAADHHSLYK 210 220 230 240 250 260 300 310 320 330 340 pF1KB7 GPAPPEPGQAYGGAYFHAG-ASPV--WAHKSAPSA---SASPTETGPPRPHIKTEQPS-P . .: ..: ..: :. :.: :.:.. . .. : ..: .:: : NP_003 ARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVGAGADPSDP 270 280 290 300 310 320 350 360 370 380 390 400 pF1KB7 -GHYGDQPRG-SPDYGSCSGQSSA--TPAAPAGPFAGSQGDYGDLQASSYYGAYPGYAPG : : .. : ::: : ::.::: .::: .: : .: :..:.:.: :. 
:: NP_003 LGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSP-ADHRG-YASLRAAS---PAPSSAPS 330 340 350 360 370 410 420 430 440 pF1KB7 LYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP NP_003 HASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLGSFSSSSALDRDLDFNF 380 390 400 410 420 430 >>NP_060889 (OMIM: 137940,601618,607823) transcription f (384 aa) initn: 487 init1: 397 opt: 435 Z-score: 280.9 bits: 60.9 E(85289): 6.6e-09 Smith-Waterman score: 446; 31.7% identity (48.2% similar) in 398 aa overlap (31-407:12-381) 10 20 30 40 50 pF1KB7 MLDMSEARSQPPCSPSGTASSMSHVEDSDSDAPPSP---AGSEGLGRAGVAVGGARGDPA : ::. : . : : :. . : : : :: NP_060 MQRSPPGYGAQDDPPARRDCAWAPGHGAAADTRGLAAG-PA 10 20 30 40 60 70 80 90 100 110 pF1KB7 EAADERFPACIRDAVSQVLKGYDWSLVPMPVRGG----GGGALKA--KPHVKRPMNAFMV : :: : . : : : : : : .: . ...:::::::: NP_060 ALAAPAAPA------SPPSPQRSPPRSPEPGRYGLSPAGRGERQAADESRIRRPMNAFMV 50 60 70 80 90 120 130 140 150 160 170 pF1KB7 WAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKY ::. :..::.: : :::: ::: ::: :. :. .:::::::::::::::: .:::.::: NP_060 WAKDERKRLAQQNPDLHNAVLSKMLGKAWKELNAAEKRPFVEEAERLRVQHLRDHPNYKY 100 110 120 130 140 150 180 190 200 210 220 pF1KB7 QPRRRKSAKAGHSDSDSG---AELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTH---GPP .:::.:.:. .. . : :.: . : .: . . .. : : : NP_060 RPRRKKQARKARR-LEPGLLLPGLAPPQPPPEPFPAASGSARAFRELPPLGAEFDGLGLP 160 170 180 190 200 210 230 240 250 260 270 280 pF1KB7 TPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEF :: .: :. . : . : : . . . . .::: . : . : . NP_060 TPERSPLDGLEPGEAA--FFPPPAAPEDCALRPFRAPYAP-TELSRDPGGCYGA----PL 220 230 240 250 260 290 300 310 320 330 340 pF1KB7 DQYLPLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIKTEQPSP . : . :: : : :: :. . .: : : :.. : :. NP_060 AEALRTAPPAAPLAGLYYGTL----GTPGPYPGPLSPPPEAPPLESAEPL------GPAA 270 280 290 300 310 350 360 370 380 390 pF1KB7 GHYGDQPRGSPD-YGSCSGQSSATPAAPAGPFAGSQGDYGDL-----QASSYYGAYPGYA ..: : : .:: . : ::. :. . . : . :: .: . 
NP_060 DLWADVDLTEFDQYLNCS---RTRPDAPGLPYHVALAKLGPRAMSCPEESSLISALSDAS 320 330 340 350 360 370 400 410 420 430 440 pF1KB7 PGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP ..: : NP_060 SAVYYSACISG 380 >>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa) initn: 387 init1: 387 opt: 433 Z-score: 280.7 bits: 60.6 E(85289): 6.8e-09 Smith-Waterman score: 441; 31.5% identity (56.0% similar) in 327 aa overlap (85-396:14-313) 60 70 80 90 100 pF1KB7 DPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGA---------LKAKP-HVKR :. . ::::: : .: .::: NP_003 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKR 10 20 30 40 110 120 130 140 150 160 pF1KB7 PMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKK ::::::::... :::.:.. :..::.:.:: :: :.::::.:::::..::.:::. : : NP_003 PMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMK 50 60 70 80 90 100 170 180 190 200 210 pF1KB7 DHPDYKYQPRRR-KSAKAGHSDSDSGAELGPHPGGGAVYKAE---AGLGDG-HHHGDHTG .::::::.:::. :. . . :. :.: ::... .. :::: : ... : . NP_003 EHPDYKYRPRRKTKTLMKKDKYTLPGGLLAP--GGNSMASGVGVGAGLGAGVNQRMDSYA 110 120 130 140 150 160 220 230 240 250 260 270 pF1KB7 QTHGPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDA . .: . . . .: :. .: . : . :.: :. . : . .. NP_003 HMNGWSNGSYSMMQDQLGYPQHPGLNAHG------AAQMQPMHRYDVSALQYNSMTSSQT 170 180 190 200 210 280 290 300 310 320 330 pF1KB7 FDVHEFDQYLPLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIK :. :.:. :. .: . : .: : : :. : . ..:: . NP_003 --------YMN-GSPT-------YSMSYSQQG-TPGMALGSMGSVVKSEASSSPPVVTSS 220 230 240 250 340 350 360 370 380 390 pF1KB7 TEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYA ... .: . :: : : ::::. ... . : . ... :. 
: NP_003 SHSRAPCQAGDLRDMISMY--LPGAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSH 260 270 280 290 300 310 400 410 420 430 440 pF1KB7 PGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP NP_003 M >>NP_009015 (OMIM: 604974) transcription factor SOX-21 [ (276 aa) initn: 466 init1: 393 opt: 428 Z-score: 278.5 bits: 60.0 E(85289): 9e-09 Smith-Waterman score: 428; 34.4% identity (58.0% similar) in 262 aa overlap (98-346:2-247) 70 80 90 100 110 120 pF1KB7 IRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKP--HVKRPMNAFMVWAQAARRKLADQYP .:: ::::::::::::..: :::.:.. : NP_009 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENP 10 20 30 130 140 150 160 170 180 pF1KB7 HLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSAKAGHSD ..::.:.:: :: :.::.:::::::..::.:::..: :.::::::.:::. .. NP_009 KMHNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLL---K 40 50 60 70 80 190 200 210 220 230 240 pF1KB7 SDSGAELGPHPGGGAVYKAEAGL--GDGHHHGDHTGQTHGPPTPPTTPKTELQQAGAK-- .:. : :. ::.. . .: : : : : : . : . ..:. :.: NP_009 KDKFAFPVPYGLGGVADAEHPALKAGAGLHAGAGGGLV--PESLLANPEKAAAAAAAAAA 90 100 110 120 130 140 250 260 270 280 290 pF1KB7 ----PELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEFDQYLPLGGPAPP :. . . .. . .: .:.. .:. .. ... :: :. NP_009 RVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSGLPYAS-----SLGYPT-- 150 160 170 180 190 300 310 320 330 340 350 pF1KB7 EPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPR---PHIKTEQPSPGHYGDQPRG : .::. :.:. . : .: . . : : : : . :::: NP_009 ----AGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYI 200 210 220 230 240 250 360 370 380 390 400 410 pF1KB7 SPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYAPGLYQYPCFHSPRRP NP_009 LLPGMGKPQLDPYPAAYAAAL 260 270 >>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa) initn: 400 init1: 377 opt: 413 Z-score: 270.4 bits: 58.3 E(85289): 2.5e-08 Smith-Waterman score: 430; 38.6% identity (62.8% similar) in 223 aa overlap (85-302:29-224) 60 70 80 90 100 110 pF1KB7 DPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAK-P--HVKRPMNAFMV :. .:.:. : . 
: .:::::::::: NP_008 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREGAGSPAAPGTLPLEKVKRPMNAFMV 10 20 30 40 50 120 130 140 150 160 170 pF1KB7 WAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKY :..: ::..:.: :..::.:.:: :: :.::.:.:::::::::.:::..: .:.::::: NP_008 WSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRDYPDYKY 60 70 80 90 100 110 180 190 200 210 220 pF1KB7 QPRRR-KSAKAGHSDSDSGAELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTHG-PPTPPT .:::. ::. :: : .: : .:: .. : :. : ..: ::. NP_008 RPRRKAKSSGAGPSRCGQGR--GNLASGGPLW------GPGYA---TTQPSRGFGYRPPS 120 130 140 150 160 230 240 250 260 270 280 pF1KB7 TPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEFDQYL . : . .. . :::. : . ... .:..:.. : . .:: NP_008 YSTAYLPGSYGSSHCKLEAPSPCSLPQSD--------PRLQGELLPT--------YTHYL 170 180 190 200 210 290 300 310 320 330 340 pF1KB7 PLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIKTEQPSPGHYG : :.:.: .: : NP_008 PPGSPTPYNPPLAGAPMPLTHL 220 230 446 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 09:23:25 2016 done: Fri Nov 4 09:23:27 2016 Total Scan time: 9.980 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]