FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9653, 276 aa
  1>>>pF1KB9653 276 - 276 aa - 276 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.2220+/-0.000359; mu= 3.5606+/- 0.022
 mean_var=216.0047+/-45.684, 0's: 0 Z-trim(121.3): 175 B-trim: 1754 in 1/56
 Lambda= 0.087266
 statistics sampled from 37575 (37768) to 37575 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.768), E-opt: 0.2 (0.443), width: 16
 Scan time: 7.460

The best scores are:                                       opt  bits E(85289)
NP_009015 (OMIM: 604974) transcription factor SOX- ( 276) 1856 245.6 7.5e-65
NP_004180 (OMIM: 604747) transcription factor SOX- ( 240)  664  95.5   1e-19
NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317)  620  90.1 5.7e-18
NP_005977 (OMIM: 602148) transcription factor SOX- ( 391)  611  89.0 1.5e-17
NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446)  602  87.9 3.5e-17
NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233)  471  71.2   2e-12
NP_003098 (OMIM: 184430) transcription factor SOX- ( 474)  456  69.6 1.3e-11
NP_008874 (OMIM: 601947) transcription factor SOX- ( 315)  439  67.3 4.1e-11
NP_055402 (OMIM: 605923) transcription factor SOX- ( 446)  428  66.0 1.4e-10
NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509)  423  65.5 2.4e-10
NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204)  409  63.3 4.1e-10
NP_003099 (OMIM: 600898,615866) transcription fact ( 441)  415  64.4 4.3e-10
NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466)  410  63.8 6.9e-10
NP_071899 (OMIM: 610928,613674) transcription fact ( 414)  402  62.7 1.3e-09
NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384)  398  62.2 1.7e-09
NP_113627 (OMIM: 612202) transcription factor SOX- ( 388)  365  58.1 3.1e-08
NP_821078 (OMIM: 604975,616803) transcription fact ( 377)  331  53.8 5.8e-07
XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415)  331  53.8 6.2e-07
XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621)  331  54.0 8.3e-07
NP_005677 (OMIM: 604748) transcription factor SOX- ( 622)  331  54.0 8.3e-07
NP_001248343 (OMIM: 604975,616803) transcription f ( 642)  331  54.0 8.5e-07
XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715)  331  54.0 9.2e-07
XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715)  331  54.0 9.2e-07
XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715)  331  54.0 9.2e-07
XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715)  331  54.0 9.2e-07
XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716)  331  54.0 9.2e-07
NP_001317714 (OMIM: 604975,616803) transcription f ( 728)  331  54.0 9.3e-07
XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729)  331  54.0 9.4e-07
XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750)  331  54.0 9.5e-07
NP_694534 (OMIM: 604975,616803) transcription fact ( 750)  331  54.0 9.5e-07
XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751)  331  54.0 9.6e-07
NP_001248344 (OMIM: 604975,616803) transcription f ( 753)  331  54.0 9.6e-07
XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754)  331  54.0 9.6e-07
NP_008871 (OMIM: 604975,616803) transcription fact ( 763)  331  54.0 9.7e-07
XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764)  331  54.0 9.7e-07
XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792)  331  54.1 9.9e-07
XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793)  331  54.1 9.9e-07
NP_001139283 (OMIM: 607257) transcription factor S ( 801)  325  53.3 1.7e-06
NP_059978 (OMIM: 607257) transcription factor SOX- ( 804)  325  53.3 1.7e-06
NP_201583 (OMIM: 607257) transcription factor SOX- ( 808)  325  53.3 1.7e-06
NP_001139291 (OMIM: 607257) transcription factor S ( 841)  325  53.3 1.7e-06
XP_011532722 (OMIM: 606698) PREDICTED: transcripti ( 448)  299  49.8 1.1e-05

>>NP_009015 (OMIM: 604974) transcription factor SOX-21 [ (276 aa)
 initn: 1856 init1: 1856 opt: 1856 Z-score: 1285.3 bits: 245.6 E(85289): 7.5e-65
Smith-Waterman score: 1856; 100.0% identity (100.0% similar) in 276 aa overlap (1-276:1-276)

               10        20        30        40        50        60
pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHNSEISKRLGAEWKLLTESEKRPFIDE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_009 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHNSEISKRLGAEWKLLTESEKRPFIDE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 AKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFPVPYGLGGVADAEHPALKAGAGLHAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_009 AKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFPVPYGLGGVADAEHPALKAGAGLHAG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 AGGGLVPESLLANPEKAAAAAAAAAARVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_009 AGGGLVPESLLANPEKAAAAAAAAAARVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 EISSSSSGLPYASSLGYPTAGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_009 EISSSSSGLPYASSLGYPTAGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNC
              190       200       210       220       230       240

              250       260       270
pF1KB9 SAWPSPGLQPPLAYILLPGMGKPQLDPYPAAYAAAL
       ::::::::::::::::::::::::::::::::::::
NP_009 SAWPSPGLQPPLAYILLPGMGKPQLDPYPAAYAAAL
              250       260       270

>>NP_004180 (OMIM: 604747) transcription factor SOX-14 [ (240 aa)
 initn: 1046 init1: 627 opt: 664 Z-score: 475.1 bits: 95.5 E(85289): 1e-19
Smith-Waterman score: 978; 57.9% identity (73.7%
similar) in 285 aa overlap (1-276:1-240) 10 20 30 40 50 60 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHNSEISKRLGAEWKLLTESEKRPFIDE :::: ::.::::::::::::.::::::::::::::::::::::::::::.:.::::.::: NP_004 MSKPSDHIKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSEAEKRPYIDE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 AKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFPVPYGLGGVADAEHPALKAGAGLHAG ::::::.:::::::::::::::::.:::::...::.:: :: :.. : ::: ::: .: NP_004 AKRLRAQHMKEHPDYKYRPRRKPKNLLKKDRYVFPLPY-LG---DTD-P-LKA-AGLPVG 70 80 90 100 110 130 140 150 160 170 pF1KB9 AGGGLVPESLLANPEKAAAAAAAAAARVFFPQSAAAAAAAAAAAAAGSPYSLLDLGS--- :. :: :. :::: :.:.: ..: :::::: .. NP_004 ASDGL-----LSAPEKA---------RAFLPPASA-------------PYSLLDPAQFSS 120 130 140 180 190 200 210 220 230 pF1KB9 ----KMAEISSS--SSGLPYASSLGYPTAGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGN ::.:. . ...:::::.::: . ::: :. . . :::.:::: : NP_004 SAIQKMGEVPHTLATGALPYASTLGYQN---GAF-GSLSCPSQ-------HTHTHPSPTN 150 160 170 180 190 240 250 260 270 pF1KB9 PGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDPYPAAYAAAL :::..::::.:: . ::::.::::.::: : .::: .:.:.:. NP_004 PGYVVPCNCTAWSASTLQPPVAYILFPGMTKTGIDPYSSAHATAM 200 210 220 230 240 >>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa) initn: 624 init1: 569 opt: 620 Z-score: 443.6 bits: 90.1 E(85289): 5.7e-18 Smith-Waterman score: 620; 43.0% identity (62.9% similar) in 272 aa overlap (6-269:39-304) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN :.:::::::::::::.:::::::::::::: NP_003 LKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHN 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB9 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP ::::::::::::::.:.::::::::::::::.::::::::::::::: :::.::::...: NP_003 SEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLP 70 80 90 100 110 120 100 110 120 130 140 150 pF1KB9 VPYGLGGVADAEHPALKAGAGLHAGAGGGLVP--ESLLANPEKAAAAAAAAAARVFFPQS ::. .. .:.:. :: :.:. .: . .. . .. 
.:: NP_003 -----GGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQH 130 140 150 160 170 180 160 170 180 190 200 pF1KB9 AAAAAAAAAAAAAGSPYSLLDLG-SKMAEISSSSSGLP-YA---SSLGYPTAGAGAFHGA . : .:: :.. : ..:. .. .: : :. :. : : . :.. :. NP_003 PGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM-GS 190 200 210 220 230 240 210 220 230 240 250 260 pF1KB9 AAAAAAAAAAAGGHTHSHP-SPGNPGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDP .. . :... . :: .: . : . :. . : : : . : : NP_003 VVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGP 250 260 270 280 290 300 270 pF1KB9 YPAAYAAAL : NP_003 VPGTAINGTLPLSHM 310 >>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa) initn: 742 init1: 555 opt: 611 Z-score: 436.3 bits: 89.0 E(85289): 1.5e-17 Smith-Waterman score: 684; 46.2% identity (64.9% similar) in 305 aa overlap (6-258:49-349) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN :.:::::::::::::.:::::::::::::: NP_005 PTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHN 20 30 40 50 60 70 40 50 60 70 80 90 pF1KB9 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP ::::::::::::...:.::::::::::::::.::::::::::::::: ::::::::... NP_005 SEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLA 80 90 100 110 120 130 100 110 120 130 pF1KB9 ---VPYGLGGVADAEHPALKAGAGLHA----------GAGGGLVPESLLAN---PEKAAA . : :: . : .. .:.: : .:::: . . :: : ..:: NP_005 GGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGGYAHVNGWANGAYPGSVAA 140 150 160 170 180 190 140 150 160 170 180 pF1KB9 AAAAAA----ARVFFPQSAAAAAAAAAAAAAG-------------SPYSLLDLGS-KMAE :::::: :.. . : .:..: : : .:. :.:. ... NP_005 AAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMGALQYSP 200 210 220 230 240 250 190 200 210 220 pF1KB9 ISSSSS----------GLPYASSLGYPTAGAGAFHGAAAAAAAAAAAA--------GGHT ::.:.. ::::... . .:..:: ...:.::::::::: :. . NP_005 ISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLV 260 270 280 290 300 310 230 240 250 260 270 pF1KB9 HSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDPYPAAYAAAL .:.:: . :. : . : : :: . . 
:: NP_005 KSEPSGSPPA---PAHSRA-PCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQH 320 330 340 350 360 370 NP_005 YQGAGAGVNGTVPLTHI 380 390 >>NP_005625 (OMIM: 300123,312000,313430) transcription f (446 aa) initn: 565 init1: 565 opt: 602 Z-score: 429.4 bits: 87.9 E(85289): 3.5e-17 Smith-Waterman score: 612; 48.2% identity (66.3% similar) in 249 aa overlap (6-232:137-383) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN :.:::::::::::::.:::::: ::::::: NP_005 GKSSANAAGGANSGGGSSGGASGGGGGTDQDRVKRPMNAFMVWSRGQRRKMALENPKMHN 110 120 130 140 150 160 40 50 60 70 80 90 pF1KB9 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP :::::::::.:::::..::::::::::::::.::::.:::::::::: ::::::::...: NP_005 SEISKRLGADWKLLTDAEKRPFIDEAKRLRAVHMKEYPDYKYRPRRKTKTLLKKDKYSLP 170 180 190 200 210 220 100 110 120 130 140 150 pF1KB9 VPYGLGGVADAEHPALKAGAGLHAGAGGGLVPESLLANPEKAAAAAAAAAARVFFPQSAA :.: : : :.:. . .: : .. : .: . . .. . : . NP_005 SGLLPPGAAAAAAAAAAAAAAASSPVGVGQRLDTYTHVNGWANGAYSLVQEQLGYAQPPS 230 240 250 260 270 280 160 170 180 190 200 pF1KB9 AAAAAAAAA--------AAGSPYS-LLDLGSK----MAEISSSSSGLPYASSLGYPTAGA .. : :: :: .. :.. .: ....:: :.. ::.: NP_005 MSSPPPPPALPPMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASG--YGGMAPSATAAA 290 300 310 320 330 340 210 220 230 240 250 pF1KB9 GAFHG---AAAAAAAAAAAA------GGHTHSHPSPGNPGYMIPCNCSAWPSPGLQPPLA .: .: :.:::::::::: :. 
..:.:: : NP_005 AAAYGQQPATAAAAAAAAAAMSLGPMGSVVKSEPSSPPPAIASHSQRACLGDLRDMISMY 350 360 370 380 390 400 260 270 pF1KB9 YILLPGMGKPQLDPYPAAYAAAL NP_005 LPPGGDAADAASPLPGGRLHGVHQHYQGAGTAVNGTVPLTHI 410 420 430 440 >>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa) initn: 476 init1: 458 opt: 471 Z-score: 343.9 bits: 71.2 E(85289): 2e-12 Smith-Waterman score: 474; 43.6% identity (62.7% similar) in 204 aa overlap (4-202:45-226) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKM :...:::::::::::: ::::.:::.:::: NP_008 EPPAATAAASSSSGPQEREGAGSPAAPGTLPLEKVKRPMNAFMVWSSAQRRQMAQQNPKM 20 30 40 50 60 70 40 50 60 70 80 90 pF1KB9 HNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFA :::::::::::.:::: :.:::::..::::::: :....:::::::::: :. NP_008 HNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRDYPDYKYRPRRKAKS-------- 80 90 100 110 120 100 110 120 130 140 150 pF1KB9 FPVPYGLGGVADAEHPALKAGAGLHAGAGGGLV--PESLLANPEKAAAAAAAAAARVFFP : : :. . : : :.:: . : ..: .. . . . ...: NP_008 ----SGAG-------PS-RCGQGRGNLASGGPLWGPGYATTQPSRGFGYRPPSYSTAYLP 130 140 150 160 170 160 170 180 190 200 pF1KB9 QSAAAAAAAAAAAAAGS-PYSLLDLGSKMAEISSSSSGLPYASSLGY--PTAGAGAFHGA : ... : . : : : : ... . . . :: .: : : ::: NP_008 GSYGSSHCKLEAPSPCSLPQSDPRLQGEL--LPTYTHYLPPGSPTPYNPPLAGAPMPLTH 180 190 200 210 220 230 210 220 230 240 250 260 pF1KB9 AAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDPY NP_008 L >>NP_003098 (OMIM: 184430) transcription factor SOX-4 [H (474 aa) initn: 505 init1: 393 opt: 456 Z-score: 329.7 bits: 69.6 E(85289): 1.3e-11 Smith-Waterman score: 464; 40.2% identity (65.2% similar) in 244 aa overlap (4-222:55-287) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKM : :.:::::::::::. .:::. ...: : NP_003 GLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDM 30 40 50 60 70 80 40 50 60 70 80 90 pF1KB9 HNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFA ::.::::::: .:::: .:.: ::: ::.::: :: ..::::::::.: :. ... . 
NP_003 HNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADYPDYKYRPRKKVKSGNANSSSS 90 100 110 120 130 140 100 110 120 130 pF1KB9 F-----PVPYG--LGGVADAEHPALKAGAGLHAGAGGGLV--------PESLLANPEKAA : : .:: . . : . .:.. .::.::: . : . . :.: NP_003 AAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGGGGGASGGGANSKPAQKKSCGSKVA 150 160 170 180 190 200 140 150 160 170 180 pF1KB9 AAAAAAA----ARVFFP------QSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSG ..:.... :.... ..::::::. :: ::. .:: ::. ... . NP_003 GGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAEQAGAA-ALLPLGA-----AADHHS 210 220 230 240 250 190 200 210 220 230 240 pF1KB9 LPYASSLGYPTAGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWPSPGL : : . :.:.:.: ..::.:.:: :: : : NP_003 LYKART---PSASASA--SSAASASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVG 260 270 280 290 300 310 250 260 270 pF1KB9 QPPLAYILLPGMGKPQLDPYPAAYAAAL NP_003 AGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRGYASLRAASPAPSS 320 330 340 350 360 370 >>NP_008874 (OMIM: 601947) transcription factor SOX-12 [ (315 aa) initn: 487 init1: 385 opt: 439 Z-score: 320.4 bits: 67.3 E(85289): 4.1e-11 Smith-Waterman score: 439; 34.4% identity (58.2% similar) in 273 aa overlap (4-265:36-295) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKM : :.:::::::::::. .:::. .. : : NP_008 GARAKRDGGPPPPGPGPAEEGAREPGWCKTPSGHIKRPMNAFMVWSQHERRKIMDQWPDM 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB9 HNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFA ::.::::::: .:.:: .::: ::. ::.::: :: ..::::::::.: : : : NP_008 HNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRPRKKSKGAPAK---A 70 80 90 100 110 120 100 110 120 130 140 pF1KB9 FPVPYGLGGVADAEHPALK-AGAGLHAGAGGGL-----VPESLLANPEKAAAAA--AAAA : : : .: .. .:. . : : . .::: : .::. . .. . . . NP_008 RPRPPGGSGGGSRLKPGPQLPGRGGRRAAGGPLGGGAAAPEDDDEDDDEELLEVRLVETP 130 140 150 160 170 180 150 160 170 180 190 200 pF1KB9 ARVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSGLPYASSLGYPTAGAGAF .: .. . :. :: . : : .: . .. : . : . 
: .:.: NP_008 GRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEE----EEAAAAE 190 200 210 220 230 210 220 230 240 250 260 pF1KB9 HGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWP-SPGLQPP--LAYILLPGMGK .: ..:.. . : .. :.:.. .::: .: :::: ... .: . NP_008 EGEEETVASGEESLGFLSRLPPGPAG------LDCSALDRDPDLQPPSGTSHFEFPDYCT 240 250 260 270 280 290 270 pF1KB9 PQLDPYPAAYAAAL :.. NP_008 PEVTEMIAGDWRPSSIADLVFTY 300 310 >>NP_055402 (OMIM: 605923) transcription factor SOX-8 [H (446 aa) initn: 466 init1: 393 opt: 428 Z-score: 311.0 bits: 66.0 E(85289): 1.4e-10 Smith-Waterman score: 428; 34.4% identity (58.0% similar) in 262 aa overlap (2-247:98-346) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENP .:: ::::::::::::..: :::.:.. : NP_055 IRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKP--HVKRPMNAFMVWAQAARRKLADQYP 70 80 90 100 110 120 40 50 60 70 80 pF1KB9 KMHNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLL---K ..::.:.:: :: :.::.:::::::..::.:::..: :.::::::.:::. .. NP_055 HLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSAKAGHSD 130 140 150 160 170 180 90 100 110 120 130 140 pF1KB9 KDKFAFPVPYGLGGVADAEHPALKAGA--GLHAGAGGGLVPESLLANPEKAAAAAAAAAA .:. : :. ::.. . .: : : :.: : : . ..:. :.: NP_055 SDSGAELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTHG--PPTPPTTPKTELQQAGAK-- 190 200 210 220 230 240 150 160 170 180 190 pF1KB9 RVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSGLPYAS-----SLGYPT-- :. . . .. . .: .:.. .:. .. ... :: :. NP_055 ----PELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEFDQYLPLGGPAPP 250 260 270 280 290 200 210 220 230 240 250 pF1KB9 ----AGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYI : .::. :.:. . : .: . . : : : : . 
:::: NP_055 EPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPR---PHIKTEQPSPGHYGDQPRG 300 310 320 330 340 350 260 270 pF1KB9 LLPGMGKPQLDPYPAAYAAAL NP_055 SPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYAPGLYQYPCFHSPRRP 360 370 380 390 400 410 >>NP_000337 (OMIM: 114290,608160,616425) transcription f (509 aa) initn: 422 init1: 396 opt: 423 Z-score: 306.9 bits: 65.5 E(85289): 2.4e-10 Smith-Waterman score: 433; 32.5% identity (55.4% similar) in 289 aa overlap (2-271:99-364) 10 20 30 pF1KB9 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENP :: ::::::::::::..: :::.:.. : NP_000 FPVCIREAVSQVLKGYDWTLVPMPVRVNGSSKNKPHVKRPMNAFMVWAQAARRKLADQYP 70 80 90 100 110 120 40 50 60 70 80 90 pF1KB9 KMHNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDK ..::.:.:: :: :.::.:::::::..::.:::..: :.::::::.:::. :.. . . NP_000 HLHNAELSKTLGKLWRLLNESEKRPFVEEAERLRVQHKKDHPDYKYQPRRR-KSVKNGQA 130 140 150 160 170 180 100 110 120 130 pF1KB9 FAFPV-------PYGLGGVADAEHPALKAGA------GLHAGAGGGLVPESLLANPEKAA : . : .. . .:. : ..: : :.: . : : . ..:. . NP_000 EAEEATEQTHISPNAIFKALQADSPHSSSGMSEVHSPGEHSGQSQG--PPTPPTTPKTDV 190 200 210 220 230 240 140 150 160 170 180 190 pF1KB9 AAAAAAAARVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSGLP------YA . : : : .. . .:.: ... :. . : NP_000 QPGKADLKREGRPLPEGGRQPPI-------DFRDVDIGELSSDVISNIETFDVNEFDQYL 250 260 270 280 290 200 210 220 230 240 250 pF1KB9 SSLGYPTAGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWPSPGLQPPL :.: :. : :: .. ... : . . .:.. :.. . .: : : ::: NP_000 PPNGHP--GVPATHGQVTYTGSY-----GISSTAATPASAGHVWMSKQQAPPPPPQQPPQ 300 310 320 330 340 350 260 270 pF1KB9 AYILLPGMGKPQLDPYPAAYAAAL : :. :: : : : NP_000 AP---PA---PQAPPQPQAAPPQQPAAPPQQPQAHTLTTLSSEPGQSQRTHIKTEQLSPS 360 370 380 390 400 276 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:59:14 2016 done: Fri Nov 4 17:59:15 2016 Total Scan time: 7.460 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]