FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9652, 317 aa
  1>>>pF1KB9652 317 - 317 aa - 317 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.0291+/-0.00069; mu= 9.2110+/- 0.042
 mean_var=119.0964+/-24.141, 0's: 0 Z-trim(114.1): 76  B-trim: 150 in 2/51
 Lambda= 0.117524
 statistics sampled from 14635 (14715) to 14635 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.452), width:  16
 Scan time:  2.940

The best scores are:                                 opt  bits E(32554)
CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3      ( 317) 2167 377.6 7.1e-105
CCDS14669.1 SOX3 gene_id:6658|Hs108|chrX     ( 446)  923 166.8 2.9e-41
CCDS9523.1 SOX1 gene_id:6656|Hs108|chr13     ( 391)  780 142.5 5.2e-34
CCDS9473.1 SOX21 gene_id:11166|Hs108|chr13   ( 276)  620 115.3 5.7e-26
CCDS3094.1 SOX14 gene_id:8403|Hs108|chr3     ( 240)  601 112.0 4.8e-25
CCDS32549.1 SOX15 gene_id:6665|Hs108|chr17   ( 233)  499  94.7 7.5e-20
CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8    ( 414)  466  89.2 5.8e-18
CCDS10428.1 SOX8 gene_id:30812|Hs108|chr16   ( 446)  433  83.7 3e-16
CCDS14772.1 SRY gene_id:6736|Hs108|chrY      ( 204)  424  81.9 4.5e-16
CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20  ( 384)  422  81.8 9.6e-16
CCDS12995.1 SOX12 gene_id:6666|Hs108|chr20   ( 315)  415  80.5 1.9e-15
CCDS13964.1 SOX10 gene_id:6663|Hs108|chr22   ( 466)  412  80.1 3.7e-15
CCDS1654.1 SOX11 gene_id:6664|Hs108|chr2     ( 441)  410  79.8 4.4e-15
CCDS11689.1 SOX9 gene_id:6662|Hs108|chr17    ( 509)  407  79.3 7.1e-15
CCDS4547.1 SOX4 gene_id:6659|Hs108|chr6      ( 474)  402  78.4 1.2e-14
CCDS5977.1 SOX7 gene_id:83595|Hs108|chr8     ( 388)  386  75.7 6.7e-14
CCDS41761.1 SOX5 gene_id:6660|Hs108|chr12    ( 377)  338  67.5 1.8e-11
CCDS58216.1 SOX5 gene_id:6660|Hs108|chr12    ( 642)  338  67.7 2.9e-11
CCDS81672.1 SOX5 gene_id:6660|Hs108|chr12    ( 728)  338  67.7 3.2e-11
CCDS44844.1 SOX5 gene_id:6660|Hs108|chr12    ( 750)  338  67.7 3.2e-11
CCDS58217.1 SOX5 gene_id:6660|Hs108|chr12    ( 753)  338  67.7 3.3e-11
CCDS8699.1 SOX5 gene_id:6660|Hs108|chr12     ( 763)  338  67.7 3.3e-11
CCDS53604.1 SOX6 gene_id:55553|Hs108|chr11   ( 801)  329  66.2 9.9e-11
CCDS53605.1 SOX6 gene_id:55553|Hs108|chr11   ( 804)  329  66.2 9.9e-11
CCDS7821.1 SOX6 gene_id:55553|Hs108|chr11    ( 808)  329  66.2 9.9e-11

>>CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3 (317 aa)
 initn: 2167 init1: 2167 opt: 2167  Z-score: 1996.3  bits: 377.6 E(32554): 7.1e-105
Smith-Waterman score: 2167; 100.0% identity (100.0% similar) in 317 aa overlap (1-317:1-317)

               10        20        30        40        50        60
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS
              250       260       270       280       290       300

              310
pF1KB9 GPVPGTAINGTLPLSHM
       :::::::::::::::::
CCDS32 GPVPGTAINGTLPLSHM
              310

>>CCDS14669.1 SOX3 gene_id:6658|Hs108|chrX (446 aa)
 initn: 1183 init1: 656 opt: 923  Z-score: 854.2  bits: 166.8 E(32554): 2.9e-41
Smith-Waterman
score: 1153; 52.8% identity (74.0% similar) in 377 aa overlap (1-317:76-446) 10 20 pF1KB9 MYNMMETELKPP-G-PQQTSG-------GG ::...::::: : : : :..: :: CCDS14 ESQGLFTVAAPAPGAPSPPATLAHLLPAPAMYSLLETELKNPVGTPTQAAGTGGPAAPGG 50 60 70 80 90 100 30 40 50 60 pF1KB9 GGNSTAAAAGGNQKNS--------------PDRVKRPMNAFMVWSRGQRRKMAQENPKMH .:.:.: :::: .... :::::::::::::::::::::: :::::: CCDS14 AGKSSANAAGGANSGGGSSGGASGGGGGTDQDRVKRPMNAFMVWSRGQRRKMALENPKMH 110 120 130 140 150 160 70 80 90 100 110 120 pF1KB9 NSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTL ::::::::::.::::...::::::::::::::.::::.::::::::::::::.:::::.: CCDS14 NSEISKRLGADWKLLTDAEKRPFIDEAKRLRAVHMKEYPDYKYRPRRKTKTLLKKDKYSL 170 180 190 200 210 220 130 140 150 160 170 180 pF1KB9 PGGLLAPGGNSMASGVGVGAGLGA---GVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHP :.::: ::. . :......:. .. ::.::.:.:.:.:::.::.::..:.:::: : : CCDS14 PSGLLPPGAAAAAAAAAAAAAAASSPVGVGQRLDTYTHVNGWANGAYSLVQEQLGYAQPP 230 240 250 260 270 280 190 200 210 220 pF1KB9 GLNAHGAAQ-MQPMHRYDVSALQYNSMT--SSQTYMN---------G----SPTYSMS-- .... . ::::::...:::. : ..:.::: : .:. . . CCDS14 SMSSPPPPPALPPMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASGYGGMAPSATAAAA 290 300 310 320 330 340 230 240 250 260 270 pF1KB9 --YSQQ---------GTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMIS :.:: .. .:.:: :::::::: :: ::.. .:::. : :::::::: CCDS14 AAYGQQPATAAAAAAAAAAMSLGPMGSVVKSEPSSPPPAI--ASHSQRACL-GDLRDMIS 350 360 370 380 390 400 280 290 300 310 pF1KB9 MYLP-GAEVPEPAAP---SRLH-MSQHYQSGPVPGTAINGTLPLSHM :::: :... . :.: .::: . ::::.. :::.:::.::.:. CCDS14 MYLPPGGDAADAASPLPGGRLHGVHQHYQGA---GTAVNGTVPLTHI 410 420 430 440 >>CCDS9523.1 SOX1 gene_id:6656|Hs108|chr13 (391 aa) initn: 1037 init1: 728 opt: 780 Z-score: 724.0 bits: 142.5 E(32554): 5.2e-34 Smith-Waterman score: 1095; 54.0% identity (69.7% similar) in 363 aa overlap (1-288:1-358) 10 20 30 40 50 pF1KB9 MYNMM-ETELKPPGPQQT---------SGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMV ::.:: ::.:. :: :. .:::::.. ....::. : . 
:::::::::::: CCDS95 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 WSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKY ::::::::::::::::::::::::::::::..::.::::::::::::::::::::::::: CCDS95 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY 70 80 90 100 110 120 120 130 140 150 pF1KB9 RPRRKTKTLMKKDKYTLPGGLLAPG----GNSMASGVGVGAGLGAGVNQRMDS------- :::::::::.:::::.: ::::: : : ..: :::::.: .:.:.::..: CCDS95 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG-AAAVGQRLESPGGAAGG 130 140 150 160 170 160 170 180 190 pF1KB9 -YAHMNGWSNGSY-----------SMMQD-QLGYPQHPGL-----NAHGAAQM------- :::.:::.::.: .:::. ::.: :::: .:: : CCDS95 GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAH 180 190 200 210 220 230 200 210 220 230 pF1KB9 ----QPMHRYDVSALQYNSMTSSQTYMNGSPT------YSMSYSQQGTPGMA-------- :::::::..::::. ...:: ::..::. :. . . .. : : CCDS95 PHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAA 240 250 260 270 280 290 240 250 260 270 280 pF1KB9 -----------LGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPE ::..::.:::: :.::: . .:::::: ::::.:::::::..: . 
CCDS95 AAAAAAASSGALGALGSLVKSEPSGSPP---APAHSRAPCP-GDLREMISMYLPAGEGGD 300 310 320 330 340 350 290 300 310 pF1KB9 PAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM ::: CCDS95 PAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI 360 370 380 390 >>CCDS9473.1 SOX21 gene_id:11166|Hs108|chr13 (276 aa) initn: 624 init1: 569 opt: 620 Z-score: 579.7 bits: 115.3 E(32554): 5.7e-26 Smith-Waterman score: 620; 43.0% identity (62.9% similar) in 272 aa overlap (39-304:6-269) 10 20 30 40 50 60 pF1KB9 LKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHN :.:::::::::::::.:::::::::::::: CCDS94 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN 10 20 30 70 80 90 100 110 120 pF1KB9 SEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLP ::::::::::::::.:.::::::::::::::.::::::::::::::: :::.::::...: CCDS94 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB9 -----GGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQH ::. .. .:.:. :: :.:. .: . .. . .. .:: CCDS94 VPYGLGGVADAEHPALKAGAGLHAGAGGGLVP--ESLLANPEKAAAAAAAAAARVFFPQS 100 110 120 130 140 150 190 200 210 220 230 240 pF1KB9 PGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM-GS . : .:: :.. : ..:. .. .: : :. :. : : . :.. :. CCDS94 AAAAAAAAAAAAAGSPYSLLDLG-SKMAEISSSSSGLP-YA---SSLGYPTAGAGAFHGA 160 170 180 190 200 250 260 270 280 290 300 pF1KB9 VVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGP .. . :... . :: .: . : . :. . : : : . 
: : CCDS94 AAAAAAAAAAAGGHTHSHP-SPGNPGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDP 210 220 230 240 250 260 310 pF1KB9 VPGTAINGTLPLSHM : CCDS94 YPAAYAAAL 270 >>CCDS3094.1 SOX14 gene_id:8403|Hs108|chr3 (240 aa) initn: 626 init1: 574 opt: 601 Z-score: 563.2 bits: 112.0 E(32554): 4.8e-25 Smith-Waterman score: 601; 48.6% identity (69.5% similar) in 220 aa overlap (39-254:6-216) 10 20 30 40 50 60 pF1KB9 LKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHN :..::::::::::::::::::::::::::: CCDS30 MSKPSDHIKRPMNAFMVWSRGQRRKMAQENPKMHN 10 20 30 70 80 90 100 110 120 pF1KB9 SEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLP ::::::::::::::::.::::.::::::::: ::::::::::::::: :.:.:::.:..: CCDS30 SEISKRLGAEWKLLSEAEKRPYIDEAKRLRAQHMKEHPDYKYRPRRKPKNLLKKDRYVFP 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB9 GGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHPGLNA :. :.:. :::. : .. . : . ... ::... : . . .: CCDS30 LPYLGDTDPLKAAGLPVGASDGL-LSAPEKARAFLPP-ASAPYSLLD-----PAQFSSSA 100 110 120 130 140 190 200 210 220 230 240 pF1KB9 HGAAQMQPMHRYDVSALQYNSMTSSQTYMNGS---PT-YSMSYSQQGTPGMALGSMGSVV : : ..:: : : . :. :: :. .. .. . .::... . .. CCDS30 IQKMGEVP-HTLATGALPYASTLGYQNGAFGSLSCPSQHTHTHPSPTNPGYVV-PCNCTA 150 160 170 180 190 200 250 260 270 280 290 300 pF1KB9 KSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGPVP : .. .::: CCDS30 WSASTLQPPVAYILFPGMTKTGIDPYSSAHATAM 210 220 230 240 >>CCDS32549.1 SOX15 gene_id:6665|Hs108|chr17 (233 aa) initn: 477 init1: 458 opt: 499 Z-score: 469.9 bits: 94.7 E(32554): 7.5e-20 Smith-Waterman score: 499; 43.9% identity (64.5% similar) in 214 aa overlap (13-221:28-227) 10 20 30 40 pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSP-DRVKRP :::. :.: . :: : . : ..:::: CCDS32 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREGAG----SPAAPG----TLPLEKVKRP 10 20 30 40 50 50 60 70 80 90 100 pF1KB9 MNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKE :::::::: .:::.:::.:::::::::::::::.::::.: :::::..::::::: :... 
CCDS32 MNAFMVWSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRD 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB9 HPDYKYRPRRKTKTLMKKDKYTLPG-GLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHM .::::::::::.:. . : : :: :: . : .. . : . : ::. CCDS32 YPDYKYRPRRKAKSSGAGPSRCGQGRGNLASGGPLWGPGYATTQP-SRGFGYRPPSYS-- 120 130 140 150 160 170 180 190 200 210 220 pF1KB9 NGWSNGSYSMMQDQLGYPQH---PGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGS ... :::. . .: :. : . . ... : . . : .: : . . :. CCDS32 TAYLPGSYGSSHCKLEAPSPCSLPQSDPRLQGELLPTYTH---YLPPGSPTPYNPPLAGA 170 180 190 200 210 220 230 240 250 260 270 280 pF1KB9 PTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPG : CCDS32 PMPLTHL 230 >>CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8 (414 aa) initn: 467 init1: 398 opt: 466 Z-score: 435.9 bits: 89.2 E(32554): 5.8e-18 Smith-Waterman score: 468; 31.7% identity (56.7% similar) in 312 aa overlap (9-305:36-331) 10 20 30 pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSP :.: : ....: . .:: : :..... .. CCDS61 AGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGE 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB9 DRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLR .:..:::::::::.. .:...::.:: .::.:.:: :: :: :. .:::::..::.::: CCDS61 SRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLR 70 80 90 100 110 120 100 110 120 130 140 150 pF1KB9 ALHMKEHPDYKYRPRRKTKTLMKKDKYTLPG---GLLAPGGNSMASGVGVGAGLGAGVNQ . ::..::.:::::::. . .:. : . : :: : . ... : : : :.. CCDS61 VQHMQDHPNYKYRPRRRKQ--VKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLGLQF 130 140 150 160 170 180 160 170 180 190 200 pF1KB9 RMDSYA--------HMNGWSNGSYSMMQDQL-GYPQHPGLNAHGAAQMQPMHRYDVSALQ ... ::.: :. : ::: : . . .:. : . CCDS61 PEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPL-P------TPDTSPLDGVDPDPAF 190 200 210 220 230 210 220 230 240 250 260 pF1KB9 YNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSMGSVVKSE-ASSSPPVVTSSSHSRAPC . . .. :. .:.. . : : : : . : :. : : . 
:: CCDS61 FAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLL------APP 240 250 260 270 280 290 270 280 290 300 310 pF1KB9 QAGDLRDMISMYLPGAEVPE--PAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM .: . . .: ::: . :.. :. :: . : :: CCDS61 SALHVY-YGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPS 300 310 320 330 340 CCDS61 QPAELLGEVDRTEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYC 350 360 370 380 390 400 >>CCDS10428.1 SOX8 gene_id:30812|Hs108|chr16 (446 aa) initn: 387 init1: 387 opt: 433 Z-score: 405.2 bits: 83.7 E(32554): 3e-16 Smith-Waterman score: 441; 31.5% identity (56.0% similar) in 327 aa overlap (14-313:85-396) 10 20 30 40 pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKR :. . :::: : : .: .::: CCDS10 DPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGG---------GALKAKP-HVKR 60 70 80 90 100 50 60 70 80 90 100 pF1KB9 PMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMK ::::::::... :::.:.. :..::.:.:: :: :.::::.:::::..::.:::. : : CCDS10 PMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKK 110 120 130 140 150 160 110 120 130 140 150 160 pF1KB9 EHPDYKYRPRRKTKTLMKKDKYTLPGGLLAP--GGNSMASGVGVGAGLGAGVNQRMDSYA .::::::.:::. :. . . :. :.: ::... . . :::: : ... : . CCDS10 DHPDYKYQPRRR-KSAKAGHSDSDSGAELGPHPGGGAVYK---AEAGLGDG-HHHGDHTG 170 180 190 200 210 170 180 190 200 210 pF1KB9 HMNGWSNGSYSMMQDQLGYPQHPGLNAHG------AAQMQPMHRYDVSALQYNSMTSSQT . .: . . . .: :. .: . : . :.: :. . : . .. CCDS10 QTHGPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDA 220 230 240 250 260 270 220 230 240 250 pF1KB9 --------YMN-GSPT-------YSMSYSQQG-TPGMALGSMGSVVKSEASSSPPVVTSS :. :.:. :. .: . : .: : : :. : . ..:: . CCDS10 FDVHEFDQYLPLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIK 280 290 300 310 320 330 260 270 280 290 300 310 pF1KB9 SHSRAPCQAGDLRDMISMY--LPGAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSH ... .: . :: : : ::::. ... . : . ... :. 
: CCDS10 TEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYA 340 350 360 370 380 390 pF1KB9 M CCDS10 PGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP 400 410 420 430 440 >>CCDS14772.1 SRY gene_id:6736|Hs108|chrY (204 aa) initn: 434 init1: 416 opt: 424 Z-score: 402.0 bits: 81.9 E(32554): 4.5e-16 Smith-Waterman score: 439; 45.6% identity (69.6% similar) in 171 aa overlap (31-200:49-198) 10 20 30 40 50 pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQK-NSPDRVKRPMNAFMVWSRGQRRKM : :.: : ::::::::::.:::: ::::: CCDS14 PAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKM 20 30 40 50 60 70 60 70 80 90 100 110 pF1KB9 AQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTL : :::.:.::::::.:: .::.:.:.:: ::..::..:.:.: ...:.::::::::.: . CCDS14 ALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAK-M 80 90 100 110 120 130 120 130 140 150 160 170 pF1KB9 MKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLG . :. :: : .. . : : ..:. . . .....: :. ::: CCDS14 LPKNCSLLP----ADPASVLCSEV------------QLDNRLYRDDCTKATHSRMEHQLG 140 150 160 170 180 180 190 200 210 220 230 pF1KB9 YPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGS . : .:: :.. : :: CCDS14 H--LPPINA--ASSPQQRDRYSHWTKL 190 200 >>CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20 (384 aa) initn: 439 init1: 383 opt: 422 Z-score: 396.1 bits: 81.8 E(32554): 9.6e-16 Smith-Waterman score: 422; 46.5% identity (74.4% similar) in 129 aa overlap (11-135:51-176) 10 20 30 pF1KB9 MYNMMETELKPPGPQQTSGGGG--GNSTAAAAGGNQKNSP ::.::.. . : . :: ..... CCDS13 AWAPGHGAAADTRGLAAGPAALAAPAAPASPPSPQRSPPRSPEPGRYGLSPAGRGERQAA 30 40 50 60 70 80 40 50 60 70 80 90 pF1KB9 D--RVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKR : :..:::::::::.. .:...::.:: .::. .:: :: :: :. .:::::..::.: CCDS13 DESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLGKAWKELNAAEKRPFVEEAER 90 100 110 120 130 140 100 110 120 130 140 150 pF1KB9 LRALHMKEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQR ::. :...::.:::::::: .. .: . 
                                      :: :: ::
CCDS13 LRVQHLRDHPNYKYRPRRKKQA--RKARRLEPG-LLLPGLAPPQPPPEPFPAASGSARAF
              150       160         170        180       190

        160       170       180       190       200       210
pF1KB9 MDSYAHMNGWSNGSYSMMQDQLGYPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTY

CCDS13 RELPPLGAEFDGLGLPTPERSPLDGLEPGEAAFFPPPAAPEDCALRPFRAPYAPTELSRD
       200       210       220       230       240       250


317 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 17:58:40 2016 done: Fri Nov  4 17:58:41 2016
 Total Scan time:  2.940 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]
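
The rows listed under "The best scores are:" follow a regular layout: accession, description, library sequence length in parentheses, then the opt score, bit score, and E-value. The Python sketch below shows one possible way to pull those fields out of a single row. It is not part of the FASTA package; the names `Hit`, `HIT_RE`, and `parse_hit` are hypothetical, and the regular expression assumes the row layout seen in this report rather than any documented format guarantee.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the best-scores table (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed layout: "<accession> <description> ( <length>) <opt> <bits> <E>"
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<e>\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)\s*$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a best-scores row, or None for any other line."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=float(m["e"]),
    )


# Row copied from the table in this report.
example = "CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3 ( 317) 2167 377.6 7.1e-105"
hit = parse_hit(example)
```

Lines that do not match the assumed layout, such as the table header or the `>>` alignment headers, return `None`, so the function can be applied to every line of a report and the results filtered.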