FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7628, 373 aa
  1>>>pF1KB7628 373 - 373 aa - 373 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.8649+/-0.000733; mu= 7.5956+/- 0.045
 mean_var=180.0620+/-37.154, 0's: 0 Z-trim(116.1): 12 B-trim: 0 in 0/52
 Lambda= 0.095579
 statistics sampled from 16664 (16675) to 16664 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.811), E-opt: 0.2 (0.512), width: 16
 Scan time: 2.830

The best scores are:                                 opt  bits E(32554)
CCDS6442.1 DMRT1 gene_id:1761|Hs108|chr9    ( 373) 2563 364.8 6.8e-101
CCDS44141.1 DMRTA2 gene_id:63950|Hs108|chr1 ( 542)  436  71.7 1.8e-12
CCDS6514.1 DMRTA1 gene_id:63951|Hs108|chr9  ( 504)  435  71.5 1.8e-12
CCDS6445.1 DMRT2 gene_id:10655|Hs108|chr9   ( 226)  416  68.6 6.1e-12
CCDS6444.1 DMRT2 gene_id:10655|Hs108|chr9   ( 561)  416  68.9 1.2e-11

>>CCDS6442.1 DMRT1 gene_id:1761|Hs108|chr9 (373 aa)
 initn: 2563 init1: 2563 opt: 2563 Z-score: 1924.9 bits: 364.8 E(32554): 6.8e-101
Smith-Waterman score: 2563; 100.0% identity (100.0% similar) in 373 aa overlap (1-373:1-373)

               10        20        30        40        50        60
pF1KB7 MPNDEAFSKPSTPSEAPHAPGVPPQGRAGGFGKASGALVGAASGSSAGGSSRGGGSGSGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 MPNDEAFSKPSTPSEAPHAPGVPPQGRAGGFGKASGALVGAASGSSAGGSSRGGGSGSGA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 SDLGAGSKKSPRLPKCARCRNHGYASPLKGHKRFCMWRDCQCKKCNLIAERQRVMAAQVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 SDLGAGSKKSPRLPKCARCRNHGYASPLKGHKRFCMWRDCQCKKCNLIAERQRVMAAQVA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 LRRQQAQEEELGISHPIPLPSAAELLVKRENNGSNPCLMTECSGTSQPPPASVPTTAASE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 LRRQQAQEEELGISHPIPLPSAAELLVKRENNGSNPCLMTECSGTSQPPPASVPTTAASE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 GRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSFYQPSLFPYYNNLYNCPQYSMALAADSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 GRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSFYQPSLFPYYNNLYNCPQYSMALAADSA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 SGEVGNPLGGSPVKNSLRGLPGPYVPGQTGNQWQMKNMENRHAMSSQYRMHSYYPPPSYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 SGEVGNPLGGSPVKNSLRGLPGPYVPGQTGNQWQMKNMENRHAMSSQYRMHSYYPPPSYL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 GQSVPQFFTFEDAPSYPEARASVFSPPSSQDSGLVSLSSSSPISNKSTKAVLECEPASEP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 GQSVPQFFTFEDAPSYPEARASVFSPPSSQDSGLVSLSSSSPISNKSTKAVLECEPASEP
              310       320       330       340       350       360

              370
pF1KB7 SSFTVTPVIEEDE
       :::::::::::::
CCDS64 SSFTVTPVIEEDE
              370

>>CCDS44141.1 DMRTA2 gene_id:63950|Hs108|chr1 (542 aa)
 initn: 436 init1: 393 opt: 436 Z-score: 337.6 bits: 71.7 E(32554): 1.8e-12
Smith-Waterman score: 439; 34.5% identity (58.2% similar) in 275 aa overlap (14-272:5-275)

10 20 30 40 50
pF1KB7 MPNDEAFSKPSTPSEAPHAPGVPPQGRAGGFGKASGALVGAASGSSAGGS---SRGGGSG
:: : .::. . : . : ......:....:..: : .::
CCDS44 MELRSELPSVPGAATAAAATATGPPVASVASVAAAAAAAASLPVSVAGGLL
10 20 30 40 50

60 70 80 90 100 110
pF1KB7 SGASDLGAGSKKSPRLPKCARCRNHGYASPLKGHKRFCMWRDCQCKKCNLIAERQRVMAA
: : ...: :: :::::::::: .: ::::::.: :.:: : ::.:::::::::::
CCDS44 RGPPLLLRAAEKYPRTPKCARCRNHGVVSALKGHKRYCRWKDCLCAKCTLIAERQRVMAA
60 70 80 90 100 110

120 130 140 150 160 170
pF1KB7 QVALRRQQAQEEELGISHPIPLPSAAELLVKRENNGSNP------CLMTECSGTSQPPPA
::::::::::::. . . : ..:: :. :: : . . :.. . : :
CCDS44 QVALRRQQAQEENEARELQL-LYGTAEGLALAAANGIIPPRPAYEVFGSVCAADGGGPGA
120 130 140 150 160 170

180 190 200 210 220 230
pF1KB7 SVPTTAASEGRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSFYQPSLFPYYNNLYNC-PQ
..: :. : . . .. . . : . .. .: : . : . . :
CCDS44 GAP---AGTGGGAAGAGGSEAKLQKFDLFPKTLLQAGRPGSPLPPPVKPLSPDGADSGPG
180 190 200 210 220

240 250 260 270 280
pF1KB7 YSMALAADSASGEVGN--PLGGSPV----KNSLRGLPGPYVPGQTGNQWQMKNMENRHAM
: . ....: :. ..:::. :.. . :: :: :..
CCDS44 TSSPEVRPGSGSENGDGESFSGSPLARASKEAGGSCPGSAGPGGGGEEDSPGSASPLGSE
230 240 250 260 270 280

290 300 310 320 330 340
pF1KB7 SSQYRMHSYYPPPSYLGQSVPQFFTFEDAPSYPEARASVFSPPSSQDSGLVSLSSSSPIS
CCDS44 SGSEADKEEGEAAPAPGLGGGSGPRQRTPLDILTRVFPGHRRGVLELVLQGCGGDVVQAI
290 300 310 320 330 340

>>CCDS6514.1 DMRTA1 gene_id:63951|Hs108|chr9 (504 aa)
 initn: 442 init1: 407 opt: 435 Z-score: 337.3 bits: 71.5 E(32554): 1.8e-12
Smith-Waterman score: 435; 57.9% identity (72.2% similar) in 126 aa overlap (10-129:28-150)

10 20 30
pF1KB7 MPNDEAFSKPSTPSEAPHAPG---VPPQG-RAGG-FGKASGA
: :: : .:. ::: : . : .:..:
CCDS65 MERSQCGSRDRGVSGRPHLAPGLVVAAPPPPSPALPVPSGMQVPPAFLRPPSLFLRAAAA
10 20 30 40 50 60

40 50 60 70 80 90
pF1KB7 LVGAASGSS-AGGSSRGGGSGSGASDLGAGSKKSPRLPKCARCRNHGYASPLKGHKRFCM
..::...: .:: . : ::.. .: : :: :::::::::: .: ::::::::
CCDS65 AAAAAAATSGSGGCPPAPGLESGVGAVGCG---YPRTPKCARCRNHGVVSALKGHKRFCR
70 80 90 100 110

100 110 120 130 140 150
pF1KB7 WRDCQCKKCNLIAERQRVMAAQVALRRQQAQEEELGISHPIPLPSAAELLVKRENNGSNP
:::: : ::.:::::::::::::::::::::::
CCDS65 WRDCACAKCTLIAERQRVMAAQVALRRQQAQEESEARGLQRLLCSGLSWPPGGRASGGGG
120 130 140 150 160 170

160 170 180 190 200 210
pF1KB7 CLMTECSGTSQPPPASVPTTAASEGRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSFYQP
CCDS65 RAENPQSTGGPAAGAALGLGALRQASGSATPAFEVFQQDYPEEKQEQKESKCESCQNGQE
180 190 200 210 220 230

>>CCDS6445.1 DMRT2 gene_id:10655|Hs108|chr9 (226 aa)
 initn: 433 init1: 371 opt: 416 Z-score: 327.8 bits: 68.6 E(32554): 6.1e-12
Smith-Waterman score: 416; 50.4% identity (65.6% similar) in 131 aa overlap (4-134:56-181)

10 20 30
pF1KB7 MPNDEAFSKPSTPSEAPHAPGVPPQGRAGGFGK
:: . . :: .::.: : . : .
CCDS64 VCGAPRSTPPGPSPPPADGDCEDDEDDDGVDEDAEEEGDGEEAGASPGMPGQPEQRGGPQ
30 40 50 60 70 80

40 50 60 70 80 90
pF1KB7 ASGALVGAASGSSAGGSSRGGGSGSGASDLGAGSKKSPRLPKCARCRNHGYASPLKGHKR
:. :: ...: : .:.:: .: : :::::::::: .: ::::::
CCDS64 PRPPLAPQASPAGTGPRERCTPAGGGAEP-----RKLSRTPKCARCRNHGVVSCLKGHKR
90 100 110 120 130 140

100 110 120 130 140 150
pF1KB7 FCMWRDCQCKKCNLIAERQRVMAAQVALRRQQAQEEELGISHPIPLPSAAELLVKRENNG
:: :::::: .: :..::::::::::::::::: :.. :.:
CCDS64 FCRWRDCQCANCLLVVERQRVMAAQVALRRQQATEDKKGLSGKQNNFERKAVYQRQVRAP
150 160 170 180 190 200

160 170 180 190 200 210
pF1KB7 SNPCLMTECSGTSQPPPASVPTTAASEGRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSF
CCDS64 SLLAKSILEVLLGLFYSYYVYIMNHL
210 220

>>CCDS6444.1 DMRT2 gene_id:10655|Hs108|chr9 (561 aa)
 initn: 402 init1: 371 opt: 416 Z-score: 322.5 bits: 68.9 E(32554): 1.2e-11
Smith-Waterman score: 416; 50.4% identity (65.6% similar) in 131 aa overlap (4-134:56-181)

10 20 30
pF1KB7 MPNDEAFSKPSTPSEAPHAPGVPPQGRAGGFGK
:: . . :: .::.: : . : .
CCDS64 VCGAPRSTPPGPSPPPADGDCEDDEDDDGVDEDAEEEGDGEEAGASPGMPGQPEQRGGPQ
30 40 50 60 70 80

40 50 60 70 80 90
pF1KB7 ASGALVGAASGSSAGGSSRGGGSGSGASDLGAGSKKSPRLPKCARCRNHGYASPLKGHKR
:. :: ...: : .:.:: .: : :::::::::: .: ::::::
CCDS64 PRPPLAPQASPAGTGPRERCTPAGGGAEP-----RKLSRTPKCARCRNHGVVSCLKGHKR
90 100 110 120 130 140

100 110 120 130 140 150
pF1KB7 FCMWRDCQCKKCNLIAERQRVMAAQVALRRQQAQEEELGISHPIPLPSAAELLVKRENNG
:: :::::: .: :..::::::::::::::::: :.. :.:
CCDS64 FCRWRDCQCANCLLVVERQRVMAAQVALRRQQATEDKKGLSGKQNNFERKAVYQRQVRAP
150 160 170 180 190 200

160 170 180 190 200 210
pF1KB7 SNPCLMTECSGTSQPPPASVPTTAASEGRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSF
CCDS64 SLLAKSILEGYRPIPAETYVGGTFPLPPPVSDRMRKRRAFADKELENIMLEREYKEREML
210 220 230 240 250 260

373 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 09:03:43 2016 done: Fri Nov 4 09:03:43 2016
 Total Scan time: 2.830 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]