FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8993, 401 aa 1>>>pF1KB8993 401 - 401 aa - 401 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3186+/-0.000829; mu= 12.8123+/- 0.051 mean_var=85.9818+/-17.223, 0's: 0 Z-trim(108.8): 23 B-trim: 0 in 0/51 Lambda= 0.138316 statistics sampled from 10416 (10438) to 10416 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.679), E-opt: 0.2 (0.321), width: 16 Scan time: 2.490 The best scores are: opt bits E(32554) CCDS35475.1 HSFY1 gene_id:86614|Hs108|chrY ( 401) 2641 536.7 1.5e-152 CCDS14791.1 HSFY2 gene_id:159119|Hs108|chrY ( 401) 2641 536.7 1.5e-152 CCDS14790.1 HSFY1 gene_id:86614|Hs108|chrY ( 203) 1125 234.0 9.5e-62 CCDS35476.1 HSFY2 gene_id:159119|Hs108|chrY ( 203) 1125 234.0 9.5e-62 CCDS44011.1 HSFX1 gene_id:100506164|Hs108|chrX ( 423) 512 111.8 1.2e-24 CCDS48179.1 HSFX2 gene_id:100130086|Hs108|chrX ( 423) 512 111.8 1.2e-24 CCDS83499.1 LOC101928917 gene_id:101928917|Hs108|c ( 333) 331 75.7 7.2e-14 >>CCDS35475.1 HSFY1 gene_id:86614|Hs108|chrY (401 aa) initn: 2641 init1: 2641 opt: 2641 Z-score: 2852.4 bits: 536.7 E(32554): 1.5e-152 Smith-Waterman score: 2641; 100.0% identity (100.0% similar) in 401 aa overlap (1-401:1-401) 10 20 30 40 50 60 pF1KB8 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEASEESLFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 KRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEASEESLFS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 ASKNLNMPLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLTTIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 ASKNLNMPLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLTTIH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 MHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVEPSAVPTRYPLVSVNEAPYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVEPSAVPTRYPLVSVNEAPYR 310 320 330 340 350 360 370 380 390 400 pF1KB8 NMLPAGNPWLQMPTIADRSAAPHSRLALQPSPLDKYHPNYN ::::::::::::::::::::::::::::::::::::::::: CCDS35 NMLPAGNPWLQMPTIADRSAAPHSRLALQPSPLDKYHPNYN 370 380 390 400 >>CCDS14791.1 HSFY2 gene_id:159119|Hs108|chrY (401 aa) initn: 2641 init1: 2641 opt: 2641 Z-score: 2852.4 bits: 536.7 E(32554): 1.5e-152 Smith-Waterman score: 2641; 100.0% identity (100.0% similar) in 401 aa overlap (1-401:1-401) 10 20 30 40 50 60 pF1KB8 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEASEESLFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 KRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEASEESLFS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 ASKNLNMPLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLTTIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 ASKNLNMPLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLTTIH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 MHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVEPSAVPTRYPLVSVNEAPYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVEPSAVPTRYPLVSVNEAPYR 310 320 330 340 350 360 370 380 390 400 pF1KB8 NMLPAGNPWLQMPTIADRSAAPHSRLALQPSPLDKYHPNYN ::::::::::::::::::::::::::::::::::::::::: CCDS14 NMLPAGNPWLQMPTIADRSAAPHSRLALQPSPLDKYHPNYN 370 380 390 400 >>CCDS14790.1 HSFY1 gene_id:86614|Hs108|chrY (203 aa) initn: 1143 init1: 1125 opt: 1125 Z-score: 1222.1 bits: 234.0 E(32554): 9.5e-62 Smith-Waterman score: 1125; 98.9% identity (100.0% similar) in 174 aa overlap (1-174:1-174) 10 20 30 40 50 60 pF1KB8 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNF :::::::::::::::::::::::::::::::::::::::::::::::::::..: CCDS14 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKIRFTKMKLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEASEESLFS CCDS14 RSSTYENRYLCCNLHLKDESNYS 190 200 >>CCDS35476.1 HSFY2 gene_id:159119|Hs108|chrY (203 aa) initn: 1143 init1: 1125 opt: 1125 Z-score: 1222.1 bits: 234.0 E(32554): 9.5e-62 Smith-Waterman score: 1125; 98.9% identity (100.0% similar) in 174 aa overlap (1-174:1-174) 10 20 30 40 50 60 pF1KB8 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNF :::::::::::::::::::::::::::::::::::::::::::::::::::..: CCDS35 APYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKIRFTKMKLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEASEESLFS CCDS35 RSSTYENRYLCCNLHLKDESNYS 190 200 >>CCDS44011.1 HSFX1 gene_id:100506164|Hs108|chrX (423 aa) initn: 496 init1: 300 opt: 512 Z-score: 556.0 bits: 111.8 E(32554): 1.2e-24 Smith-Waterman score: 563; 35.9% identity (59.5% similar) in 370 aa overlap (35-397:55-395) 10 20 30 40 50 60 pF1KB8 SSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESPSYTV :. .:: . :: ::: :.. . .. : CCDS44 ERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLRLLTEEIAFQPLAEEASFRRPHPDG 30 40 50 60 70 80 70 80 90 100 110 120 pF1KB8 CVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETKAPYR : :. .:..::: ::.:::..: :.::.:: ::..:.: :::..::.::::. . .. CCDS44 DVP-PQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDSGACRVINQKLFEKEILKRDVAHK 90 100 110 120 130 140 130 140 150 160 170 180 pF1KB8 IFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNFKRGY .: : .:::: :::::::: : .: :. : : : .. :.:.::.:: .: :.: CCDS44 VFATTSIKSFFRQLNLYGFRKRRQCTFRT-FTRIF-SAKRLVSILNKLEFYCHPYFQRDS 150 160 170 180 190 200 190 200 210 220 230 240 pF1KB8 PQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALA-AEASEESLFSASK :.::::.:::.:::.: : .:: : :: : :: :.. ... : .. CCDS44 PHLLVRMKRRVGVKSA-PRHQ--EED---KPEAAG-------SCLAPADTEQQDHTSPNE 210 220 230 240 250 260 270 280 290 300 pF1KB8 NLNM-PLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLTTIHMH : .. : :: :. .. :::: ::. . : :. .:.:. . . CCDS44 NDQVTPQHREP------AGPNTQIRSGSAPPATPVMV-PDSAVASDNSPVTQPAGEWSEG 250 260 270 280 290 300 310 320 330 340 350 pF1KB8 S--HSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVE-PSAVPTRYPL-VSVNEAP : : : . : : .. : :: : . .: .: :.: .: : .... : CCDS44 SQAHVTPVAA----VPGPAALPFLYVPGSPTQMNSYGPVVALPTA--SRSTLAMDTTGLP 310 320 330 340 350 360 370 380 390 400 pF1KB8 YRNMLPAGNPWLQMPTIADRSAAPHSRLALQPS-PLDKYHPNYN .::: . :. . .: .: : . ... : : ..: CCDS44 APGMLPFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHRTSQYMPASDGPQAY 360 370 380 390 400 410 CCDS44 PDYADQST 420 >>CCDS48179.1 HSFX2 gene_id:100130086|Hs108|chrX (423 aa) initn: 496 init1: 300 opt: 512 Z-score: 556.0 bits: 111.8 E(32554): 1.2e-24 Smith-Waterman score: 563; 35.9% identity (59.5% similar) in 370 aa overlap (35-397:55-395) 10 20 30 40 50 60 pF1KB8 SSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQGSLLESPSYTV :. .:: . :: ::: :.. . .. : CCDS48 ERVPFPPQLQSETYLHPADPSPAWDDPGSTGSPNLRLLTEEIAFQPLAEEASFRRPHPDG 30 40 50 60 70 80 70 80 90 100 110 120 pF1KB8 CVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEILETKAPYR : :. .:..::: ::.:::..: :.::.:: ::..:.: :::..::.::::. . .. CCDS48 DVP-PQGEDNLLSLPFPQKLWRLVSSNQFSSIWWDDSGACRVINQKLFEKEILKRDVAHK 90 100 110 120 130 140 130 140 150 160 170 180 pF1KB8 IFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYYNPNFKRGY .: : .:::: :::::::: : .: :. : : : .. :.:.::.:: .: :.: CCDS48 VFATTSIKSFFRQLNLYGFRKRRQCTFRT-FTRIF-SAKRLVSILNKLEFYCHPYFQRDS 150 160 170 180 190 200 190 200 210 220 230 240 pF1KB8 PQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALA-AEASEESLFSASK :.::::.:::.:::.: : .:: : :: : :: :.. ... : .. CCDS48 PHLLVRMKRRVGVKSA-PRHQ--EED---KPEAAG-------SCLAPADTEQQDHTSPNE 210 220 230 240 250 260 270 280 290 300 pF1KB8 NLNM-PLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAILNQLTTIHMH : .. : :: :. .. :::: ::. . : :. .:.:. . . CCDS48 NDQVTPQHREP------AGPNTQIRSGSAPPATPVMV-PDSAVASDNSPVTQPAGEWSEG 250 260 270 280 290 300 310 320 330 340 350 pF1KB8 S--HSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVE-PSAVPTRYPL-VSVNEAP : : : . : : .. : :: : . .: .: :.: .: : .... : CCDS48 SQAHVTPVAA----VPGPAALPFLYVPGSPTQMNSYGPVVALPTA--SRSTLAMDTTGLP 310 320 330 340 350 360 370 380 390 400 pF1KB8 YRNMLPAGNPWLQMPTIADRSAAPHSRLALQPS-PLDKYHPNYN .::: . :. . .: .: : . ... : : ..: CCDS48 APGMLPFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHHCPHSHRTSQYMPASDGPQAY 360 370 380 390 400 410 CCDS48 PDYADQST 420 >>CCDS83499.1 LOC101928917 gene_id:101928917|Hs108|chrX (333 aa) initn: 328 init1: 307 opt: 331 Z-score: 362.4 bits: 75.7 E(32554): 7.2e-14 Smith-Waterman score: 377; 25.8% identity (57.4% similar) in 357 aa overlap (1-350:1-327) 10 20 30 40 50 pF1KB8 MAHVSSETQDVSPKDELTASEASTRSPLCEHTFPGDSDLRSMIEEHAFQVLSQ--GSLLE :: ..: . . ...: .. .: : . : ....: :..:: :: . CCDS83 MASQNTEQEYEAKLAPSVGGEPTSGGPSGSSPDP-NPDSSEVLDRHEDQAMSQDPGSQDN 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 SP--SYTVCVSEPDKDDDFLSLNFPRKLWKIVESDQFKSISWDENGTCIVINEELFKKEI :: . . : . . . ... :.:::::: ::: : :::.::...: ..:...::..:. CCDS83 SPPEDRNQRVVNVEDNHNLFRLSFPRKLWTIVEEDTFKSVSWNDDGDAVIIDKDLFQREV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 LETKAPYRIFQTDAIKSFVRQLNLYGFSKIQQNFQRSAFLATFLSEEKESSVLSKLKFYY :. :. :::.::.. ::.:::::::: : .. ..: .:. .: CCDS83 LQRKGAERIFKTDSLTSFIRQLNLYGFCK---------------TRPSNSPGNKKMMIYC 120 130 140 150 160 180 190 200 210 220 230 pF1KB8 NPNFKRGYPQLLVRVKRRIGVKNASPISTLFNEDFNKKHFRAGANMENHNSALAAEA--- : ::.: :.:: ..:. ...:.. .: :. . ... .. :: CCDS83 NSNFQRDKPRLLENIQRKDALRNTAQQATRVPTPKRKNLVATRRSLRIYHINARKEAIKM 170 180 190 200 210 220 240 250 260 270 280 290 pF1KB8 SEESLFSASKNLNMPLTRESSVRQIIANSSVPIRSGFPPPSPSTSVGPSEQIATDQHAIL ... :.. . :.:.. . . . :. .: :: :. ::: . .:. .. . CCDS83 CQQGAPSVQGPSGTQSFRRSGMWSKKSATRHPLGNG-PPQEPN---GPSWE-GTSGNVTF 230 240 250 260 270 300 310 320 330 340 350 pF1KB8 NQLTTIHMHSHSTYMQARGHIVNFITTTTSQYHIISPLQNGYFGLTVEPSAVPTRYPLVS .. : .:.:.. : . ... . ... ..: . :..: . :.. : CCDS83 TS-------SATTWMEGTGILSSLVYSDNGS--VMSLYNICYYALLASLSVMSPNEPSDD 280 290 300 310 320 330 360 370 380 390 400 pF1KB8 VNEAPYRNMLPAGNPWLQMPTIADRSAAPHSRLALQPSPLDKYHPNYN CCDS83 EEE 401 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 16:54:22 2016 done: Fri Nov 4 16:54:22 2016 Total Scan time: 2.490 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]