Result of FASTA (ccds) for pF1KB5555
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5555, 452 aa
  1>>>pF1KB5555 452 - 452 aa - 452 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.0999+/-0.000743; mu= 15.0489+/- 0.045
 mean_var=97.7054+/-19.685, 0's: 0 Z-trim(111.0): 73  B-trim: 0 in 0/52
 Lambda= 0.129752
 statistics sampled from 11993 (12066) to 11993 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.722), E-opt: 0.2 (0.371), width:  16
 Scan time:  2.750

The best scores are:                                      opt bits E(32554)
CCDS44768.1 FLI1 gene_id:2313|Hs108|chr11          ( 452) 3132 596.4 1.9e-170
CCDS53725.1 FLI1 gene_id:2313|Hs108|chr11          ( 419) 2923 557.2 1.1e-158
CCDS59231.1 FLI1 gene_id:2313|Hs108|chr11          ( 386) 2610 498.6 4.5e-141
CCDS82674.1 ERG gene_id:2078|Hs108|chr21           ( 455) 2057 395.2 7.4e-110
CCDS13657.1 ERG gene_id:2078|Hs108|chr21           ( 462) 2047 393.3 2.7e-109
CCDS59230.1 FLI1 gene_id:2313|Hs108|chr11          ( 259) 1776 342.4 3.2e-94
CCDS58789.1 ERG gene_id:2078|Hs108|chr21           ( 363) 1731 334.1 1.5e-91
CCDS46649.1 ERG gene_id:2078|Hs108|chr21           ( 387) 1274 248.5 8.6e-66
CCDS13658.1 ERG gene_id:2078|Hs108|chr21           ( 479) 1274 248.6   1e-65
CCDS46648.1 ERG gene_id:2078|Hs108|chr21           ( 486) 1274 248.6   1e-65
CCDS2428.1 FEV gene_id:54738|Hs108|chr2            ( 238)  623 126.5 2.8e-29
CCDS53724.1 ETS1 gene_id:2113|Hs108|chr11          ( 225)  465 96.9 2.2e-20
CCDS8475.1 ETS1 gene_id:2113|Hs108|chr11           ( 441)  465 97.1 3.7e-20
CCDS44767.1 ETS1 gene_id:2113|Hs108|chr11          ( 485)  465 97.2   4e-20
CCDS13659.1 ETS2 gene_id:2114|Hs108|chr21          ( 469)  461 96.4 6.5e-20
CCDS81648.1 ETS1 gene_id:2113|Hs108|chr11          ( 354)  425 89.6 5.6e-18
CCDS12600.1 ERF gene_id:2077|Hs108|chr19           ( 548)  410 86.9 5.5e-17
CCDS44250.1 ETV3 gene_id:2117|Hs108|chr1           ( 512)  401 85.2 1.7e-16
CCDS13575.1 GABPA gene_id:2551|Hs108|chr21         ( 454)  400 85.0 1.7e-16
CCDS74341.1 ETV2 gene_id:2116|Hs108|chr19          ( 249)  395 83.9 2.1e-16
CCDS32995.2 ETV2 gene_id:2116|Hs108|chr19          ( 342)  395 84.0 2.7e-16
CCDS1164.1 ETV3 gene_id:2117|Hs108|chr1            ( 143)  389 82.6 2.9e-16
CCDS33906.1 ETV5 gene_id:2119|Hs108|chr3           ( 510)  393 83.7 4.7e-16
CCDS77281.1 ETV2 gene_id:2116|Hs108|chr19          ( 155)  383 81.5 6.8e-16
CCDS30893.1 ETV3L gene_id:440695|Hs108|chr1        ( 361)  384 81.9 1.2e-15
CCDS55084.1 ETV1 gene_id:2115|Hs108|chr7           ( 374)  384 81.9 1.2e-15
CCDS55083.1 ETV1 gene_id:2115|Hs108|chr7           ( 419)  384 82.0 1.3e-15
CCDS55085.1 ETV1 gene_id:2115|Hs108|chr7           ( 437)  384 82.0 1.3e-15
CCDS55087.1 ETV1 gene_id:2115|Hs108|chr7           ( 454)  384 82.0 1.4e-15
CCDS55086.1 ETV1 gene_id:2115|Hs108|chr7           ( 459)  384 82.0 1.4e-15
CCDS55088.1 ETV1 gene_id:2115|Hs108|chr7           ( 477)  384 82.0 1.4e-15
CCDS59292.1 ETV4 gene_id:2118|Hs108|chr17          ( 207)  365 78.2 8.8e-15
CCDS58553.1 ETV4 gene_id:2118|Hs108|chr17          ( 445)  365 78.4 1.6e-14
CCDS11465.1 ETV4 gene_id:2118|Hs108|chr17          ( 484)  365 78.5 1.7e-14
CCDS9060.1 ELK3 gene_id:2004|Hs108|chr12           ( 407)  360 77.5 2.9e-14
CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX           (  95)  351 75.3 2.9e-14
CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX           ( 428)  359 77.3 3.4e-14
CCDS1457.1 ELK4 gene_id:2005|Hs108|chr1            ( 405)  344 74.5 2.3e-13
CCDS1456.1 ELK4 gene_id:2005|Hs108|chr1            ( 431)  344 74.5 2.4e-13


>>CCDS44768.1 FLI1 gene_id:2313|Hs108|chr11               (452 aa)
 initn: 3132 init1: 3132 opt: 3132  Z-score: 3173.3  bits: 596.4 E(32554): 1.9e-170
Smith-Waterman score: 3132; 100.0% identity (100.0% similar) in 452 aa overlap (1-452:1-452)

               10        20        30        40        50        60
pF1KB5 MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPPQQEWINQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPPQQEWINQ
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 PVRVNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PVRVNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 ERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 LYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSVRRGAWGNNMNSGLNK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSVRRGAWGNNMNSGLNK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 SPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCI
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 TWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKF
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 DFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 DFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAAS
              370       380       390       400       410       420

              430       440       450  
pF1KB5 QYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
       ::::::::::::::::::::::::::::::::
CCDS44 QYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
              430       440       450  

>>CCDS53725.1 FLI1 gene_id:2313|Hs108|chr11               (419 aa)
 initn: 2923 init1: 2923 opt: 2923  Z-score: 2962.4  bits: 557.2 E(32554): 1.1e-158
Smith-Waterman score: 2923; 100.0% identity (100.0% similar) in 419 aa overlap (34-452:1-419)

            10        20        30        40        50        60   
pF1KB5 TIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPPQQEWINQPVR
                                     ::::::::::::::::::::::::::::::
CCDS53                               MTASGSPDYGQPHKINPLPPQQEWINQPVR
                                             10        20        30

            70        80        90       100       110       120   
pF1KB5 VNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTNERR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 VNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTNERR
               40        50        60        70        80        90

           130       140       150       160       170       180   
pF1KB5 VIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATTLYN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 VIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATTLYN
              100       110       120       130       140       150

           190       200       210       220       230       240   
pF1KB5 TEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSVRRGAWGNNMNSGLNKSPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 TEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSVRRGAWGNNMNSGLNKSPP
              160       170       180       190       200       210

           250       260       270       280       290       300   
pF1KB5 LGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCITWE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 LGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCITWE
              220       230       240       250       260       270

           310       320       330       340       350       360   
pF1KB5 GTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 GTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFH
              280       290       300       310       320       330

           370       380       390       400       410       420   
pF1KB5 GIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAASQYW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 GIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAASQYW
              340       350       360       370       380       390

           430       440       450  
pF1KB5 TSPTGGIYPNPNVPRHPNTHVPSHLGSYY
       :::::::::::::::::::::::::::::
CCDS53 TSPTGGIYPNPNVPRHPNTHVPSHLGSYY
              400       410         

>>CCDS59231.1 FLI1 gene_id:2313|Hs108|chr11               (386 aa)
 initn: 2610 init1: 2610 opt: 2610  Z-score: 2646.2  bits: 498.6 E(32554): 4.5e-141
Smith-Waterman score: 2610; 99.7% identity (100.0% similar) in 377 aa overlap (76-452:10-386)

          50        60        70        80        90       100     
pF1KB5 HKINPLPPQQEWINQPVRVNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSY
                                     .:::::::::::::::::::::::::::::
CCDS59                      MEGGLAGERARESPVDCSVSKCSKLVGGGESNPMNYNSY
                                    10        20        30         

         110       120       130       140       150       160     
pF1KB5 MDEKNGPPPPNMTTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MDEKNGPPPPNMTTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGK
      40        50        60        70        80        90         

         170       180       190       200       210       220     
pF1KB5 ELCKMNKEDFLRATTLYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ELCKMNKEDFLRATTLYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSV
     100       110       120       130       140       150         

         230       240       250       260       270       280     
pF1KB5 RRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 RRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQ
     160       170       180       190       200       210         

         290       300       310       320       330       340     
pF1KB5 FLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 FLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDK
     220       230       240       250       260       270         

         350       360       370       380       390       400     
pF1KB5 NIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 NIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHP
     280       290       300       310       320       330         

         410       420       430       440       450  
pF1KB5 SSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
     340       350       360       370       380      

>>CCDS82674.1 ERG gene_id:2078|Hs108|chr21                (455 aa)
 initn: 1951 init1: 1172 opt: 2057  Z-score: 2085.8  bits: 395.2 E(32554): 7.4e-110
Smith-Waterman score: 2057; 67.0% identity (84.3% similar) in 460 aa overlap (1-452:1-455)

               10        20        30        40        50        60
pF1KB5 MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPPQQEWINQ
       : .::::::::::.:::::. :::.  :: :..::::.: ::::  :..:  :::.:..:
CCDS82 MASTIKEALSVVSEDQSLFECAYGTP-HLAKTEMTASSSSDYGQTSKMSPRVPQQDWLSQ
               10        20         30        40        50         

                70          80        90       100       110       
pF1KB5 P-VRVNVKREYD--HMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNM
       : .::..: : .  ..::::.:: .:::.: .:.::. ..  :::.:::.::. ::: ::
CCDS82 PPARVTIKMECNPSQVNGSRNSPDECSVAKGGKMVGSPDTVGMNYGSYMEEKHMPPP-NM
      60        70        80        90       100       110         

       120       130       140       150       160       170       
pF1KB5 TTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLR
       ::::::::::::::::. .:::::::::.:::.: ...  .:::.::::::::.:.:: :
CCDS82 TTNERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKMTKDDFQR
      120       130       140       150       160       170        

       180       190         200       210          220       230  
pF1KB5 ATTLYNTEVLLSHLSYLRESSL--LAYNTTSHTDQSS-RL--SVKEDPSYDSVRRGAWGN
        :  ::...::::: ::::. :  :. . .... :.: ::  . . :  :.  ::.:: .
CCDS82 LTPSYNADILLSHLHYLRETPLPHLTSDDVDKALQNSPRLMHARNTDLPYEPPRRSAWTG
      180       190       200       210       220       230        

            240       250       260       270       280       290  
pF1KB5 NMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLS
       . .   ...    . .:. :. .:::: ::::::::::::::::::::::::::::::::
CCDS82 HGHPTPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLS
      240       250       260       270       280       290        

            300       310       320       330       340       350  
pF1KB5 DSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVH
       ::.:.:::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 DSSNSSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVH
      300       310       320       330       340       350        

            360       370       380       390       400       410  
pF1KB5 GKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTS
       ::::::::::::::::::::: :::.::::::. :: ::::: ::.::: ::: ..::::
CCDS82 GKRYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYMGSYHAHPQKMNFVAPHPPALPVTS
      360       370       380       390       400       410        

            420       430       440       450  
pF1KB5 SSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
       ::::.: . ::.:::::::::    : :..:.:::::.::
CCDS82 SSFFAAPNPYWNSPTGGIYPNT---RLPTSHMPSHLGTYY
      420       430       440          450     

>>CCDS13657.1 ERG gene_id:2078|Hs108|chr21                (462 aa)
 initn: 1951 init1: 1172 opt: 2047  Z-score: 2075.5  bits: 393.3 E(32554): 2.7e-109
Smith-Waterman score: 2047; 67.0% identity (83.8% similar) in 457 aa overlap (5-452:12-462)

                      10        20        30        40        50   
pF1KB5        MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPP
                  :::::::::.:::::. :::.  :: :..::::.: ::::  :..:  :
CCDS13 MIQTVPDPAAHIKEALSVVSEDQSLFECAYGTP-HLAKTEMTASSSSDYGQTSKMSPRVP
               10        20        30         40        50         

            60         70          80        90       100       110
pF1KB5 QQEWINQP-VRVNVKREYD--HMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKN
       ::.:..:: .::..: : .  ..::::.:: .:::.: .:.::. ..  :::.:::.::.
CCDS13 QQDWLSQPPARVTIKMECNPSQVNGSRNSPDECSVAKGGKMVGSPDTVGMNYGSYMEEKH
      60        70        80        90       100       110         

              120       130       140       150       160       170
pF1KB5 GPPPPNMTTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKM
        ::: ::::::::::::::::::. .:::::::::.:::.: ...  .:::.::::::::
CCDS13 MPPP-NMTTNERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKM
     120        130       140       150       160       170        

              180       190       200       210             220    
pF1KB5 NKEDFLRATTLYNTEVLLSHLSYLRESSLLAYNTTSHTDQ----SSRL--SVKEDPSYDS
       .:.:: : :  ::...::::: ::::. :  . :.. .:.    : ::  . . :  :. 
CCDS13 TKDDFQRLTPSYNADILLSHLHYLRETPL-PHLTSDDVDKALQNSPRLMHARNTDLPYEP
      180       190       200        210       220       230       

          230       240       250       260       270       280    
pF1KB5 VRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLW
        ::.:: .. .   ...    . .:. :. .:::: ::::::::::::::::::::::::
CCDS13 PRRSAWTGHGHPTPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLW
       240       250       260       270       280       290       

          290       300       310       320       330       340    
pF1KB5 QFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYD
       ::::::::::.:.:::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 QFLLELLSDSSNSSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYD
       300       310       320       330       340       350       

          350       360       370       380       390       400    
pF1KB5 KNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPH
       ::::::::::::::::::::::::::::: :::.::::::. :: ::::: ::.::: ::
CCDS13 KNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYMGSYHAHPQKMNFVAPH
       360       370       380       390       400       410       

          410       420       430       440       450  
pF1KB5 PSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
       : ..::::::::.: . ::.:::::::::    : :..:.:::::.::
CCDS13 PPALPVTSSSFFAAPNPYWNSPTGGIYPNT---RLPTSHMPSHLGTYY
       420       430       440          450       460  

>>CCDS59230.1 FLI1 gene_id:2313|Hs108|chr11               (259 aa)
 initn: 1776 init1: 1776 opt: 1776  Z-score: 1805.0  bits: 342.4 E(32554): 3.2e-94
Smith-Waterman score: 1776; 100.0% identity (100.0% similar) in 255 aa overlap (198-452:5-259)

       170       180       190       200       210       220       
pF1KB5 CKMNKEDFLRATTLYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSVRR
                                     ::::::::::::::::::::::::::::::
CCDS59                           MDPGSLLAYNTTSHTDQSSRLSVKEDPSYDSVRR
                                         10        20        30    

       230       240       250       260       270       280       
pF1KB5 GAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 GAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFL
           40        50        60        70        80        90    

       290       300       310       320       330       340       
pF1KB5 LELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 LELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNI
          100       110       120       130       140       150    

       350       360       370       380       390       400       
pF1KB5 MTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSS
          160       170       180       190       200       210    

       410       420       430       440       450  
pF1KB5 MPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
       :::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
          220       230       240       250         

>>CCDS58789.1 ERG gene_id:2078|Hs108|chr21                (363 aa)
 initn: 1739 init1: 1172 opt: 1731  Z-score: 1757.3  bits: 334.1 E(32554): 1.5e-91
Smith-Waterman score: 1731; 69.0% identity (84.2% similar) in 368 aa overlap (91-452:1-363)

               70        80        90       100       110       120
pF1KB5 PVRVNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTN
                                     .::. ..  :::.:::.::. ::: :::::
CCDS58                               MVGSPDTVGMNYGSYMEEKHMPPP-NMTTN
                                             10        20          

              130       140       150       160       170       180
pF1KB5 ERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATT
       :::::::::::::. .:::::::::.:::.: ...  .:::.::::::::.:.:: : : 
CCDS58 ERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKMTKDDFQRLTP
      30        40        50        60        70        80         

              190       200       210             220       230    
pF1KB5 LYNTEVLLSHLSYLRESSLLAYNTTSHTDQ----SSRL--SVKEDPSYDSVRRGAWGNNM
        ::...::::: ::::. :  . :.. .:.    : ::  . . :  :.  ::.:: .. 
CCDS58 SYNADILLSHLHYLRETPL-PHLTSDDVDKALQNSPRLMHARNTDLPYEPPRRSAWTGHG
      90       100        110       120       130       140        

          240       250       260       270       280       290    
pF1KB5 NSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDS
       .   ...    . .:. :. .:::: ::::::::::::::::::::::::::::::::::
CCDS58 HPTPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDS
      150       160       170       180       190       200        

          300       310       320       330       340       350    
pF1KB5 ANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGK
       .:.:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 SNSSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGK
      210       220       230       240       250       260        

          360       370       380       390       400       410    
pF1KB5 RYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSS
       ::::::::::::::::::: :::.::::::. :: ::::: ::.::: ::: ..::::::
CCDS58 RYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYMGSYHAHPQKMNFVAPHPPALPVTSSS
      270       280       290       300       310       320        

          420       430       440       450  
pF1KB5 FFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY
       ::.: . ::.:::::::::    : :..:.:::::.::
CCDS58 FFAAPNPYWNSPTGGIYPNT---RLPTSHMPSHLGTYY
      330       340          350       360   

>>CCDS46649.1 ERG gene_id:2078|Hs108|chr21                (387 aa)
 initn: 1682 init1: 1210 opt: 1274  Z-score: 1294.6  bits: 248.5 E(32554): 8.6e-66
Smith-Waterman score: 1707; 64.7% identity (79.8% similar) in 391 aa overlap (91-452:1-387)

               70        80        90       100       110       120
pF1KB5 PVRVNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTN
                                     .::. ..  :::.:::.::. ::: :::::
CCDS46                               MVGSPDTVGMNYGSYMEEKHMPPP-NMTTN
                                             10        20          

              130       140       150       160       170       180
pF1KB5 ERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATT
       :::::::::::::. .:::::::::.:::.: ...  .:::.::::::::.:.:: : : 
CCDS46 ERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKMTKDDFQRLTP
      30        40        50        60        70        80         

              190                                 200          210 
pF1KB5 LYNTEVLLSHLSYLRESSL--------------------------LAY---NTTSHTDQS
        ::...::::: ::::. :                           :.   ::. . . .
CCDS46 SYNADILLSHLHYLRETPLPHLTSDDVDKALQNSPRLMHARNTGGAAFIFPNTSVYPEAT
      90       100       110       120       130       140         

             220       230       240       250       260       270 
pF1KB5 SRLSVKEDPSYDSVRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSS
       .:.... :  :.  ::.:: .. .   ...    . .:. :. .:::: :::::::::::
CCDS46 QRITTRPDLPYEPPRRSAWTGHGHPTPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSS
     150       160       170       180       190       200         

             280       290       300       310       320       330 
pF1KB5 RLANPGSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMN
       :::::::::::::::::::::::.:.::::::::::::::::::::::::::::::::::
CCDS46 RLANPGSGQIQLWQFLLELLSDSSNSSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMN
     210       220       230       240       250       260         

             340       350       360       370       380       390 
pF1KB5 YDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSY
       :::::::::::::::::::::::::::::::::::::::::: :::.::::::. :: ::
CCDS46 YDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYMGSY
     270       280       290       300       310       320         

             400       410       420       430       440       450 
pF1KB5 HAHQQKVNFVPPHPSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSY
       ::: ::.::: ::: ..::::::::.: . ::.:::::::::    : :..:.:::::.:
CCDS46 HAHPQKMNFVAPHPPALPVTSSSFFAAPNPYWNSPTGGIYPNT---RLPTSHMPSHLGTY
     330       340       350       360       370          380      

        
pF1KB5 Y
       :
CCDS46 Y
        

>>CCDS13658.1 ERG gene_id:2078|Hs108|chr21                (479 aa)
 initn: 1819 init1: 1210 opt: 1274  Z-score: 1293.3  bits: 248.6 E(32554): 1e-65
Smith-Waterman score: 2033; 63.4% identity (80.2% similar) in 484 aa overlap (1-452:1-479)

               10        20        30        40        50        60
pF1KB5 MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPPQQEWINQ
       : .::::::::::.:::::. :::.  :: :..::::.: ::::  :..:  :::.:..:
CCDS13 MASTIKEALSVVSEDQSLFECAYGTP-HLAKTEMTASSSSDYGQTSKMSPRVPQQDWLSQ
               10        20         30        40        50         

                70          80        90       100       110       
pF1KB5 P-VRVNVKREYD--HMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNM
       : .::..: : .  ..::::.:: .:::.: .:.::. ..  :::.:::.::. ::: ::
CCDS13 PPARVTIKMECNPSQVNGSRNSPDECSVAKGGKMVGSPDTVGMNYGSYMEEKHMPPP-NM
      60        70        80        90       100       110         

       120       130       140       150       160       170       
pF1KB5 TTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLR
       ::::::::::::::::. .:::::::::.:::.: ...  .:::.::::::::.:.:: :
CCDS13 TTNERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKMTKDDFQR
      120       130       140       150       160       170        

       180       190                                 200           
pF1KB5 ATTLYNTEVLLSHLSYLRESSL--------------------------LAY---NTTSHT
        :  ::...::::: ::::. :                           :.   ::. . 
CCDS13 LTPSYNADILLSHLHYLRETPLPHLTSDDVDKALQNSPRLMHARNTGGAAFIFPNTSVYP
      180       190       200       210       220       230        

      210       220       230       240       250       260        
pF1KB5 DQSSRLSVKEDPSYDSVRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGP
       . ..:.... :  :.  ::.:: .. .   ...    . .:. :. .:::: ::::::::
CCDS13 EATQRITTRPDLPYEPPRRSAWTGHGHPTPQSKAAQPSPSTVPKTEDQRPQLDPYQILGP
      240       250       260       270       280       290        

      270       280       290       300       310       320        
pF1KB5 TSSRLANPGSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKP
       ::::::::::::::::::::::::::.:.:::::::::::::::::::::::::::::::
CCDS13 TSSRLANPGSGQIQLWQFLLELLSDSSNSSCITWEGTNGEFKMTDPDEVARRWGERKSKP
      300       310       320       330       340       350        

      330       340       350       360       370       380        
pF1KB5 NMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYM
       ::::::::::::::::::::::::::::::::::::::::::::: :::.::::::. ::
CCDS13 NMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYM
      360       370       380       390       400       410        

      390       400       410       420       430       440        
pF1KB5 PSYHAHQQKVNFVPPHPSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHL
        ::::: ::.::: ::: ..::::::::.: . ::.:::::::::    : :..:.::::
CCDS13 GSYHAHPQKMNFVAPHPPALPVTSSSFFAAPNPYWNSPTGGIYPNT---RLPTSHMPSHL
      420       430       440       450       460          470     

      450  
pF1KB5 GSYY
       :.::
CCDS13 GTYY
           

>>CCDS46648.1 ERG gene_id:2078|Hs108|chr21                (486 aa)
 initn: 1819 init1: 1210 opt: 1274  Z-score: 1293.2  bits: 248.6 E(32554): 1e-65
Smith-Waterman score: 2023; 63.5% identity (80.2% similar) in 480 aa overlap (5-452:12-486)

                      10        20        30        40        50   
pF1KB5        MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPP
                  :::::::::.:::::. :::.  :: :..::::.: ::::  :..:  :
CCDS46 MIQTVPDPAAHIKEALSVVSEDQSLFECAYGTP-HLAKTEMTASSSSDYGQTSKMSPRVP
               10        20        30         40        50         

            60         70          80        90       100       110
pF1KB5 QQEWINQP-VRVNVKREYD--HMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKN
       ::.:..:: .::..: : .  ..::::.:: .:::.: .:.::. ..  :::.:::.::.
CCDS46 QQDWLSQPPARVTIKMECNPSQVNGSRNSPDECSVAKGGKMVGSPDTVGMNYGSYMEEKH
      60        70        80        90       100       110         

              120       130       140       150       160       170
pF1KB5 GPPPPNMTTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKM
        ::: ::::::::::::::::::. .:::::::::.:::.: ...  .:::.::::::::
CCDS46 MPPP-NMTTNERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKM
     120        130       140       150       160       170        

              180       190                                 200    
pF1KB5 NKEDFLRATTLYNTEVLLSHLSYLRESSL--------------------------LAY--
       .:.:: : :  ::...::::: ::::. :                           :.  
CCDS46 TKDDFQRLTPSYNADILLSHLHYLRETPLPHLTSDDVDKALQNSPRLMHARNTGGAAFIF
      180       190       200       210       220       230        

             210       220       230       240       250       260 
pF1KB5 -NTTSHTDQSSRLSVKEDPSYDSVRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPD
        ::. . . ..:.... :  :.  ::.:: .. .   ...    . .:. :. .:::: :
CCDS46 PNTSVYPEATQRITTRPDLPYEPPRRSAWTGHGHPTPQSKAAQPSPSTVPKTEDQRPQLD
      240       250       260       270       280       290        

             270       280       290       300       310       320 
pF1KB5 PYQILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRW
       :::::::::::::::::::::::::::::::::.:.::::::::::::::::::::::::
CCDS46 PYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSNSSCITWEGTNGEFKMTDPDEVARRW
      300       310       320       330       340       350        

             330       340       350       360       370       380 
pF1KB5 GERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKY
       :::::::::::::::::::::::::::::::::::::::::::::::::::: :::.:::
CCDS46 GERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLYKY
      360       370       380       390       400       410        

             390       400       410       420       430       440 
pF1KB5 PSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPN
       :::. :: ::::: ::.::: ::: ..::::::::.: . ::.:::::::::    : :.
CCDS46 PSDLPYMGSYHAHPQKMNFVAPHPPALPVTSSSFFAAPNPYWNSPTGGIYPNT---RLPT
      420       430       440       450       460       470        

             450  
pF1KB5 THVPSHLGSYY
       .:.:::::.::
CCDS46 SHMPSHLGTYY
         480      




452 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov  3 17:34:10 2016 done: Thu Nov  3 17:34:10 2016
 Total Scan time:  2.750 Total Display time:  0.060

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com