Result of FASTA (ccds) for pF1KB7749
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7749, 443 aa
  1>>>pF1KB7749 443 - 443 aa - 443 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 13.9168+/-0.00117; mu= -16.4737+/- 0.071
 mean_var=687.4279+/-141.307, 0's: 0 Z-trim(118.1): 102  B-trim: 259 in 2/53
 Lambda= 0.048917
 statistics sampled from 18885 (18988) to 18885 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.583), width:  16
 Scan time:  2.910

The best scores are:                                      opt bits E(32554)
CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7           ( 443) 3151 236.7 3.5e-62
CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17         ( 431)  959 82.0 1.3e-15
CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2           ( 432)  883 76.7 5.2e-14
CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17         ( 358)  746 66.9 3.7e-11


>>CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7                (443 aa)
 initn: 3151 init1: 3151 opt: 3151  Z-score: 1229.9  bits: 236.7 E(32554): 3.5e-62
Smith-Waterman score: 3151; 100.0% identity (100.0% similar) in 443 aa overlap (1-443:1-443)

               10        20        30        40        50        60
pF1KB7 MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSSAGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSSAGG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 HPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 HPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 PPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 GDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLTERQIKIWFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLTERQIKIWFQ
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 NRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVPYEPQSPPPFSKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVPYEPQSPPPFSKP
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 PQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHGLQGNGSYGTPHI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHGLQGNGSYGTPHI
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 QGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGHHHGPGPGEPHPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGHHHGPGPGEPHPT
              370       380       390       400       410       420

              430       440   
pF1KB7 YTDLTGHHPSQGRIQEAPKLTHL
       :::::::::::::::::::::::
CCDS54 YTDLTGHHPSQGRIQEAPKLTHL
              430       440   

>>CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17              (431 aa)
 initn: 831 init1: 605 opt: 959  Z-score: 394.1  bits: 82.0 E(32554): 1.3e-15
Smith-Waterman score: 1415; 50.4% identity (70.6% similar) in 476 aa overlap (1-443:1-431)

               10           20        30        40        50       
pF1KB7 MQKATYYDSSA--IYGGYP-YQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSS
       ::::::::..:  ..:::  : ..:::....  :: : .::   .:.:.: :::::: ..
CCDS11 MQKATYYDNAAAALFGGYSSYPGSNGFGFDVPPQP-PFQAATHLEGDYQRSACSLQSLGN
               10        20        30         40        50         

        60        70        80        90       100       110       
pF1KB7 AGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPA
       :. : :..::. .:.:         :.:.      : : .:::.  ::. ::   . . .
CCDS11 AAPHAKSKELNGSCMR---------PGLA------PEPLSAPPGSPPPSAAPTSATSNSS
      60        70                       80        90       100    

       120       130       140       150       160       170       
pF1KB7 APPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGE
           ::...::. .            :  :: :..::::::::::::..: :..: ...:
CCDS11 NGGGPSKSGPPKCG------------PGTNS-TLTKQIFPWMKESRQTSKLKNNSPGTAE
          110                   120        130       140       150 

       180                              190       200       210    
pF1KB7 SCAG-----------------------DKSPPGQASSKRARTAYTSAQLVELEKEFHFNR
       .:.:                       ::::::.:.::::::::::::::::::::::::
CCDS11 GCGGGGGGGGGGGSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNR
             160       170       180       190       200       210 

          220       230       240       250       260       270    
pF1KB7 YLCRPRRVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPP--G
       ::::::::::::::::.::::::::::::::::::::.::. .:::: ::. ::  :  .
CCDS11 YLCRPRRVEMANLLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQS
             220       230       240       250       260       270 

            280       290       300       310       320       330  
pF1KB7 AGGYLNSMHSLVNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAA
       ..:..:..::.. :  ::  ::: :.:  :..:.:: ..:   : .:. :    ..:  .
CCDS11 TAGFMNALHSMTPS--YESPSPPAFGKAHQNAYALP-SNYQPPLKGCGAP----QKYPPT
             280         290       300        310           320    

            340       350        360       370         380         
pF1KB7 GAGAGGTPDYDPHAHGLQGNG-SYGTPHIQGSPVFVGGS-YVEPMSN-SGPALFGLTHLP
        :     :.:.::.  ::.:: .:::: .:::::.:::. :..:.   .::.:.::.:: 
CCDS11 PA-----PEYEPHV--LQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLS
               330         340       350       360       370       

     390       400       410       420         430       440   
pF1KB7 HAASGAMDYGGAGPLGSGHHHGPGPGEPHPTYTDLTGHH--PSQGRIQEAPKLTHL
       :  :: .::.:: :.. ..::::   :::::::::..::  : :::::::::::::
CCDS11 HHPSGNLDYNGAPPMAPSQHHGPC--EPHPTYTDLSSHHAPPPQGRIQEAPKLTHL
       380       390       400         410       420       430 

>>CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2                (432 aa)
 initn: 989 init1: 570 opt: 883  Z-score: 365.1  bits: 76.7 E(32554): 5.2e-14
Smith-Waterman score: 1401; 52.0% identity (72.4% similar) in 450 aa overlap (1-443:17-432)

                               10         20        30        40   
pF1KB7                 MQKATYYDSSAIYGGYPY-QAANGFAYNANQQPYPASAALGA-D
                       ::::.::.. ...::: : .... ..:.. .::::  :: .. :
CCDS22 MLFEQGQQALELPECTMQKAAYYENPGLFGGYGYSKTTDTYGYSTPHQPYPPPAAASSLD
               10        20        30        40        50        60

             50          60        70        80        90       100
pF1KB7 GEYHRPACSLQS--PSSAGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPP
        .:   :::.::  :  : .: :. ::. .:.:     :.   : :      ::   .  
CCDS22 TDYPGSACSIQSSAPLRAPAH-KGAELNGSCMR-----PGTGNSQGGGGGSQPPGLNSE-
               70        80         90            100       110    

              110       120       130       140       150       160
pF1KB7 APQPPQPAPQPPAPTPAAPPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMK
         ::::: : ::.  :..:  :... : .. ...:   ::..:    : :..::::::::
CCDS22 -QQPPQPPPPPPTLPPSSPTNPGGGVPAKKPKGGP---NASSS----SATISKQIFPWMK
            120       130       140          150           160     

              170       180       190       200       210       220
pF1KB7 ESRQNTKQKTSSSSSGESCAGDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPR
       :::::.:::.: ...::::  :::::: :: ::.::::::::::::::::::::::::::
CCDS22 ESRQNSKQKNSCATAGESCE-DKSPPGPAS-KRVRTAYTSAQLVELEKEFHFNRYLCRPR
         170       180        190        200       210       220   

              230       240       250       260       270       280
pF1KB7 RVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSM
       :::::::::::::::::::::::::::::::.::.: : ..::: :::   ::.:..   
CCDS22 RVEMANLLNLTERQIKIWFQNRRMKYKKDQKAKGILHSPASQSPERSPPLGGAAGHVAYS
           230       240       250       260       270       280   

                290       300       310       320       330        
pF1KB7 HSL--VNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGG
        .:  : .. :.  ::: :.:   . :::  :.: : : :: :    ::::.:       
CCDS22 GQLPPVPGLAYDAPSPPAFAKSQPNMYGL--AAYTAPLSSCLP---QQKRYAA-------
           290       300       310         320          330        

      340       350       360       370        380       390       
pF1KB7 TPDYDPHAHGLQGNGSYGTPHIQGSPVFVGGSYVEPMS-NSGPALFGLTHLPHAASGAMD
        :...::  . .:.: ... ..:::::.:::..:: :.  ::: .:.: :: : .:...:
CCDS22 -PEFEPHPMASNGGG-FASANLQGSPVYVGGNFVESMAPASGP-VFNLGHLSHPSSASVD
              340        350       360       370        380        

       400       410       420       430       440   
pF1KB7 YGGAGPLGSGHHHGPGPGEPHPTYTDLTGHHPSQGRIQEAPKLTHL
       :. :. . ..:::::   .::::::::..:: ::::. ::::::::
CCDS22 YSCAAQIPGNHHHGPC--DPHPTYTDLSAHHSSQGRLPEAPKLTHL
      390       400         410       420       430  

>>CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17              (358 aa)
 initn: 831 init1: 605 opt: 746  Z-score: 313.8  bits: 66.9 E(32554): 3.7e-11
Smith-Waterman score: 1229; 53.9% identity (74.3% similar) in 373 aa overlap (105-443:3-358)

           80        90       100       110       120          130 
pF1KB7 LSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPPP---PSSASPPQNA
                                     :  ::.: .  :..:::   :.::.  .. 
CCDS82                             MRPGLAPEPLSAPPGSPPPSAAPTSATSNSSN
                                           10        20        30  

             140        150       160       170       180          
pF1KB7 SNNPTPANAAK-SPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCAG---------
       ...:. ..  : .:  :: :..::::::::::::..: :..: ...:.:.:         
CCDS82 GGGPSKSGPPKCGPGTNS-TLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGGGG
             40        50         60        70        80        90 

                           190       200       210       220       
pF1KB7 --------------DKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANL
                     ::::::.:.:::::::::::::::::::::::::::::::::::::
CCDS82 SGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANL
             100       110       120       130       140       150 

       230       240       250       260       270         280     
pF1KB7 LNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPP--GAGGYLNSMHSLVN
       :::.::::::::::::::::::::.::. .:::: ::. ::  :  ...:..:..::.. 
CCDS82 LNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSMTP
             160       170       180       190       200       210 

         290       300       310       320       330       340     
pF1KB7 SVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPH
       :  ::  ::: :.:  :..:.:: ..:   : .:. :    ..:  . :     :.:.::
CCDS82 S--YESPSPPAFGKAHQNAYALP-SNYQPPLKGCGAP----QKYPPTPA-----PEYEPH
               220       230        240           250              

         350        360       370         380       390       400  
pF1KB7 AHGLQGNG-SYGTPHIQGSPVFVGGS-YVEPMSN-SGPALFGLTHLPHAASGAMDYGGAG
       .  ::.:: .:::: .:::::.:::. :..:.   .::.:.::.:: :  :: .::.:: 
CCDS82 V--LQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPSGNLDYNGAP
     260         270       280       290       300       310       

            410       420         430       440   
pF1KB7 PLGSGHHHGPGPGEPHPTYTDLTGHH--PSQGRIQEAPKLTHL
       :.. ..::::   :::::::::..::  : :::::::::::::
CCDS82 PMAPSQHHGPC--EPHPTYTDLSSHHAPPPQGRIQEAPKLTHL
       320         330       340       350        




443 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 09:22:49 2016 done: Fri Nov  4 09:22:49 2016
 Total Scan time:  2.910 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com