Result of FASTA (ccds) for pF1KB5459
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5459, 391 aa
  1>>>pF1KB5459 391 - 391 aa - 391 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.5865+/-0.000812; mu= 9.8949+/- 0.049
 mean_var=75.9360+/-15.064, 0's: 0 Z-trim(107.6): 19  B-trim: 0 in 0/51
 Lambda= 0.147181
 statistics sampled from 9692 (9703) to 9692 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.676), E-opt: 0.2 (0.298), width:  16
 Scan time:  2.910

The best scores are:                                      opt bits E(32554)
CCDS72876.1 TXNIP gene_id:10628|Hs108|chr1         ( 391) 2628 567.4  8e-162
CCDS81368.1 TXNIP gene_id:10628|Hs108|chr1         ( 336) 2097 454.6  6e-128
CCDS10377.1 ARRDC4 gene_id:91947|Hs108|chr15       ( 418) 1143 252.1 7.1e-67
CCDS34202.1 ARRDC3 gene_id:57561|Hs108|chr5        ( 414)  946 210.2 2.7e-54
CCDS12370.1 ARRDC2 gene_id:27106|Hs108|chr19       ( 407)  891 198.5 8.8e-51
CCDS32956.1 ARRDC2 gene_id:27106|Hs108|chr19       ( 402)  868 193.7 2.6e-49


>>CCDS72876.1 TXNIP gene_id:10628|Hs108|chr1              (391 aa)
 initn: 2628 init1: 2628 opt: 2628  Z-score: 3018.8  bits: 567.4 E(32554): 8e-162
Smith-Waterman score: 2628; 100.0% identity (100.0% similar) in 391 aa overlap (1-391:1-391)

               10        20        30        40        50        60
pF1KB5 MVMFKKIKSFEVVFNDPEKVYGSGEKVAGRVIVEVCEVTRVKAVRILACGVAKVLWMQGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 MVMFKKIKSFEVVFNDPEKVYGSGEKVAGRVIVEVCEVTRVKAVRILACGVAKVLWMQGS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 QQCKQTSEYLRYEDTLLLEDQPTGENEMVIMRPGNKYEYKFGFELPQGPLGTSFKGKYGC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 QQCKQTSEYLRYEDTLLLEDQPTGENEMVIMRPGNKYEYKFGFELPQGPLGTSFKGKYGC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 VDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPVSAKKEKKVSCMFIPDGRVSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 VDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPVSAKKEKKVSCMFIPDGRVSV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 SARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVARHTYLANGQTKVLTQKLSSVRGN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 SARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVARHTYLANGQTKVLTQKLSSVRGN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 HIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLLIYVSVPGSKKVILDLPLVIGSRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 HIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLLIYVSVPGSKKVILDLPLVIGSRS
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 GLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYMDVIPEDHRLESPTTPLLDDMDGSQD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 GLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYMDVIPEDHRLESPTTPLLDDMDGSQD
              310       320       330       340       350       360

              370       380       390 
pF1KB5 SPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ
       :::::::::::::::::::::::::::::::
CCDS72 SPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ
              370       380       390 

>>CCDS81368.1 TXNIP gene_id:10628|Hs108|chr1              (336 aa)
 initn: 2097 init1: 2097 opt: 2097  Z-score: 2410.6  bits: 454.6 E(32554): 6e-128
Smith-Waterman score: 2097; 99.7% identity (100.0% similar) in 310 aa overlap (82-391:27-336)

              60        70        80        90       100       110 
pF1KB5 AKVLWMQGSQQCKQTSEYLRYEDTLLLEDQPTGENEMVIMRPGNKYEYKFGFELPQGPLG
                                     :.::::::::::::::::::::::::::::
CCDS81     MPPKHSLSHRCILSVTASLMATRFSFPSGENEMVIMRPGNKYEYKFGFELPQGPLG
                   10        20        30        40        50      

             120       130       140       150       160       170 
pF1KB5 TSFKGKYGCVDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPVSAKKEKKVSCM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 TSFKGKYGCVDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPVSAKKEKKVSCM
         60        70        80        90       100       110      

             180       190       200       210       220       230 
pF1KB5 FIPDGRVSVSARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVARHTYLANGQTKVLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 FIPDGRVSVSARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVARHTYLANGQTKVLT
        120       130       140       150       160       170      

             240       250       260       270       280       290 
pF1KB5 QKLSSVRGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLLIYVSVPGSKKVILD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 QKLSSVRGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLLIYVSVPGSKKVILD
        180       190       200       210       220       230      

             300       310       320       330       340       350 
pF1KB5 LPLVIGSRSGLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYMDVIPEDHRLESPTTPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LPLVIGSRSGLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYMDVIPEDHRLESPTTPL
        240       250       260       270       280       290      

             360       370       380       390 
pF1KB5 LDDMDGSQDSPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ
       ::::::::::::::::::::::::::::::::::::::::
CCDS81 LDDMDGSQDSPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ
        300       310       320       330      

>>CCDS10377.1 ARRDC4 gene_id:91947|Hs108|chr15            (418 aa)
 initn: 986 init1: 717 opt: 1143  Z-score: 1314.1  bits: 252.1 E(32554): 7.1e-67
Smith-Waterman score: 1143; 45.4% identity (72.8% similar) in 394 aa overlap (6-383:16-403)

                         10         20        30        40         
pF1KB5           MVMFKKIKSFEVVFNDPEK-VYGSGEKVAGRVIVEVCEVTRVKAVRILAC
                      ..::. .::.: .:  :.::: :::.:..:. : . ..:.:. : 
CCDS10 MGGEAGCAAAVGAEGRVKSLGLVFEDERKGCYSSGETVAGHVLLEASEPVALRALRLEAQ
               10        20        30        40        50        60

      50        60                    70        80        90       
pF1KB5 GVAKVLWMQGSQQCKQTS------------EYLRYEDTLLLEDQPTGENEMVIMRPGNKY
       : : . :  : . : ..:            :::  .  : :.. :.::. .....:: :.
CCDS10 GRATAAW--GPSTCPRASASTAALAVFSEVEYLNVR--LSLREPPAGEG-IILLQPG-KH
                 70        80        90         100        110     

       100       110       120       130       140       150       
pF1KB5 EYKFGFELPQGPLGTSFKGKYGCVDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLM
       :. : :.::. :: ::: :::: ..: :.: :.::. : : .:....::. :::::: :.
CCDS10 EFPFRFQLPSEPLVTSFTGKYGSIQYCVRAVLERPKVPDQSVKRELQVVSHVDVNTPALL
          120       130       140       150       160       170    

       160       170       180       190       200       210       
pF1KB5 APVSAKKEKKVSCMFIPDGRVSVSARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVA
       .::   .:: :.: :. .: ::.::.:.:::.:.:. : :.:..::  ::..::::::  
CCDS10 TPVLKTQEKMVGCWFFTSGPVSLSAKIERKGYCNGEAIPIYAEIENCSSRLIVPKAAIFQ
          180       190       200       210       220       230    

       220       230       240       250       260       270       
pF1KB5 RHTYLANGQTKVLTQKLSSVRGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLL
        .::::.:.::.. . ...:::::: ::.  .: ::.:..  . :::: : :.::.::: 
CCDS10 TQTYLASGKTKTIRHMVANVRGNHIASGSTDTWNGKTLKIPPVTPSILDCCIIRVDYSLA
          240       250       260       270       280       290    

       280       290         300       310       320       330     
pF1KB5 IYVSVPGSKKVILDLPLVIGS--RSGLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYM
       .:. .::.::..:.::::::.   .:..::.::.::. : .:::. :..:. ::::: : 
CCDS10 VYIHIPGAKKLMLELPLVIGTIPYNGFGSRNSSIASQFSMDMSWLTLTLPEQPEAPPNYA
          300       310       320       330       340       350    

         340        350       360       370       380       390    
pF1KB5 DVIPEDHRLES-PTTPLLDDMDGSQDSPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ   
       ::. :..  .  :  :   . .:    :.:    ::.:.::: :.::::           
CCDS10 DVVSEEEFSRHIPPYPQPPNCEGEVCCPVFACIQEFRFQPPPLYSEVDPHPSDVEESQPV
          360       370       380       390       400       410    

CCDS10 SFIL
           

>>CCDS34202.1 ARRDC3 gene_id:57561|Hs108|chr5             (414 aa)
 initn: 1038 init1: 672 opt: 946  Z-score: 1088.1  bits: 210.2 E(32554): 2.7e-54
Smith-Waterman score: 1049; 42.1% identity (70.8% similar) in 401 aa overlap (2-383:1-399)

               10            20        30        40        50      
pF1KB5 MVMFKKIKSFEVVF---NDPE-KVYGSGEKVAGRVIVEVCEVTRVKAVRILACGVAKVLW
        ... :.::. . :   :: .  ::.::. :.::: .::    :::...: : : ::: :
CCDS34  MVLGKVKSLTISFDCLNDSNVPVYSSGDTVSGRVNLEVTGEIRVKSLKIHARGHAKVRW
                10        20        30        40        50         

             60             70          80        90       100     
pF1KB5 MQ----GS-----QQCKQTSEYLRYEDTLL--LEDQPTGENEMVIMRPGNKYEYKFGFEL
        .    ::     :.  .  ::. ..: :.   .:. ..:. .  .. : ..:: :.:::
CCDS34 TESRNAGSNTAYTQNYTEEVEYFNHKDILIGHERDDDNSEEGFHTIHSG-RHEYAFSFEL
      60        70        80        90       100        110        

         110       120       130       140       150       160     
pF1KB5 PQGPLGTSFKGKYGCVDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPVSAKKE
       :: ::.:::.:..: : ::::: : ::     . ::.: : . .:.:::.:..: .. ::
CCDS34 PQTPLATSFEGRHGSVRYWVKAELHRPWLLPVKLKKEFTVFEHIDINTPSLLSPQAGTKE
      120       130       140       150       160       170        

         170       180       190       200       210       220     
pF1KB5 KKVSCMFIPDGRVSVSARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVARHTYLANG
       : . : :  .: .:.::.:.:::.  :. :.: :..::  ::.:::::::   ... :.:
CCDS34 KTLCCWFCTSGPISLSAKIERKGYTPGESIQIFAEIENCSSRMVVPKAAIYQTQAFYAKG
      180       190       200       210       220       230        

         230       240       250       260       270       280     
pF1KB5 QTKVLTQKLSSVRGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLLIYVSVPGS
       . : . : ....::. . ::   .: :: :..  . :::: :.:.::::::..::..::.
CCDS34 KMKEVKQLVANLRGESLSSGKTETWNGKLLKIPPVSPSILDCSIIRVEYSLMVYVDIPGA
      240       250       260       270       280       290        

         290         300       310       320       330       340   
pF1KB5 KKVILDLPLVIGS--RSGLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYMDVIPEDHR
         ..:.::::::.     ..:::::..:. : .:.:..:..:. ::::: : .:. :..:
CCDS34 MDLFLNLPLVIGTIPLHPFGSRTSSVSSQCSMNMNWLSLSLPERPEAPPSYAEVVTEEQR
      300       310       320       330       340       350        

           350         360       370       380       390        
pF1KB5 LESPTTPL--LDDMDGSQDSPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ       
        ..  .:.   ::.. . ..:.: :  ::.:.::: :.:.::               
CCDS34 -RNNLAPVSACDDFERALQGPLFAYIQEFRFLPPPLYSEIDPNPDQSADDRPSCPSR
       360       370       380       390       400       410    

>>CCDS12370.1 ARRDC2 gene_id:27106|Hs108|chr19            (407 aa)
 initn: 808 init1: 529 opt: 891  Z-score: 1025.1  bits: 198.5 E(32554): 8.8e-51
Smith-Waterman score: 891; 38.8% identity (66.4% similar) in 399 aa overlap (2-383:1-393)

               10            20        30        40        50      
pF1KB5 MVMFKKIKSFEVVFNDP----EKVYGSGEKVAGRVIVEVCEVTRVKAVRILACGVAKVLW
        ..: :.:.: : ..      : :...:. :::::..:.  ..:: :.:. : : :.: :
CCDS12  MLFDKVKAFSVQLDGATAGVEPVFSGGQAVAGRVLLELSSAARVGALRLRARGRAHVHW
                10        20        30        40        50         

             60             70        80        90       100       
pF1KB5 MQ----GS-----QQCKQTSEYLRYEDTLLLEDQPTGENEMVIMRPGNKYEYKFGFELPQ
        .    ::     :. ..  : . .. :::  :  :::.  . . :: ..:. :.:.:: 
CCDS12 TESRSAGSSTAYTQSYSERVEVVSHRATLLAPD--TGET--TTLPPG-RHEFLFSFQLPP
      60        70        80        90           100        110    

       110       120       130       140       150       160       
pF1KB5 GPLGTSFKGKYGCVDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPVSAKKEKK
         : :::.::.: : : .:: : ::  :.....: : :.. ::.::: :.:: .. .:: 
CCDS12 -TLVTSFEGKHGSVRYCIKATLHRPWVPARRARKVFTVIEPVDINTPALLAPQAGAREKV
           120       130       140       150       160       170   

       170       180       190       200       210       220       
pF1KB5 VSCMFIPDGRVSVSARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVARHTYLANGQT
       .   .   : ::.::.:::::.  :. : . :...:  .: :.:.::.:  .:..: :  
CCDS12 ARSWYCNRGLVSLSAKIDRKGYTPGEVIPVFAEIDNGSTRPVLPRAAVVQTQTFMARGAR
           180       190       200       210       220       230   

       230       240       250       260       270       280       
pF1KB5 KVLTQKLSSVRGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLLIYVSVPGSKK
       :     ..:. :. .  :  : :.:..::.  . :::: : .:.:.:.: . :..::..:
CCDS12 KQKRAVVASLAGEPVGPGQRALWQGRALRIPPVGPSILHCRVLHVDYALKVCVDIPGTSK
           240       250       260       270       280       290   

       290         300       310       320       330         340   
pF1KB5 VILDLPLVIGS--RSGLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYMDVIP--EDHR
       ..:.::::::.     ..::.::..:..:  ..:    .:. ::::: : .:.   :.  
CCDS12 LLLELPLVIGTIPLHPFGSRSSSVGSHASFLLDWRLGALPERPEAPPEYSEVVADTEEAA
           300       310       320       330       340       350   

           350       360       370       380       390       
pF1KB5 LESPTTPLLDDMDGSQDSPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ      
       : .   :: .: : : ..:.: :  ::.. ::: :.: ::              
CCDS12 LGQSPFPLPQDPDMSLEGPFFAYIQEFRYRPPPLYSEEDPNPLLGDMRPRCMTC
           360       370       380       390       400       

>>CCDS32956.1 ARRDC2 gene_id:27106|Hs108|chr19            (402 aa)
 initn: 815 init1: 529 opt: 868  Z-score: 998.8  bits: 193.7 E(32554): 2.6e-49
Smith-Waterman score: 868; 38.5% identity (66.4% similar) in 390 aa overlap (7-383:6-388)

               10         20        30        40        50         
pF1KB5 MVMFKKIKSFEVVFN-DPEKVYGSGEKVAGRVIVEVCEVTRVKAVRILACGVAKVLWMQG
             ..:: . .   :  .: .::.. :::..:.    ::.:... : : : . :..:
CCDS32  MRSGGVRSFALELARGPGGAYRGGERLCGRVLLEAAAPLRVRALEVKARGGAATHWLEG
                10        20        30        40        50         

              60        70        80        90       100       110 
pF1KB5 --------SQQCKQTSEYLRYEDTLLLEDQPTGENEMVIMRPGNKYEYKFGFELPQGPLG
               :..   .  ::: .. :::.:  :::.  . . :: ..:. :.:.::   : 
CCDS32 RSVGVNAVSSDYAAAETYLRRRQ-LLLRD--TGET--TTLPPG-RHEFLFSFQLPP-TLV
      60        70        80           90          100        110  

             120       130       140       150       160       170 
pF1KB5 TSFKGKYGCVDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPVSAKKEKKVSCM
       :::.::.: : : .:: : ::  :.....: : :.. ::.::: :.:: .. .:: .   
CCDS32 TSFEGKHGSVRYCIKATLHRPWVPARRARKVFTVIEPVDINTPALLAPQAGAREKVARSW
            120       130       140       150       160       170  

             180       190       200       210       220       230 
pF1KB5 FIPDGRVSVSARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVARHTYLANGQTKVLT
       .   : ::.::.:::::.  :. : . :...:  .: :.:.::.:  .:..: :  :   
CCDS32 YCNRGLVSLSAKIDRKGYTPGEVIPVFAEIDNGSTRPVLPRAAVVQTQTFMARGARKQKR
            180       190       200       210       220       230  

             240       250       260       270       280       290 
pF1KB5 QKLSSVRGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSLLIYVSVPGSKKVILD
         ..:. :. .  :  : :.:..::.  . :::: : .:.:.:.: . :..::..:..:.
CCDS32 AVVASLAGEPVGPGQRALWQGRALRIPPVGPSILHCRVLHVDYALKVCVDIPGTSKLLLE
            240       250       260       270       280       290  

               300       310       320       330         340       
pF1KB5 LPLVIGS--RSGLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCYMDVIP--EDHRLESP
       ::::::.     ..::.::..:..:  ..:    .:. ::::: : .:.   :.  : . 
CCDS32 LPLVIGTIPLHPFGSRSSSVGSHASFLLDWRLGALPERPEAPPEYSEVVADTEEAALGQS
            300       310       320       330       340       350  

       350       360       370       380       390       
pF1KB5 TTPLLDDMDGSQDSPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ      
         :: .: : : ..:.: :  ::.. ::: :.: ::              
CCDS32 PFPLPQDPDMSLEGPFFAYIQEFRYRPPPLYSEEDPNPLLGDMRPRCMTC
            360       370       380       390       400  




391 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 21:40:11 2016 done: Fri Nov  4 21:40:12 2016
 Total Scan time:  2.910 Total Display time:  0.020

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com