FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8278, 326 aa
1>>>pF1KB8278 326 - 326 aa - 326 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.9024+/-0.000864; mu= 5.3444+/- 0.052
mean_var=158.0262+/-32.449, 0's: 0 Z-trim(112.0): 48 B-trim: 0 in 0/52
Lambda= 0.102026
statistics sampled from 12795 (12840) to 12795 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.747), E-opt: 0.2 (0.394), width: 16
Scan time: 2.530
The best scores are:                                   opt  bits E(32554)
CCDS7138.1 BMI1 gene_id:648|Hs108|chr10        ( 326) 2210 336.6 1.6e-92
CCDS59213.1 BMI1 gene_id:100532731|Hs108|chr10 ( 469) 2210 336.8 2.1e-92
CCDS32638.1 PCGF2 gene_id:7703|Hs108|chr17     ( 344) 1376 213.9 1.5e-55
CCDS1946.2 PCGF1 gene_id:84759|Hs108|chr2      ( 259)  571  95.3 5.5e-20
CCDS31275.1 PCGF6 gene_id:84108|Hs108|chr10    ( 350)  482  82.3 6.1e-16
CCDS7413.1 PCGF5 gene_id:84333|Hs108|chr10     ( 256)  474  81.1 1.1e-15
CCDS3339.2 PCGF3 gene_id:10336|Hs108|chr4      ( 242)  415  72.4 4.2e-13
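The rows of the best-scores table above can be read programmatically. The following is a minimal sketch, assuming each row has the shape `<accession> <description> ( <length>) <opt> <bits> <E>` as seen in this output; the names `Hit`, `HIT_RE`, and `parse_hit` are hypothetical helpers introduced for illustration and are not part of the FASTA package.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'The best scores are:' table (hypothetical container)."""
    accession: str
    description: str
    length: int      # library sequence length in residues
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E() value for the searched library


# Row layout assumed from the table above; whitespace between fields may vary.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<e>[\d.eE+-]+)\s*$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a best-scores row, or None if the line is not a row."""
    match = HIT_RE.match(line.strip())
    if match is None:
        return None
    return Hit(
        accession=match.group("acc"),
        description=match.group("desc"),
        length=int(match.group("len")),
        opt=int(match.group("opt")),
        bits=float(match.group("bits")),
        evalue=float(match.group("e")),
    )
```

Lines that do not match the assumed layout, such as the table header, yield `None`, so the function can be applied to every line of the report and the results filtered.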
>>CCDS7138.1 BMI1 gene_id:648|Hs108|chr10 (326 aa)
initn: 2210 init1: 2210 opt: 2210 Z-score: 1774.7 bits: 336.6 E(32554): 1.6e-92
Smith-Waterman score: 2210; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:1-326)
               10        20        30        40        50        60
pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS71 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS71 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB8 DKRIITDDEIISLSIEFFDQNRLDRKVNKDKEKSKEEVNDKRYLRCPAAMTVMHLRKFLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS71 DKRIITDDEIISLSIEFFDQNRLDRKVNKDKEKSKEEVNDKRYLRCPAAMTVMHLRKFLR
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB8 SKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRPTCKRMKISHQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS71 SKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRPTCKRMKISHQR
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB8 DGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHPQFPHISSTMNGTSNSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS71 DGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHPQFPHISSTMNGTSNSP
              250       260       270       280       290       300
              310       320
pF1KB8 SGNHQSSFANRPRKSSVNGSSATSSG
       ::::::::::::::::::::::::::
CCDS71 SGNHQSSFANRPRKSSVNGSSATSSG
              310       320
>>CCDS59213.1 BMI1 gene_id:100532731|Hs108|chr10 (469 aa)
initn: 2210 init1: 2210 opt: 2210 Z-score: 1772.4 bits: 336.8 E(32554): 2.1e-92
Smith-Waterman score: 2210; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:144-469)
                                             10        20        30
pF1KB8                               MHRTTRIKITELNPHLMCVLCGGYFIDATT
                                     ::::::::::::::::::::::::::::::
CCDS59 LLGRTLIPHPIQRLVLVAAWNNYRIFYQAEMHRTTRIKITELNPHLMCVLCGGYFIDATT
           120       130       140       150       160       170
               40        50        60        70        80        90
pF1KB8 IIECLHSFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 IIECLHSFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNE
           180       190       200       210       220       230
              100       110       120       130       140       150
pF1KB8 MKRRRDFYAAHPSADAANGSNEDRGEVADEDKRIITDDEIISLSIEFFDQNRLDRKVNKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MKRRRDFYAAHPSADAANGSNEDRGEVADEDKRIITDDEIISLSIEFFDQNRLDRKVNKD
           240       250       260       270       280       290
              160       170       180       190       200       210
pF1KB8 KEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIA
           300       310       320       330       340       350
              220       230       240       250       260       270
pF1KB8 YIYTWRRNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 YIYTWRRNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSS
           360       370       380       390       400       410
              280       290       300       310       320
pF1KB8 CLPSPSTPVQSPHPQFPHISSTMNGTSNSPSGNHQSSFANRPRKSSVNGSSATSSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 CLPSPSTPVQSPHPQFPHISSTMNGTSNSPSGNHQSSFANRPRKSSVNGSSATSSG
           420       430       440       450       460
>>CCDS32638.1 PCGF2 gene_id:7703|Hs108|chr17 (344 aa)
initn: 1403 init1: 829 opt: 1376 Z-score: 1110.9 bits: 213.9 E(32554): 1.5e-55
Smith-Waterman score: 1400; 63.2% identity (80.7% similar) in 342 aa overlap (1-320:1-338)
10 20 30 40 50 60
pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV
::::::::::::::::::.::::::::::::.:::::::::::::::::.::::.:::::
CCDS32 MHRTTRIKITELNPHLMCALCGGYFIDATTIVECLHSFCKTCIVRYLETNKYCPMCDVQV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE
:::::::.::::::::::::::::::::.:::::::::::.: ... :::::::::: ..
CCDS32 HKTRPLLSIRSDKTLQDIVYKLVPGLFKDEMKRRRDFYAAYPLTEVPNGSNEDRGEVLEQ
70 80 90 100 110 120
130 140 150 160 170
pF1KB8 DKRIITDDEIISLSIEFFD--QNRLDRK---VNKDKEKSKEEVNDKRYLRCPAAMTVMHL
.: ..::::.::::::.. ..: ..: : : .: : : :.::::::::::::
CCDS32 EKGALSDDEIVSLSIEFYEGARDRDEKKGPLENGDGDKEKTGV---RFLRCPAAMTVMHL
130 140 150 160 170
180 190 200 210 220 230
pF1KB8 RKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRPTCKRMK
::::.:::.:. ....:.::.::::.::::::::::: :::::::::::::.:.:::.
CCDS32 AKFLRNKMDVPSKYKVEVLYEDEPLKEYYTLMDIAYIYPWRRNGPLPLKYRVQPACKRLT
180 190 200 210 220 230
240 250 260 270 280
pF1KB8 ISH---QRDGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQ-SP--------H
.. .: ...: : .: :::: ::: .:.::: ::::.:: . :: :
CCDS32 LATVPTPSEGTNTSGASECESVSDKAPSPAT-LPATSSSLPSPATPSHGSPSSHGPPATH
240 250 260 270 280 290
290 300 310 320
pF1KB8 PQFPHISSTMNGTSNSPSGN-----HQSSFANRPRKSSVNGSSATSSG
: : :: .:.... .:. . : ..: :: .:::.
CCDS32 PTSPTPPSTASGATTAANGGSLNCLQTPSSTSRGRKMTVNGAPVPPLT
300 310 320 330 340
>>CCDS1946.2 PCGF1 gene_id:84759|Hs108|chr2 (259 aa)
initn: 511 init1: 478 opt: 571 Z-score: 472.3 bits: 95.3 E(32554): 5.5e-20
Smith-Waterman score: 578; 41.6% identity (70.0% similar) in 233 aa overlap (6-228:35-255)
10 20 30
pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECL
:.:: .:: :..: ::.:::.::::: :::
CCDS19 QGGQIAIAMRLRNQLQSVYKMDPLRNEEEVRVKIKDLNEHIVCCLCAGYFVDATTITECL
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 HSFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRR
:.:::.:::.::.::::::.:....:.:.::::.. :...::::::::::: .: :: :
CCDS19 HTFCKSCIVKYLQTSKYCPMCNIKIHETQPLLNLKLDRVMQDIVYKLVPGLQDSEEKRIR
70 80 90 100 110 120
100 110 120 130 140
pF1KB8 DFYAA-------HPSADAANGSNEDRGEVA-DEDK-RIITDDEIISLSIEFFDQNRLDRK
.:: . .:... :: . :..: . :: ..: .: ::.
CCDS19 EFYQSRGLDRVTQPTGEEPALSNLGLPFSSFDHSKAHYYRYDEQLNLCLE-----RLSS-
130 140 150 160 170
150 160 170 180 190 200
pF1KB8 VNKDKEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTL
.:::.:: : ...:.:: . : :::. : .. . : ........: : :..:.
CCDS19 -GKDKNKS---VLQNKYVRCSVRAEVRHLRRVLCHRL-MLNPQHVQLLFDNEVLPDHMTM
180 190 200 210 220 230
210 220 230 240 250 260
pF1KB8 MDIAYIYTWR-RNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGI
.: .. : . .:: :.: :.
CCDS19 KQI-WLSRWFGKPSPLLLQYSVKEKRR
240 250
>>CCDS31275.1 PCGF6 gene_id:84108|Hs108|chr10 (350 aa)
initn: 494 init1: 393 opt: 482 Z-score: 399.6 bits: 82.3 E(32554): 6.1e-16
Smith-Waterman score: 482; 39.2% identity (66.5% similar) in 209 aa overlap (7-209:123-322)
10 20 30
pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLH
:...::.:...: .: ::.:::::: ::::
CCDS31 EEEEEEEDMSHFSLRLEGGRQDSEDEEERLINLSELTPYILCSICKGYLIDATTITECLH
100 110 120 130 140 150
40 50 60 70 80 90
pF1KB8 SFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRD
.:::.::::.. :. :: :.. ::.:.:: ::: :. ::::::::: .: . : :. .:
CCDS31 TFCKSCIVRHFYYSNRCPKCNIVVHQTQPLYNIRLDRQLQDIVYKLVINLEEREKKQMHD
160 170 180 190 200 210
100 110 120 130 140 150
pF1KB8 FY------AAHPSADAANGSNEDRGEVADEDKRIITDDEIISLSIEFFDQNRLDRKVNKD
:: . .:.. :.. :.. . :. : . .:: .::. :. .
CCDS31 FYKERGLEVPKPAVPQPVPSSKGRSKKVLESVFRIPPELDMSLLLEFIGANE-----GTG
220 230 240 250 260
160 170 180 190 200 210
pF1KB8 KEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIA
. : : :...: . :. :..:::: :: . . :.:.. .. :..: :: .:
CCDS31 HFKPLE----KKFVRVSGEATIGHVEKFLRRKMGLDPACQVDIICGDHLLEQYQTLREIR
270 280 290 300 310 320
220 230 240 250 260 270
pF1KB8 YIYTWRRNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSS
CCDS31 RAIGDAAMQDGLLVLHYGLVVSPLKIT
330 340 350
>>CCDS7413.1 PCGF5 gene_id:84333|Hs108|chr10 (256 aa)
initn: 448 init1: 357 opt: 474 Z-score: 395.2 bits: 81.1 E(32554): 1.1e-15
Smith-Waterman score: 474; 33.1% identity (63.7% similar) in 245 aa overlap (9-242:9-237)
10 20 30 40 50 60
pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV
. ..::.. : .: ::.: ::. ::::.:::::::...: :. :: : ::
CCDS74 MATQRKHLVKDFNPYITCYICKGYLIKPTTVTECLHTFCKTCIVQHFEDSNDCPRCGNQV
10 20 30 40 50 60
70 80 90 100 110
pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAH-PSADAANGSNE-DRGEVA
:.: :: .: :.::..:..:::::: ..:..:. .:. . :. .. . ... :. .:
CCDS74 HETNPLEMLRLDNTLEEIIFKLVPGLREQELERESEFWKKNKPQENGQDDTSKADKPKV-
70 80 90 100 110
120 130 140 150 160
pF1KB8 DEDKRIITDDEIISLSIEFFDQNRLDRKVN------KDKEKSKEEVND---KRYLRCPAA
::. ::. : .: : .. ... .: ..: :...:: .
CCDS74 DEEGDENEDDK---------DYHRSDPQIAICLDCLRNNGQSGDNVVKGLMKKFIRCSTR
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB8 MTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRP
.:: ..::: :. .:.....::. . : . .: :.. :. :: : ..:
CCDS74 VTVGTIKKFLSLKLKLPSSYELDVLCNGEIMGKDHT-MEFIYMTRWRLRGE---NFRCL-
180 190 200 210 220
230 240 250 260 270 280
pF1KB8 TCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHPQFPHI
.:. .. : ::
CCDS74 NCSASQVCSQ-DGPLYQSYPMVLQYRPRIDFG
230 240 250
>>CCDS3339.2 PCGF3 gene_id:10336|Hs108|chr4 (242 aa)
initn: 558 init1: 412 opt: 415 Z-score: 348.6 bits: 72.4 E(32554): 4.2e-13
Smith-Waterman score: 524; 36.8% identity (66.1% similar) in 239 aa overlap (4-226:3-236)
10 20 30 40 50 60
pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV
: .::. ..: :. : ::.::.:::::. ::::.::..:.:.::: .. :: : . .
CCDS33 MLTRKIKLWDINAHITCRLCSGYLIDATTVTECLHTFCRSCLVKYLEENNTCPTCRIVI
10 20 30 40 50
70 80 90 100 110 120
pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE
:...:: : :.:.::::::::::: . ::...:.:: : . . : . .::. .
CCDS33 HQSHPLQYIGHDRTMQDIVYKLVPGLQEAEMRKQREFY--HKLGMEVPG--DIKGETCSA
60 70 80 90 100 110
130 140 150 160
pF1KB8 DKRIIT--------DDEIISLSIEFF-----DQNRLDRKVNKDKE--KSKEEVNDKRYLR
... . :: . . : : .: :..:. : .:: . ....:
CCDS33 KQHLDSHRNGETKADDSSNKEAAEEKPEEDNDYHRSDEQVSICLECNSSKLRGLKRKWIR
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB8 CPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWR-RNGPLPLK
: : ::.::.::. .:... . ..:.. .:: : .:: .. . :: ...:: :.
CCDS33 CSAQATVLHLKKFIAKKLNLSSFNELDILCNEEILGKDHTL-KFVVVTRWRFKKAPLLLH
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB8 YRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHP
::
CCDS33 YRPKMDLL
240
326 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 22:02:56 2016 done: Fri Nov 4 22:02:56 2016
Total Scan time: 2.530 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
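As a sanity check on the report above, the bit scores and E(32554) values in the best-scores table are numerically consistent with E ≈ N · m · n · 2^(−bits), where N is the number of library sequences (32554), m the query length (326 aa) and n the library sequence length. The sketch below only reproduces that arithmetic. It is an observation checked against the rows of this output, not a description of how FASTA derives E() internally (the Statistics lines show it fits score statistics from the search itself), and the function name `evalue_from_bits` is hypothetical.

```python
def evalue_from_bits(bits: float, query_len: int, subject_len: int,
                     n_library_seqs: int) -> float:
    """Approximate E() as N * m * n * 2**(-bits).

    Assumption: this relation is inferred from the values printed in the
    report above; it is not taken from the FASTA source code.
    """
    return n_library_seqs * query_len * subject_len * 2.0 ** (-bits)


# Values copied from the report: (subject length, bits, reported E(32554)).
REPORTED = {
    "CCDS7138.1": (326, 336.6, 1.6e-92),
    "CCDS32638.1": (344, 213.9, 1.5e-55),
    "CCDS3339.2": (242, 72.4, 4.2e-13),
}

for name, (subject_len, bits, reported) in REPORTED.items():
    estimate = evalue_from_bits(bits, 326, subject_len, 32554)
    print(f"{name}: estimated {estimate:.2e}, reported {reported:.1e}")
```

Small differences between the estimate and the reported value are expected, because the bit scores in the report are rounded to one decimal place.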