FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8278, 326 aa 1>>>pF1KB8278 326 - 326 aa - 326 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.9024+/-0.000864; mu= 5.3444+/- 0.052 mean_var=158.0262+/-32.449, 0's: 0 Z-trim(112.0): 48 B-trim: 0 in 0/52 Lambda= 0.102026 statistics sampled from 12795 (12840) to 12795 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.747), E-opt: 0.2 (0.394), width: 16 Scan time: 2.530 The best scores are: opt bits E(32554) CCDS7138.1 BMI1 gene_id:648|Hs108|chr10 ( 326) 2210 336.6 1.6e-92 CCDS59213.1 BMI1 gene_id:100532731|Hs108|chr10 ( 469) 2210 336.8 2.1e-92 CCDS32638.1 PCGF2 gene_id:7703|Hs108|chr17 ( 344) 1376 213.9 1.5e-55 CCDS1946.2 PCGF1 gene_id:84759|Hs108|chr2 ( 259) 571 95.3 5.5e-20 CCDS31275.1 PCGF6 gene_id:84108|Hs108|chr10 ( 350) 482 82.3 6.1e-16 CCDS7413.1 PCGF5 gene_id:84333|Hs108|chr10 ( 256) 474 81.1 1.1e-15 CCDS3339.2 PCGF3 gene_id:10336|Hs108|chr4 ( 242) 415 72.4 4.2e-13 >>CCDS7138.1 BMI1 gene_id:648|Hs108|chr10 (326 aa) initn: 2210 init1: 2210 opt: 2210 Z-score: 1774.7 bits: 336.6 E(32554): 1.6e-92 Smith-Waterman score: 2210; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:1-326) 10 20 30 40 50 60 pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS71 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS71 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 DKRIITDDEIISLSIEFFDQNRLDRKVNKDKEKSKEEVNDKRYLRCPAAMTVMHLRKFLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS71 DKRIITDDEIISLSIEFFDQNRLDRKVNKDKEKSKEEVNDKRYLRCPAAMTVMHLRKFLR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 SKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRPTCKRMKISHQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS71 SKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRPTCKRMKISHQR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 DGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHPQFPHISSTMNGTSNSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS71 DGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHPQFPHISSTMNGTSNSP 250 260 270 280 290 300 310 320 pF1KB8 SGNHQSSFANRPRKSSVNGSSATSSG :::::::::::::::::::::::::: CCDS71 SGNHQSSFANRPRKSSVNGSSATSSG 310 320 >>CCDS59213.1 BMI1 gene_id:100532731|Hs108|chr10 (469 aa) initn: 2210 init1: 2210 opt: 2210 Z-score: 1772.4 bits: 336.8 E(32554): 2.1e-92 Smith-Waterman score: 2210; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:144-469) 10 20 30 pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATT :::::::::::::::::::::::::::::: CCDS59 LLGRTLIPHPIQRLVLVAAWNNYRIFYQAEMHRTTRIKITELNPHLMCVLCGGYFIDATT 120 130 140 150 160 170 40 50 60 70 80 90 pF1KB8 IIECLHSFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 IIECLHSFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNE 180 190 200 210 220 230 100 110 120 130 140 150 pF1KB8 MKRRRDFYAAHPSADAANGSNEDRGEVADEDKRIITDDEIISLSIEFFDQNRLDRKVNKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MKRRRDFYAAHPSADAANGSNEDRGEVADEDKRIITDDEIISLSIEFFDQNRLDRKVNKD 240 250 260 270 280 290 160 170 180 190 200 210 pF1KB8 KEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 KEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIA 300 310 320 330 340 350 220 230 240 250 260 270 pF1KB8 YIYTWRRNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 YIYTWRRNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSS 360 370 380 390 400 410 280 290 300 310 320 pF1KB8 CLPSPSTPVQSPHPQFPHISSTMNGTSNSPSGNHQSSFANRPRKSSVNGSSATSSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 CLPSPSTPVQSPHPQFPHISSTMNGTSNSPSGNHQSSFANRPRKSSVNGSSATSSG 420 430 440 450 460 >>CCDS32638.1 PCGF2 gene_id:7703|Hs108|chr17 (344 aa) initn: 1403 init1: 829 opt: 1376 Z-score: 1110.9 bits: 213.9 E(32554): 1.5e-55 Smith-Waterman score: 1400; 63.2% identity (80.7% similar) in 342 aa overlap (1-320:1-338) 10 20 30 40 50 60 pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV ::::::::::::::::::.::::::::::::.:::::::::::::::::.::::.::::: CCDS32 MHRTTRIKITELNPHLMCALCGGYFIDATTIVECLHSFCKTCIVRYLETNKYCPMCDVQV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE :::::::.::::::::::::::::::::.:::::::::::.: ... :::::::::: .. CCDS32 HKTRPLLSIRSDKTLQDIVYKLVPGLFKDEMKRRRDFYAAYPLTEVPNGSNEDRGEVLEQ 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 DKRIITDDEIISLSIEFFD--QNRLDRK---VNKDKEKSKEEVNDKRYLRCPAAMTVMHL .: ..::::.::::::.. ..: ..: : : .: : : :.:::::::::::: CCDS32 EKGALSDDEIVSLSIEFYEGARDRDEKKGPLENGDGDKEKTGV---RFLRCPAAMTVMHL 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 RKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRPTCKRMK ::::.:::.:. ....:.::.::::.::::::::::: :::::::::::::.:.:::. CCDS32 AKFLRNKMDVPSKYKVEVLYEDEPLKEYYTLMDIAYIYPWRRNGPLPLKYRVQPACKRLT 180 190 200 210 220 230 240 250 260 270 280 pF1KB8 ISH---QRDGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQ-SP--------H .. .: ...: : .: :::: ::: .:.::: ::::.:: . :: : CCDS32 LATVPTPSEGTNTSGASECESVSDKAPSPAT-LPATSSSLPSPATPSHGSPSSHGPPATH 240 250 260 270 280 290 290 300 310 320 pF1KB8 PQFPHISSTMNGTSNSPSGN-----HQSSFANRPRKSSVNGSSATSSG : : :: .:.... .:. . : ..: :: .:::. CCDS32 PTSPTPPSTASGATTAANGGSLNCLQTPSSTSRGRKMTVNGAPVPPLT 300 310 320 330 340 >>CCDS1946.2 PCGF1 gene_id:84759|Hs108|chr2 (259 aa) initn: 511 init1: 478 opt: 571 Z-score: 472.3 bits: 95.3 E(32554): 5.5e-20 Smith-Waterman score: 578; 41.6% identity (70.0% similar) in 233 aa overlap (6-228:35-255) 10 20 30 pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECL :.:: .:: :..: ::.:::.::::: ::: CCDS19 QGGQIAIAMRLRNQLQSVYKMDPLRNEEEVRVKIKDLNEHIVCCLCAGYFVDATTITECL 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB8 HSFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRR :.:::.:::.::.::::::.:....:.:.::::.. :...::::::::::: .: :: : CCDS19 HTFCKSCIVKYLQTSKYCPMCNIKIHETQPLLNLKLDRVMQDIVYKLVPGLQDSEEKRIR 70 80 90 100 110 120 100 110 120 130 140 pF1KB8 DFYAA-------HPSADAANGSNEDRGEVA-DEDK-RIITDDEIISLSIEFFDQNRLDRK .:: . .:... :: . :..: . :: ..: .: ::. CCDS19 EFYQSRGLDRVTQPTGEEPALSNLGLPFSSFDHSKAHYYRYDEQLNLCLE-----RLSS- 130 140 150 160 170 150 160 170 180 190 200 pF1KB8 VNKDKEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTL .:::.:: : ...:.:: . : :::. : .. . : ........: : :..:. CCDS19 -GKDKNKS---VLQNKYVRCSVRAEVRHLRRVLCHRL-MLNPQHVQLLFDNEVLPDHMTM 180 190 200 210 220 230 210 220 230 240 250 260 pF1KB8 MDIAYIYTWR-RNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGI .: .. : . .:: :.: :. CCDS19 KQI-WLSRWFGKPSPLLLQYSVKEKRR 240 250 >>CCDS31275.1 PCGF6 gene_id:84108|Hs108|chr10 (350 aa) initn: 494 init1: 393 opt: 482 Z-score: 399.6 bits: 82.3 E(32554): 6.1e-16 Smith-Waterman score: 482; 39.2% identity (66.5% similar) in 209 aa overlap (7-209:123-322) 10 20 30 pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLH :...::.:...: .: ::.:::::: :::: CCDS31 EEEEEEEDMSHFSLRLEGGRQDSEDEEERLINLSELTPYILCSICKGYLIDATTITECLH 100 110 120 130 140 150 40 50 60 70 80 90 pF1KB8 SFCKTCIVRYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRD .:::.::::.. :. :: :.. ::.:.:: ::: :. ::::::::: .: . : :. .: CCDS31 TFCKSCIVRHFYYSNRCPKCNIVVHQTQPLYNIRLDRQLQDIVYKLVINLEEREKKQMHD 160 170 180 190 200 210 100 110 120 130 140 150 pF1KB8 FY------AAHPSADAANGSNEDRGEVADEDKRIITDDEIISLSIEFFDQNRLDRKVNKD :: . .:.. :.. :.. . :. : . .:: .::. :. . CCDS31 FYKERGLEVPKPAVPQPVPSSKGRSKKVLESVFRIPPELDMSLLLEFIGANE-----GTG 220 230 240 250 260 160 170 180 190 200 210 pF1KB8 KEKSKEEVNDKRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIA . : : :...: . :. :..:::: :: . . :.:.. .. :..: :: .: CCDS31 HFKPLE----KKFVRVSGEATIGHVEKFLRRKMGLDPACQVDIICGDHLLEQYQTLREIR 270 280 290 300 310 320 220 230 240 250 260 270 pF1KB8 YIYTWRRNGPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSS CCDS31 RAIGDAAMQDGLLVLHYGLVVSPLKIT 330 340 350 >>CCDS7413.1 PCGF5 gene_id:84333|Hs108|chr10 (256 aa) initn: 448 init1: 357 opt: 474 Z-score: 395.2 bits: 81.1 E(32554): 1.1e-15 Smith-Waterman score: 474; 33.1% identity (63.7% similar) in 245 aa overlap (9-242:9-237) 10 20 30 40 50 60 pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV . ..::.. : .: ::.: ::. ::::.:::::::...: :. :: : :: CCDS74 MATQRKHLVKDFNPYITCYICKGYLIKPTTVTECLHTFCKTCIVQHFEDSNDCPRCGNQV 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAH-PSADAANGSNE-DRGEVA :.: :: .: :.::..:..:::::: ..:..:. .:. . :. .. . ... :. .: CCDS74 HETNPLEMLRLDNTLEEIIFKLVPGLREQELERESEFWKKNKPQENGQDDTSKADKPKV- 70 80 90 100 110 120 130 140 150 160 pF1KB8 DEDKRIITDDEIISLSIEFFDQNRLDRKVN------KDKEKSKEEVND---KRYLRCPAA ::. ::. : .: : .. ... .: ..: :...:: . CCDS74 DEEGDENEDDK---------DYHRSDPQIAICLDCLRNNGQSGDNVVKGLMKKFIRCSTR 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB8 MTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRNGPLPLKYRVRP .:: ..::: :. .:.....::. . : . .: :.. :. :: : ..: CCDS74 VTVGTIKKFLSLKLKLPSSYELDVLCNGEIMGKDHT-MEFIYMTRWRLRGE---NFRCL- 180 190 200 210 220 230 240 250 260 270 280 pF1KB8 TCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHPQFPHI .:. .. : :: CCDS74 NCSASQVCSQ-DGPLYQSYPMVLQYRPRIDFG 230 240 250 >>CCDS3339.2 PCGF3 gene_id:10336|Hs108|chr4 (242 aa) initn: 558 init1: 412 opt: 415 Z-score: 348.6 bits: 72.4 E(32554): 4.2e-13 Smith-Waterman score: 524; 36.8% identity (66.1% similar) in 239 aa overlap (4-226:3-236) 10 20 30 40 50 60 pF1KB8 MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQV : .::. ..: :. : ::.::.:::::. ::::.::..:.:.::: .. :: : . . CCDS33 MLTRKIKLWDINAHITCRLCSGYLIDATTVTECLHTFCRSCLVKYLEENNTCPTCRIVI 10 20 30 40 50 70 80 90 100 110 120 pF1KB8 HKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPSADAANGSNEDRGEVADE :...:: : :.:.::::::::::: . ::...:.:: : . . : . .::. . CCDS33 HQSHPLQYIGHDRTMQDIVYKLVPGLQEAEMRKQREFY--HKLGMEVPG--DIKGETCSA 60 70 80 90 100 110 130 140 150 160 pF1KB8 DKRIIT--------DDEIISLSIEFF-----DQNRLDRKVNKDKE--KSKEEVNDKRYLR ... . :: . . : : .: :..:. : .:: . ....: CCDS33 KQHLDSHRNGETKADDSSNKEAAEEKPEEDNDYHRSDEQVSICLECNSSKLRGLKRKWIR 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB8 CPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWR-RNGPLPLK : : ::.::.::. .:... . ..:.. .:: : .:: .. . :: ...:: :. CCDS33 CSAQATVLHLKKFIAKKLNLSSFNELDILCNEEILGKDHTL-KFVVVTRWRFKKAPLLLH 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB8 YRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGIPSTSSCLPSPSTPVQSPHP :: CCDS33 YRPKMDLL 240 326 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 22:02:56 2016 done: Fri Nov 4 22:02:56 2016 Total Scan time: 2.530 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]