FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9783, 251 aa
  1>>>pF1KB9783 251 - 251 aa - 251 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.1288+/-0.000671; mu= 8.1472+/- 0.041
 mean_var=134.3932+/-27.130, 0's: 0 Z-trim(115.9): 25 B-trim: 374 in 1/50
 Lambda= 0.110633
 statistics sampled from 16411 (16435) to 16411 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.824), E-opt: 0.2 (0.505), width: 16
 Scan time: 2.850

The best scores are:                               opt  bits E(32554)
CCDS13986.1 CBX7 gene_id:23492|Hs108|chr22 ( 251) 1719 284.5  4.5e-77
CCDS32758.1 CBX4 gene_id:8535|Hs108|chr17  ( 560)  447  81.8  1.1e-15
CCDS77675.1 CBX6 gene_id:23466|Hs108|chr22 ( 394)  420  77.3  1.7e-14
CCDS13980.1 CBX6 gene_id:23466|Hs108|chr22 ( 412)  395  73.4  2.8e-13
CCDS11765.1 CBX8 gene_id:57332|Hs108|chr17 ( 389)  380  71.0  1.4e-12
CCDS11764.1 CBX2 gene_id:84733|Hs108|chr17 ( 211)  345  65.2  4.1e-11
CCDS32757.1 CBX2 gene_id:84733|Hs108|chr17 ( 532)  348  65.9  6.1e-11

>>CCDS13986.1 CBX7 gene_id:23492|Hs108|chr22 (251 aa)
 initn: 1719 init1: 1719 opt: 1719 Z-score: 1497.1 bits: 284.5 E(32554): 4.5e-77
Smith-Waterman score: 1719; 100.0% identity (100.0% similar) in 251 aa overlap (1-251:1-251)

               10        20        30        40        50        60
pF1KB9 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 EERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 EERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVKA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 GAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 DVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALPSSEVTVTDITANSITVTFREAQAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALPSSEVTVTDITANSITVTFREAQAA
              190       200       210       220       230       240

              250
pF1KB9 EGFFRDRSGKF
       :::::::::::
CCDS13 EGFFRDRSGKF
              250

>>CCDS32758.1 CBX4 gene_id:8535|Hs108|chr17 (560 aa)
 initn: 505 init1: 426 opt: 447 Z-score: 395.0 bits: 81.8 E(32554): 1.1e-15
Smith-Waterman score: 447; 39.0% identity (63.8% similar) in 213 aa overlap (1-207:1-206)

               10        20        30        40        50        60
pF1KB9 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK
       ::: :.::.:::::::.:::.:::.:::::::.:: :::.::::::.::::::..:....
CCDS32 MELPAVGEHVFAVESIEKKRIRKGRVEYLVKWRGWSPKYNTWEPEENILDPRLLIAFQNR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 EERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVKA
       :....  :::::::::: :..:       : :.   : .    .    :  :. .:  ..
CCDS32 ERQEQLMGYRKRGPKPKPLVVQ--VPTFARRSNVLTGLQDSSTDNRAKLDLGA-QGKGQG
               70        80          90       100       110

                130       140           150       160       170
pF1KB9 GAPELVDKG--PLVPTLPFPLRKP----RKAHKYLRLSRKKFPPRGPNLESHSHRRELFL
          :: .:      :       ::    .... : .:. ::  :  :. . .. . .
CCDS32 HQYELNSKKHHQYQPHSKERAGKPPPPGKSGKYYYQLNSKKHHPYQPDPKMYDLQYQGGH
       120       130       140       150       160       170

          180       190       200       210       220       230
pF1KB9 QEPPAPDVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALPSSEVTVTDITANSITVTF
       .: :.:   . ...    ..::.. :..  :.:
CCDS32 KEAPSPTCPDLGAK----SHPPDKWAQGAGAKGYLGAVKPLAGAAGAPGKGSEKGPPNGM
       180       190           200       210       220       230

          240       250
pF1KB9 REAQAAEGFFRDRSGKF

CCDS32 MPAPKEAVTGNGIGGKMKIVKNKNKNGRIVIVMSKYMENGMQAVKIKSGEVAEGEARSPS
           240       250       260       270       280       290

>>CCDS77675.1 CBX6 gene_id:23466|Hs108|chr22 (394 aa)
 initn: 517 init1: 391 opt: 420 Z-score: 373.8 bits: 77.3 E(32554): 1.7e-14
Smith-Waterman score: 423; 46.4% identity (64.3% similar) in 168 aa overlap (1-161:1-151)

               10        20        30        40        50        60
pF1KB9 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK
       :::::.::.:::.::: :.:.:::..:::::::::  :::::::::.::: ::. :.:.:
CCDS77 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK
               10        20        30        40        50        60

               70        80           90          100        110
pF1KB9 EERDRASGYRKRGPKPKRLLLQRLYSMD---LRSS---HKAKGKEKLCFSLTC-PLGSGS
       :.. .  : .::::::: .::.   : .   :.::   :. :   . :  ..  ::   .
CCDS77 ERERELYGPKKRGPKPKTFLLKPSASASSPKLHSSAAVHRLKKDIRRCHRMSRRPLPRPD
               70        80        90       100       110       120

           120       130       140       150       160       170
pF1KB9 PEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELF
       :.:    :.: :    :  :  ::         . .:.  .:  :: :
CCDS77 PQG----GSPGLR---P--PISPFS--------ETVRIINRKVKPREPKRNRIILNLKVI
                     130                 140       150       160

           180       190       200       210       220       230
pF1KB9 LQEPPAPDVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALPSSEVTVTDITANSITVT

CCDS77 DKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHMKFGAFALYKPPPAP
           170       180       190       200       210       220

>>CCDS13980.1 CBX6 gene_id:23466|Hs108|chr22 (412 aa)
 initn: 517 init1: 391 opt: 395 Z-score: 352.0 bits: 73.4 E(32554): 2.8e-13
Smith-Waterman score: 400; 43.8% identity (65.1% similar) in 169 aa overlap (1-165:1-156)

               10        20        30        40        50        60
pF1KB9 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK
       :::::.::.:::.::: :.:.:::..:::::::::  :::::::::.::: ::. :.:.:
CCDS13 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK
               10        20        30        40        50        60

               70        80         90       100       110
pF1KB9 EERDRASGYRKRGPKPKRLLLQ-RLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVK
       :.. .  : .::::::: .::. :  .  :: :        . ::.    ...::.   .
CCDS13 ERERELYGPKKRGPKPKTFLLKARAQAEALRISD-------VHFSVKPSASASSPKLHSS
               70        80        90              100       110

     120          130       140       150       160       170
pF1KB9 AGAPEL---VDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQE
       :.. .:   . .   .   :.:   :. .   ::      :: .:  :.
CCDS13 AAVHRLKKDIRRCHRMSRRPLPRPDPQGGSPGLR------PPISPFSETVRIINRKVKPR
           120       130       140             150       160

        180       190       200       210       220       230
pF1KB9 PPAPDVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALPSSEVTVTDITANSITVTFRE

CCDS13 EPKRNRIILNLKVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHM
       170       180       190       200       210       220

>>CCDS11765.1 CBX8 gene_id:57332|Hs108|chr17 (389 aa)
 initn: 457 init1: 380 opt: 380 Z-score: 339.4 bits: 71.0 E(32554): 1.4e-12
Smith-Waterman score: 380; 64.6% identity (87.8% similar) in 82 aa overlap (1-82:1-82)

               10        20        30        40        50        60
pF1KB9 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK
       :::::.::.:::.:.. :.:.:::..:::::::::  :::::::::.::: ::. :.::.
CCDS11 MELSAVGERVFAAEALLKRRIRKGRMEYLVKWKGWSQKYSTWEPEENILDARLLAAFEER
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 EERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVKA
       :.. .  : .::::::: .::.
CCDS11 EREMELYGPKKRGPKPKTFLLKAQAKAKAKTYEFRSDSARGIRIPYPGRSPQDLASTSRA
               70        80        90       100       110       120

>>CCDS11764.1 CBX2 gene_id:84733|Hs108|chr17 (211 aa)
 initn: 395 init1: 305 opt: 345 Z-score: 313.0 bits: 65.2 E(32554): 4.1e-11
Smith-Waterman score: 345; 49.1% identity (76.7% similar) in 116 aa overlap (2-117:3-114)

                10        20        30        40        50
pF1KB9  MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEE
         :::..::::::.: : .::.::::.::::::.::  :...:::::.::::::..:...
CCDS11 MEELSSVGEQVFAAECILSKRLRKGKLEYLVKWRGWSSKHNSWEPEENILDPRLLLAFQK
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KB9 KEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVK
       ::.. .... :::: .: :   ..: .:.   :...: :   :   . . :  :  : ::
CCDS11 KEHEKEVQN-RKRGKRP-RGRPRKLTAMS-SCSRRSKLKVGGCAGYADPT-SQHPLGVGG
                70         80         90       100        110

     120       130       140       150       160       170
pF1KB9 AGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPA

CCDS11 RQREGLGPSGRGWHFCQQSVPLLGKQEPPFFLSLSFCCQGPQPAESSSPPLPGASCFSLS
        120       130       140       150       160       170

>>CCDS32757.1 CBX2 gene_id:84733|Hs108|chr17 (532 aa)
 initn: 426 init1: 305 opt: 348 Z-score: 309.9 bits: 65.9 E(32554): 6.1e-11
Smith-Waterman score: 349; 36.8% identity (60.5% similar) in 223 aa overlap (2-205:3-212)

                10        20        30        40        50
pF1KB9  MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEE
         :::..::::::.: : .::.::::.::::::.::  :...:::::.::::::..:...
CCDS32 MEELSSVGEQVFAAECILSKRLRKGKLEYLVKWRGWSSKHNSWEPEENILDPRLLLAFQK
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KB9 KEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVK
       ::.. .... :::: .: :   ..: .:.   :...: ::    : .   .:.:     .
CCDS32 KEHEKEVQN-RKRGKRP-RGRPRKLTAMS-SCSRRSKLKEPDAPSKSKSSSSSSSSTSSS
                70         80         90       100       110

     120             130        140               150       160
pF1KB9 AGAPELVD------KGPL-VPTLPFPLRKPR--------KAHKYLRLSRKKFPPRGPNLE
       ... :  :      .::    : : : .: .        :     . .:: .::     :
CCDS32 SSSDEEDDSDLDAKRGPRGRETHPVPQKKAQILVAKPELKDPIRKKRGRKPLPP-----E
       120       130       140       150       160       170

          170       180         190         200       210       220
pF1KB9 SHSHRRELFLQEPPAPDVLQAAGE--WEPAAQ--PPEEEADADLAEGPPPWTPALPSSEV
       ... :: . : .     ::..: .    ::..  ::     : ::
CCDS32 QKATRRPVSLAK-----VLKTARKDLGAPASKLPPPLSAPVAGLAALKAHAKEACGGPSA
            180            190       200       210       220

              230       240       250
pF1KB9 TVTDITANSITVTFREAQAAEGFFRDRSGKF

CCDS32 MATPENLASLMKGMASSPGRGGISWQSSIVHYMNRMTQSQAQAASRLALKAQATNKCGLG
       230       240       250       260       270       280

251 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 20:05:39 2016 done: Fri Nov 4 20:05:39 2016
 Total Scan time: 2.850 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]