FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA0116, 291 aa
  1>>>pF1KA0116 291 - 291 aa - 291 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5859+/-0.000825; mu= 12.8372+/- 0.049
 mean_var=53.0006+/-10.558, 0's: 0 Z-trim(104.6): 22 B-trim: 38 in 1/51
 Lambda= 0.176171
 statistics sampled from 7989 (7996) to 7989 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.623), E-opt: 0.2 (0.246), width: 16
 Scan time: 2.250

The best scores are:                                 opt  bits E(32554)
CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3   ( 291) 1904 491.8 2.4e-139
CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13 ( 276)  374 103.0 2.6e-22
CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4    ( 439)  375 103.2 3.5e-22
CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4   ( 456)  375 103.2 3.6e-22

>>CCDS2725.1 EXOSC7 gene_id:23016|Hs108|chr3 (291 aa)
 initn: 1904 init1: 1904 opt: 1904 Z-score: 2615.1 bits: 491.8 E(32554): 2.4e-139
Smith-Waterman score: 1904; 99.7% identity (100.0% similar) in 291 aa overlap (1-291:1-291)

               10        20        30        40        50        60
pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA0 ELSDDPYDCIRLSVENVPCIVTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVTCMR
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 ELSDDPYDCIRLSVENVPCIVTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVTCMR 190 200 210 220 230 240 250 260 270 280 290 pF1KA0 KVGKGSLDPESIFEMMETGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG :::::::::::::::::::::::::::::::::.::::::::::::::::: CCDS27 KVGKGSLDPESIFEMMETGKRVGKVLHASLQSVVHKEESLGPKRQKVGFLG 250 260 270 280 290 >>CCDS31958.1 EXOSC8 gene_id:11340|Hs108|chr13 (276 aa) initn: 391 init1: 316 opt: 374 Z-score: 513.9 bits: 103.0 E(32554): 2.6e-22 Smith-Waterman score: 374; 28.9% identity (62.8% similar) in 266 aa overlap (18-276:18-276) 10 20 30 40 50 60 pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV ..:. : ::: ..: . :. .:...::: ::::.: .. CCDS31 MAAGFKTVEPLEYYRRFLKENCRPDGRELGEFRTTTVNIGSISTADGSALVKLGNTTVIC 10 20 30 40 50 60 70 80 90 100 110 pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFE-GRGGDDLGTEIANTLYR-IFNNKSS :::::...:. . :..::. :: . .:. : :.. ...:. . ...:.. CCDS31 GVKAEFAAPSTDAPDKGYVVPNVDLPPLCSSRFRSGPPGEE--AQVASQFIADVIENSQI 70 80 90 100 110 120 130 140 150 160 170 pF1KA0 VDLKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSK .. . ::::: . :::: :.. :. ::..:: ..:. ::: :...:.: . : : . CCDS31 IQKEDLCISPGKLVWVLYCDLICLDYDGNILDACTFALLAALKNVQLPEVTINE-ETALA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KA0 DIELSDDPYDCIRLSVENVPCIVTLCKIGYRH-VVDATLQEEACSLASLLVSVTSKGVVT ...:. : :.... : ... . .:: : .:: . ..: . . .: . CCDS31 EVNLKKKSY----LNIRTHPVATSFAVFDDTLLIVDPTGEEEHLATGTLTIVMDEEGKLC 180 190 200 210 220 230 240 250 260 270 280 290 pF1KA0 CMRKVGKGSLDPESIFEMMETG----KRVGKVLHASLQSVLHKEESLGPKRQKVGFLG :..: : ..: .. . : . :.: :.. ..:. : CCDS31 CLHKPGGSGLTGAKLQDCMSRAVTRHKEVKKLMDEVIKSMKPK 240 250 260 270 >>CCDS3722.2 EXOSC9 gene_id:5393|Hs108|chr4 (439 aa) initn: 355 init1: 262 opt: 375 Z-score: 511.7 bits: 103.2 E(32554): 3.5e-22 Smith-Waterman score: 375; 28.9% identity (60.2% similar) in 294 aa overlap (1-289:1-284) 10 20 30 40 50 60 pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV : . ::. :. 
.......: :.::: ::: ... .. : :.::.: .: CCDS37 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRIS---FGTDYGCCIVELGKTRVLG 10 20 30 40 50 70 80 90 100 110 120 pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD :. :. .:::.. .:: : : .. : :.: :: .:: ... . : . :.. .: CCDS37 QVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNSKCID 60 70 80 90 100 110 130 140 150 160 170 180 pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI ..::. :. : . ::. ::. ::..:: :::. .:: . : : : : :: . CCDS37 TESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDE-----V 120 130 140 150 160 170 190 200 210 220 230 pF1KA0 EL-SDDPYDCIRLSVENVP-CI-VTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVT : . . : . ::....: :. .. . : .:: . .:: . .::: . .: CCDS37 TLYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERV-MDGLLVIAMNKHREI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KA0 C-MRKVGKGSLDPESIFEMME-TGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG : ... : : ..... . .: .:... . :.. :...... . : :: CCDS37 CTIQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKA-LENDQKVRKEGGKFGFAESIAN 240 250 260 270 280 290 CCDS37 QRITAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDL 300 310 320 330 340 350 >>CCDS34057.1 EXOSC9 gene_id:5393|Hs108|chr4 (456 aa) initn: 355 init1: 262 opt: 375 Z-score: 511.4 bits: 103.2 E(32554): 3.6e-22 Smith-Waterman score: 375; 28.9% identity (60.2% similar) in 294 aa overlap (1-289:1-284) 10 20 30 40 50 60 pF1KA0 MASVTLSEAEKVYIVHGVQEDLRVDGRGCEDYRCVEVETDVVSNTSGSARVKLGHTDILV : . ::. :. .......: :.::: ::: ... .. : :.::.: .: CCDS34 MKETPLSNCERRFLLRAIEEKKRLDGRQTYDYRNIRIS---FGTDYGCCIVELGKTRVLG 10 20 30 40 50 70 80 90 100 110 120 pF1KA0 GVKAEMGTPKLEKPNEGYLEFFVDCSASATPEFEGRGGDDLGTEIANTLYRIFNNKSSVD :. :. .:::.. .:: : : .. : :.: :: .:: ... . : . :.. .: CCDS34 QVSCELVSPKLNRATEGILFFNLELSQMAAPAFEPGRQSDLLVKLNRLMERCLRNSKCID 60 70 80 90 100 110 130 140 150 160 170 180 pF1KA0 LKTLCISPREHCWVLYVDVLLLECGGNLFDAISIAVKAALFNTRIPRVRVLEDEEGSKDI ..::. :. : . ::. ::. ::..:: :::. .:: . : : : : :: . 
CCDS34 TESLCVVAGEKVWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDE-----V 120 130 140 150 160 170 190 200 210 220 230 pF1KA0 EL-SDDPYDCIRLSVENVP-CI-VTLCKIGYRHVVDATLQEEACSLASLLVSVTSKGVVT : . . : . ::....: :. .. . : .:: . .:: . .::: . .: CCDS34 TLYTPEERDPVPLSIHHMPICVSFAFFQQGTYLLVDPNEREERV-MDGLLVIAMNKHREI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KA0 C-MRKVGKGSLDPESIFEMME-TGKRVGKVLHASLQSVLHKEESLGPKRQKVGFLG : ... : : ..... . .: .:... . :.. :...... . : :: CCDS34 CTIQSSGGIMLLKDQVLRCSKIAGVKVAEITELILKA-LENDQKVRKEGGKFGFAESIAN 240 250 260 270 280 290 CCDS34 QRITAFKMEKAPIDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDL 300 310 320 330 340 350 291 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Nov 2 18:08:56 2016 done: Wed Nov 2 18:08:56 2016 Total Scan time: 2.250 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]