FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB4038, 363 aa 1>>>pF1KB4038 363 - 363 aa - 363 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.6462+/-0.000773; mu= 7.9881+/- 0.047 mean_var=158.0736+/-32.181, 0's: 0 Z-trim(114.8): 38 B-trim: 43 in 1/52 Lambda= 0.102010 statistics sampled from 15307 (15343) to 15307 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.471), width: 16 Scan time: 3.080 The best scores are: opt bits E(32554) CCDS11286.1 RFFL gene_id:117584|Hs108|chr17 ( 363) 2533 383.9 1.2e-106 CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12 ( 372) 981 155.5 6.8e-38 CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12 ( 373) 981 155.5 6.8e-38 CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12 ( 180) 607 100.2 1.4e-21 >>CCDS11286.1 RFFL gene_id:117584|Hs108|chr17 (363 aa) initn: 2533 init1: 2533 opt: 2533 Z-score: 2028.3 bits: 383.9 E(32554): 1.2e-106 Smith-Waterman score: 2533; 100.0% identity (100.0% similar) in 363 aa overlap (1-363:1-363) 10 20 30 40 50 60 pF1KB4 MWATCCNWFCLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLEPSCKSCGAHFANTARK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MWATCCNWFCLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLEPSCKSCGAHFANTARK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 QTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELMKMKVKDLRDYLSLHDIST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELMKMKVKDLRDYLSLHDIST 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 EMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFLTQPHSSMVPPTSPNLPSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 EMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFLTQPHSSMVPPTSPNLPSS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 SAQATSVPPAQVQENQQANGHVSQDQEEPVYLESVARVPAEDETQSIDSEDSFVPGRRAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SAQATSVPPAQVQENQQANGHVSQDQEEPVYLESVARVPAEDETQSIDSEDSFVPGRRAS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 LSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEKWELMERVTRLYKDQKGLQHLVSGAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEKWELMERVTRLYKDQKGLQHLVSGAE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 DQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMVTCTKCGKRMNECPICRQYVIRAVHV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMVTCTKCGKRMNECPICRQYVIRAVHV 310 320 330 340 350 360 pF1KB4 FRS ::: CCDS11 FRS >>CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12 (372 aa) initn: 939 init1: 432 opt: 981 Z-score: 793.8 bits: 155.5 E(32554): 6.8e-38 Smith-Waterman score: 981; 44.2% identity (69.1% similar) in 382 aa overlap (1-363:8-372) 10 20 30 40 pF1KB4 MWATCCNWF-------CLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLE-P :::.::. . . :: : .. :: .:..: :.. : : CCDS31 MKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGP-FRFTPNPEFSTYP-PAATEGP 10 20 30 40 50 50 60 70 80 90 100 pF1KB4 S--CKSCGAHFANTARKQTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELMK . ::.:: :. .:..: ::::.:: .:: . .. : : :. .. ::::: .::. CCDS31 NIVCKACGLSFSVFRKKHVCCDCKKDFCSVCSV-LQENLRRCSTCHLLQETAFQRPQLMR 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB4 MKVKDLRDYLSLHDISTEMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFLT .::::::.:: :..: . :::::.:: ::: .. . :..: .:.:. . . ..:.: CCDS31 LKVKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMD-TSSLNSSRSQTSSFFT 120 130 140 150 160 170 170 180 190 200 210 pF1KB4 QPHSSMVPPTSPNLPSSSAQ---------ATSVPPAQVQENQQANGHVSQDQEEPVYLES . : :.:. :: : . : ::::: .. ..... .:... .. CCDS31 RSFFSNY--TAPSATMSSFQGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDDD 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB4 VARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCE . :::.. ....: :::::::..:.:.::..:::::::::::::::.:::: CCDS31 DEEENAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCE 240 250 260 270 280 280 290 300 310 320 330 pF1KB4 KWELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGHM ::::.:.:.::::... :. : . : . . ...::.::::. ::::::::::: CCDS31 KWELVEKVNRLYKENEENQKSY-GERLQ----LQDEEDDSLCRICMDAVIDCVLLECGHM 290 300 310 320 330 340 340 350 360 pF1KB4 VTCTKCGKRMNECPICRQYVIRAVHVFRS ::::::::::.:::::::::.::::::.: CCDS31 VTCTKCGKRMSECPICRQYVVRAVHVFKS 350 360 370 >>CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12 (373 aa) initn: 939 init1: 432 opt: 981 Z-score: 793.8 bits: 155.5 E(32554): 6.8e-38 Smith-Waterman score: 981; 44.2% identity (69.1% similar) in 382 aa overlap (1-363:9-373) 10 20 30 40 pF1KB4 MWATCCNWF-------CLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLE- :::.::. . . :: : .. :: .:..: :.. : CCDS92 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGP-FRFTPNPEFSTYP-PAATEG 10 20 30 40 50 50 60 70 80 90 100 pF1KB4 PS--CKSCGAHFANTARKQTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELM :. ::.:: :. .:..: ::::.:: .:: . .. : : :. .. ::::: .:: CCDS92 PNIVCKACGLSFSVFRKKHVCCDCKKDFCSVCSV-LQENLRRCSTCHLLQETAFQRPQLM 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB4 KMKVKDLRDYLSLHDISTEMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFL ..::::::.:: :..: . :::::.:: ::: .. . :..: .:.:. . . ..:. CCDS92 RLKVKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMD-TSSLNSSRSQTSSFF 120 130 140 150 160 170 170 180 190 200 210 pF1KB4 TQPHSSMVPPTSPNLPSSSAQ---------ATSVPPAQVQENQQANGHVSQDQEEPVYLE :. : :.:. :: : . : ::::: .. ..... .:... . CCDS92 TRSFFSNY--TAPSATMSSFQGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDD 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB4 SVARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCC . . :::.. ....: :::::::..:.:.::..:::::::::::::::.::: CCDS92 DDEEENAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCC 240 250 260 270 280 280 290 300 310 320 330 pF1KB4 EKWELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGH :::::.:.:.::::... :. : . : . . ...::.::::. :::::::::: CCDS92 EKWELVEKVNRLYKENEENQKSY-GERLQ----LQDEEDDSLCRICMDAVIDCVLLECGH 290 300 310 320 330 340 340 350 360 pF1KB4 MVTCTKCGKRMNECPICRQYVIRAVHVFRS :::::::::::.:::::::::.::::::.: CCDS92 MVTCTKCGKRMSECPICRQYVVRAVHVFKS 350 360 370 >>CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12 (180 aa) initn: 636 init1: 339 opt: 607 Z-score: 500.7 bits: 100.2 E(32554): 1.4e-21 Smith-Waterman score: 621; 54.9% identity (80.6% similar) in 175 aa overlap (189-363:17-180) 160 170 180 190 200 210 pF1KB4 QAFLTQPHSSMVPPTSPNLPSSSAQATSVPPAQVQENQQANGHVSQDQEEPVYLESVARV ::::: .. ..... .:... .. . CCDS73 MKGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDDDDEEE 10 20 30 40 220 230 240 250 260 270 pF1KB4 PAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEKWEL :::.. ....: :::::::..:.:.::..:::::::::::::::.:::::::: CCDS73 NAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWEL 50 60 70 80 90 100 280 290 300 310 320 330 pF1KB4 MERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMVTCT .:.:.::::... :. : . : . . ...::.::::. ::::::::::::::: CCDS73 VEKVNRLYKENEENQK-SYGERLQ----LQDEEDDSLCRICMDAVIDCVLLECGHMVTCT 110 120 130 140 150 340 350 360 pF1KB4 KCGKRMNECPICRQYVIRAVHVFRS ::::::.:::::::::.::::::.: CCDS73 KCGKRMSECPICRQYVVRAVHVFKS 160 170 180 363 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 05:33:55 2016 done: Sat Nov 5 05:33:55 2016 Total Scan time: 3.080 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]