FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7033, 424 aa
  1>>>pF1KB7033 424 - 424 aa - 424 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.0808+/-0.00078; mu= 13.8669+/- 0.047
 mean_var=81.1640+/-16.524, 0's: 0 Z-trim(109.2): 9 B-trim: 192 in 1/49
 Lambda= 0.142361
 statistics sampled from 10692 (10698) to 10692 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.686), E-opt: 0.2 (0.329), width: 16
 Scan time: 3.300

The best scores are:                                 opt  bits E(32554)
CCDS294.2 FAM46B gene_id:115572|Hs108|chr1   ( 425) 2811 586.8 1.3e-167
CCDS34489.1 FAM46A gene_id:55603|Hs108|chr6  ( 442) 1654 349.2 4.6e-96
CCDS896.1 FAM46C gene_id:54855|Hs108|chr1    ( 391) 1586 335.2 6.6e-92
CCDS14446.1 FAM46D gene_id:169966|Hs108|chrX ( 389) 1281 272.6 4.8e-73

>>CCDS294.2 FAM46B gene_id:115572|Hs108|chr1 (425 aa)
 initn: 2811 init1: 2811 opt: 2811 Z-score: 3122.6 bits: 586.8 E(32554): 1.3e-167
Smith-Waterman score: 2811; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:2-425)

                10        20        30        40        50
pF1KB7  MPSESGAERRDRAAAQVGTAAATAVATAAPAGGGPDPEALSAFPGRHLSGLSWPQVKRL
        :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 MMPSESGAERRDRAAAQVGTAAATAVATAAPAGGGPDPEALSAFPGRHLSGLSWPQVKRL
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KB7 DALLSEPIPIHGRGNFPTLSVQPRQIVQVVRSTLEEQGLHVHSVRLHGSAASHVLHPESG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 DALLSEPIPIHGRGNFPTLSVQPRQIVQVVRSTLEEQGLHVHSVRLHGSAASHVLHPESG
               70        80        90       100       110       120

     120       130       140       150       160       170
pF1KB7 LGYKDLDLVFRVDLRSEASFQLTKAVVLACLLDFLPAGVSRAKITPLTLKEAYVQKLVKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 LGYKDLDLVFRVDLRSEASFQLTKAVVLACLLDFLPAGVSRAKITPLTLKEAYVQKLVKV
              130       140       150       160       170       180

     180       190       200       210       220       230
pF1KB7 CTDSDRWSLISLSNKSGKNVELKFVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 CTDSDRWSLISLSNKSGKNVELKFVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEA
              190       200       210       220       230       240

     240       250       260       270       280       290
pF1KB7 FHPTVTGESLYGDFTEALEHLRHRVIATRSPEEIRGGGLLKYCHLLVRGFRPRPSTDVRA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 FHPTVTGESLYGDFTEALEHLRHRVIATRSPEEIRGGGLLKYCHLLVRGFRPRPSTDVRA
              250       260       270       280       290       300

     300       310       320       330       340       350
pF1KB7 LQRYMCSRFFIDFPDLVEQRRTLERYLEAHFGGADAARRYACLVTLHRVVNESTVCLMNH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 LQRYMCSRFFIDFPDLVEQRRTLERYLEAHFGGADAARRYACLVTLHRVVNESTVCLMNH
              310       320       330       340       350       360

     360       370       380       390       400       410
pF1KB7 ERRQTLDLIAALALQALAEQGPAATAALAWRPPGTDGVVPATVNYYVTPVQPLLAHAYPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS29 ERRQTLDLIAALALQALAEQGPAATAALAWRPPGTDGVVPATVNYYVTPVQPLLAHAYPT
              370       380       390       400       410       420

     420
pF1KB7 WLPCN
       :::::
CCDS29 WLPCN

>>CCDS34489.1 FAM46A gene_id:55603|Hs108|chr6 (442 aa)
 initn: 1599 init1: 1219 opt: 1654 Z-score: 1838.1 bits: 349.2 E(32554): 4.6e-96
Smith-Waterman score: 1654; 65.3% identity (83.2% similar) in 386 aa overlap (44-424:59-442)

            20        30        40        50        60        70
pF1KB7 AAQVGTAAATAVATAAPAGGGPDPEALSAFPGRHLSGLSWPQVKRLDALLSEPIPIHGRG
                                     :  : . :.: ::.:::..::: :::::::
CCDS34 GGDFGGGDFGGGDFGGGGSFGGHCLDYCESPTAHCNVLNWEQVQRLDGILSETIPIHGRG
       30        40        50        60        70        80

            80        90       100       110       120       130
pF1KB7 NFPTLSVQPRQIVQVVRSTLEEQGLHVHSVRLHGSAASHVLHPESGLGYKDLDLVFRVDL
       ::::: .::  ::.:::  : :. . :..:::.::::::::: .::::::::::.: .::
CCDS34 NFPTLELQPSLIVKVVRRRLAEKRIGVRDVRLNGSAASHVLHQDSGLGYKDLDLIFCADL
       90       100       110       120       130       140

           140       150       160       170       180       190
pF1KB7 RSEASFQLTKAVVLACLLDFLPAGVSRAKITPLTLKEAYVQKLVKVCTDSDRWSLISLSN
       :.:. :: .: ::: ::::::: ::.. ::::::::::::::.::::.::::::::::::
CCDS34 RGEGEFQTVKDVVLDCLLDFLPEGVNKEKITPLTLKEAYVQKMVKVCNDSDRWSLISLSN
      150       160       170       180       190       200

           200       210       220       230       240       250
pF1KB7 KSGKNVELKFVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEAFHPTVTGESLYGDF
       .::::::::::::.:::::::.::::: ::::::: .:: .::.:.::::. :::.::::
CCDS34 NSGKNVELKFVDSLRRQFEFSVDSFQIKLDSLLLFYECSENPMTETFHPTIIGESVYGDF
      210       220       230       240       250       260

           260       270       280       290       300       310
pF1KB7 TEALEHLRHRVIATRSPEEIRGGGLLKYCHLLVRGFRPRPSTDVRALQRYMCSRFFIDFP
        ::..:: ...::::.:::::::::::::.::::::::  : ....:::::::::::::
CCDS34 QEAFDHLCNKIIATRNPEEIRGGGLLKYCNLLVRGFRP-ASDEIKTLQRYMCSRFFIDFS
      270       280       290       300        310       320

           320       330       340       350       360       370
pF1KB7 DLVEQRRTLERYLEAHFGGADAARRYACLVTLHRVVNESTVCLMNHERRQTLDLIAALAL
       :. ::.: :: ::. :: : .  :.:  :.::: ::::::::::.:::::::.::. ::.
CCDS34 DIGEQQRKLESYLQNHFVGLED-RKYEYLMTLHGVVNESTVCLMGHERRQTLNLITMLAI
       330       340        350       360       370       380

           380         390       400       410          420
pF1KB7 QALAEQG--PAATAALAWRPPGTDGVVPATVNYYVTPVQPLLA---HAYPTWLPCN
       ..::.:.  : .. .  .  :.   .     :::.. :::...   ..: ::::::
CCDS34 RVLADQNVIPNVANVTCYYQPAPYVADANFSNYYIAQVQPVFTCQQQTYSTWLPCN
        390       400       410       420       430       440

>>CCDS896.1 FAM46C gene_id:54855|Hs108|chr1 (391 aa)
 initn: 1494 init1: 1331 opt: 1586 Z-score: 1763.4 bits: 335.2 E(32554): 6.6e-92
Smith-Waterman score: 1586; 64.0% identity (83.0% similar) in 383 aa overlap (48-424:14-391)

        20        30        40        50        60        70
pF1KB7 GTAAATAVATAAPAGGGPDPEALSAFPGRHLSGLSWPQVKRLDALLSEPIPIHGRGNFPT
                                     .: :.: ::.::  .:.: .::::::::::
CCDS89                  MAEESSCTRDCMSFSVLNWDQVSRLHEVLTEVVPIHGRGNFPT
                                10        20        30        40

        80        90       100       110       120       130
pF1KB7 LSVQPRQIVQVVRSTLEEQGLHVHSVRLHGSAASHVLHPESGLGYKDLDLVFRVDLRSEA
       : .  ..:::.::: ::: :..::.:::.::::.:::  ..::: :::::.:.: : .::
CCDS89 LEITLKDIVQTVRSRLEEAGIKVHDVRLNGSAAGHVLVKDNGLGCKDLDLIFHVALPTEA
            50        60        70        80        90       100

       140       150       160       170       180       190
pF1KB7 SFQLTKAVVLACLLDFLPAGVSRAKITPLTLKEAYVQKLVKVCTDSDRWSLISLSNKSGK
        :::.. :::  ::.::: ::.. ::.:.::::::::::::::::.:::::::::::.::
CCDS89 EFQLVRDVVLCSLLNFLPEGVNKLKISPVTLKEAYVQKLVKVCTDTDRWSLISLSNKNGK
           110       120       130       140       150       160

       200       210       220       230       240       250
pF1KB7 NVELKFVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEAFHPTVTGESLYGDFTEAL
       :::::::::.:::::::.:::::::::::.: .::..:.:: ::::: :::.:::: ::.
CCDS89 NVELKFVDSIRRQFEFSVDSFQIILDSLLFFYDCSNNPISEHFHPTVIGESMYGDFEEAF
           170       180       190       200       210       220

       260       270       280       290       300       310
pF1KB7 EHLRHRVIATRSPEEIRGGGLLKYCHLLVRGFRPRPSTDVRALQRYMCSRFFIDFPDLVE
       .::..:.:::..:::::::::::: .:::: :::  . ....:.:::::::::::::..:
CCDS89 DHLQNRLIATKNPEEIRGGGLLKYSNLLVRDFRPTDQEEIKTLERYMCSRFFIDFPDILE
           230       240       250       260       270       280

       320       330        340       350       360       370
pF1KB7 QRRTLERYLEAHFGGADAAR-RYACLVTLHRVVNESTVCLMNHERRQTLDLIAALALQAL
       :.: :: ::. ::  :.  : .:  :. :.:::::::::::.:::::::.::. :::..:
CCDS89 QQRKLETYLQNHF--AEEERSKYDYLMILRRVVNESTVCLMGHERRQTLNLISLLALRVL
           290         300       310       320       330       340

        380         390          400       410       420
pF1KB7 AEQG--PAATAALAWRPPG---TDGVVPATVNYYVTPVQPLLAHAYPTWLPCN
       :::.  :.:: .  .  :.   .::      ::::.      .. ::::::::
CCDS89 AEQNIIPSATNVTCYYQPAPYVSDG---NFSNYYVAHPPVTYSQPYPTWLPCN
             350       360          370       380       390

>>CCDS14446.1 FAM46D gene_id:169966|Hs108|chrX (389 aa)
 initn: 1226 init1: 1092 opt: 1281 Z-score: 1424.9 bits: 272.6 E(32554): 4.8e-73
Smith-Waterman score: 1281; 52.5% identity (79.0% similar) in 366 aa overlap (48-411:6-370)

        20        30        40        50        60        70
pF1KB7 GTAAATAVATAAPAGGGPDPEALSAFPGRHLSGLSWPQVKRLDALLSEPIPIHGRGNFPT
                                     ...:.: ::  :: .:.: :::::.:::::
CCDS14                          MSEIRFTNLTWDQVITLDQVLDEVIPIHGKGNFPT
                                        10        20        30

        80        90       100       110       120       130
pF1KB7 LSVQPRQIVQVVRSTLEEQGLHVHSVRLHGSAASHVLHPESGLGYKDLDLVFRVDLRSEA
       . :.:..:..::.. :  ::. :...::.::.::..:  ..:..:::::..: :.: .
CCDS14 MEVKPKDIIHVVKDQLIGQGIIVKDARLNGSVASYILASHNGISYKDLDVIFGVELPGNE
          40        50        60        70        80        90

       140       150       160       170       180       190
pF1KB7 SFQLTKAVVLACLLDFLPAGVSRAKITPLTLKEAYVQKLVKVCTDSDRWSLISLSNKSGK
        ::..: .:: :::::::  :.. :..:  .:.::::::::::.  : ::::::::..::
CCDS14 EFQVVKDAVLDCLLDFLPKDVKKEKLSPDIMKDAYVQKLVKVCNGHDCWSLISLSNNTGK
         100       110       120       130       140       150

       200       210       220       230       240       250
pF1KB7 NVELKFVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEAFHPTVTGESLYGDFTEAL
       :.:::::.:.:::::::.:::::.:: .: :    .  ...  .:.:..::.:::: ::.
CCDS14 NLELKFVSSLRRQFEFSVDSFQIVLDPMLDFYSDKNAKLTKESYPVVVAESMYGDFQEAM
         160       170       180       190       200       210

       260       270       280       290       300       310
pF1KB7 EHLRHRVIATRSPEEIRGGGLLKYCHLLVRGFRPRPSTDVRALQRYMCSRFFIDFPDLVE
        ::.:..: ::.::::::::::::: :::.::.:   .... :.:::::::::::: . :
CCDS14 THLQHKLICTRKPEEIRGGGLLKYCSLLVHGFKPACMSEIKNLERYMCSRFFIDFPHIEE
         220       230       240       250       260       270

       320       330       340       350       360       370
pF1KB7 QRRTLERYLEAHFGGADAARRYACLVTLHRVVNESTVCLMNHERRQTLDLIAALALQALA
       :.. .: ::. :: : ..  .:  :.::: ::::::::::..:::: : ::. .::..:.
CCDS14 QQKKIESYLHNHFIG-EGMTKYDYLMTLHGVVNESTVCLMSYERRQILHLITMMALKVLG
         280       290        300       310       320       330

       380         390       400       410       420
pF1KB7 EQG--PAATAALAWRPPGTDGVVPATVNYYVTPVQPLLAHAYPTWLPCN
       : .  : .  .  .  :.   .. :    :: :  :
CCDS14 ELNILPNTQKVTCFYQPAPYFAAEARYPIYVIPEPPPVSFQPYHPLHFRGSNGMS
          340       350       360       370       380

424 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 04:08:22 2016 done: Fri Nov 4 04:08:23 2016
 Total Scan time: 3.300 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]