FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB4735, 384 aa 1>>>pF1KB4735 384 - 384 aa - 384 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1952+/-0.000847; mu= 16.7417+/- 0.051 mean_var=63.9875+/-12.721, 0's: 0 Z-trim(106.6): 23 B-trim: 43 in 1/49 Lambda= 0.160334 statistics sampled from 9047 (9069) to 9047 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.66), E-opt: 0.2 (0.279), width: 16 Scan time: 1.940 The best scores are: opt bits E(32554) CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 ( 384) 2801 656.7 9.7e-189 CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 ( 382) 1471 349.1 3.9e-96 CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 ( 373) 1258 299.8 2.6e-81 CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 ( 199) 820 198.3 4.9e-51 CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 ( 357) 524 130.0 3.2e-30 CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 ( 190) 513 127.3 1.1e-29 CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 ( 386) 511 127.0 2.8e-29 CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 ( 198) 402 101.7 6.2e-22 CCDS33649.1 APOBEC3F gene_id:200316|Hs108|chr22 ( 101) 397 100.3 7.9e-22 CCDS13985.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 183) 349 89.4 2.9e-18 CCDS54531.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 182) 344 88.2 6.3e-18 CCDS54530.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 200) 344 88.2 6.9e-18 CCDS81662.1 AICDA gene_id:57379|Hs108|chr12 ( 188) 330 85.0 6.1e-17 CCDS8579.1 APOBEC1 gene_id:339|Hs108|chr12 ( 236) 307 79.7 3e-15 CCDS4848.1 APOBEC2 gene_id:10930|Hs108|chr6 ( 224) 294 76.7 2.3e-14 CCDS54532.1 APOBEC3H gene_id:164668|Hs108|chr22 ( 154) 250 66.4 1.9e-11 >>CCDS13984.1 APOBEC3G gene_id:60489|Hs108|chr22 (384 aa) initn: 2801 init1: 2801 opt: 2801 Z-score: 3502.0 bits: 656.7 E(32554): 9.7e-189 Smith-Waterman score: 2801; 100.0% identity (100.0% similar) in 384 aa overlap (1-384:1-384) 10 20 30 40 50 60 pF1KB4 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTKGPSRPPLDAKIFRGQVYS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTKGPSRPPLDAKIFRGQVYS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 ELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 ELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIFV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 ARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 ARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLPK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 YYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 YYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRRG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 FLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWSPCFSCAQEMAKFIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 FLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWSPCFSCAQEMAKFIS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 KNKHVSLCIFTARIYDDQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDTFVDHQGCPFQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 KNKHVSLCIFTARIYDDQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDTFVDHQGCPFQP 310 320 330 340 350 360 370 380 pF1KB4 WDGLDEHSQDLSGRLRAILQNQEN :::::::::::::::::::::::: CCDS13 WDGLDEHSQDLSGRLRAILQNQEN 370 380 >>CCDS13982.1 APOBEC3B gene_id:9582|Hs108|chr22 (382 aa) initn: 1336 init1: 572 opt: 1471 Z-score: 1839.4 bits: 349.1 E(32554): 3.9e-96 Smith-Waterman score: 1471; 56.6% identity (75.2% similar) in 387 aa overlap (1-384:1-382) 10 20 30 40 50 pF1KB4 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTK-GPSRPPLDAKIFRGQVY :.:..:: .:::::::: :: :.::: :. .::::::: : : : :. .:::::: CCDS13 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGQVY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB4 SELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIF . .:: :: :. :: .: . ...::..::.:: :. .: ::.: :.::::: CCDS13 FKPQYHAEMCFLSWFCG-NQLPAYKCFQITWFVSWTPCPDCVAKLAEFLSEHPNVTLTIS 70 80 90 100 110 120 130 140 150 160 170 pF1KB4 VARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLP .:::::.:. ::..:: : : : :.: ::.:.:: .:: .:::.. . : :: .. CCDS13 AARLYYYWERDYRRALCRLSQA--GARVT--IMDYEEFAYCWENFVYNEGQQFMPWYKFD 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB4 KYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRR . : .:: : ::::. ::: ::::::::.: : :..:::::::::. : ::::..:. CCDS13 ENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVERLDNGTWVLMDQHM 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB4 GFLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWSPCFS--CAQEMAK :::::.: . . :::::: :::..: .:: : :::: : ::::::: :: :. CCDS13 GFLCNEAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFISWSPCFSWGCAGEVRA 240 250 260 270 280 290 300 310 320 330 340 350 pF1KB4 FISKNKHVSLCIFTARIYDDQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDTFVDHQGCP :...: :: : ::.::::: . .:.:. : .:::..:::::.::..:::::: .:::: CCDS13 FLQENTHVRLRIFAARIYDYDPLYKEALQMLRDAGAQVSIMTYDEFEYCWDTFVYRQGCP 300 310 320 330 340 350 360 370 380 pF1KB4 FQPWDGLDEHSQDLSGRLRAILQNQEN :::::::.:::: :::::::::::: : CCDS13 FQPWDGLEEHSQALSGRLRAILQNQGN 360 370 380 >>CCDS33648.1 APOBEC3F gene_id:200316|Hs108|chr22 (373 aa) initn: 1306 init1: 460 opt: 1258 Z-score: 1573.2 bits: 299.8 E(32554): 2.6e-81 Smith-Waterman score: 1258; 50.8% identity (72.8% similar) in 386 aa overlap (1-380:1-373) 10 20 30 40 50 60 pF1KB4 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTKGPSRPPLDAKIFRGQVYS ::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::: CCDS33 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTKGPSRPRLDAKIFRGQVYS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 ELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIFV . ..: :: :. :: .: . ...::..::.:: :. .: :::: :.::::: . CCDS33 QPEHHAEMCFLSWFCG-NQLPAYKCFQITWFVSWTPCPDCVAKLAEFLAEHPNVTLTISA 70 80 90 100 110 130 140 150 160 170 180 pF1KB4 ARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLPK :::::.:. ::..:: : : : : .:::. .:: .:: .::::. . : :: .. CCDS33 ARLYYYWERDYRRALCRLSQA--GAR--VKIMDDEEFAYCWENFVYSEGQPFMPWYKFDD 120 130 140 150 160 170 190 200 210 220 230 pF1KB4 YYILLHIMLGEILRHSMD---PPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQ : .:: : ::::. :. : : :.:.: . ::.:..::. .: ... . : . CCDS33 NYAFLHRTLKEILRNPMEAMYPHIFYFHFKNLRKAYGRNESWLCFTMEVVKHHSPV--SW 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB4 RRGFLCNQA-PHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWSPCFSCAQEMA .:: . ::. :. : ::: :::. . :. . .:.:: .:::::: :: :.: CCDS33 KRGVFRNQVDPETHC-----HAERCFLSWFCDDILSPNTNYEVTWYTSWSPCPECAGEVA 240 250 260 270 280 300 310 320 330 340 350 pF1KB4 KFISKNKHVSLCIFTARIYD--DQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDTFVDHQ .:......:.: :::::.: : :::::.:.. ::.. :: :..::.::..:: .. CCDS33 EFLARHSNVNLTIFTARLYYFWDTDY-QEGLRSLSQEGASVEIMGYKDFKYCWENFVYND 290 300 310 320 330 340 360 370 380 pF1KB4 GCPFQPWDGLDEHSQDLSGRLRAILQNQEN ::.:: :: . :...:. ::. CCDS33 DEPFKPWKGLKYNFLFLDSKLQEILE 350 360 370 >>CCDS13981.1 APOBEC3A gene_id:200315|Hs108|chr22 (199 aa) initn: 828 init1: 471 opt: 820 Z-score: 1029.8 bits: 198.3 E(32554): 4.9e-51 Smith-Waterman score: 820; 65.3% identity (77.2% similar) in 193 aa overlap (194-384:10-199) 170 180 190 200 210 220 pF1KB4 FVYSQRELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYE :: ::: :: :::: :::.:::::: CCDS13 MEASPASGPRHLMDPHIFTSNFNNGI---GRHKTYLCYE 10 20 30 230 240 250 260 270 280 pF1KB4 VERMHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFT :::. : : : ..:.:::: ::: . . :::::: :::..: .:: : :::: : CCDS13 VERLDNGTSVKMDQHRGFLHNQAKNLLCGFYGRHAELRFLDLVPSLQLDPAQIYRVTWFI 40 50 60 70 80 90 290 300 310 320 330 340 pF1KB4 SWSPCFS--CAQEMAKFISKNKHVSLCIFTARIYDDQGRCQEGLRTLAEAGAKISIMTYS ::::::: :: :. :...: :: : ::.::::: . .:.:. : .:::..:::::. CCDS13 SWSPCFSWGCAGEVRAFLQENTHVRLRIFAARIYDYDPLYKEALQMLRDAGAQVSIMTYD 100 110 120 130 140 150 350 360 370 380 pF1KB4 EFKHCWDTFVDHQGCPFQPWDGLDEHSQDLSGRLRAILQNQEN :::::::::::::::::::::::::::: :::::::::::: : CCDS13 EFKHCWDTFVDHQGCPFQPWDGLDEHSQALSGRLRAILQNQGN 160 170 180 190 >>CCDS58807.1 APOBEC3B gene_id:9582|Hs108|chr22 (357 aa) initn: 1252 init1: 431 opt: 524 Z-score: 655.9 bits: 130.0 E(32554): 3.2e-30 Smith-Waterman score: 1350; 53.7% identity (71.1% similar) in 387 aa overlap (1-384:1-357) 10 20 30 40 50 pF1KB4 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTK-GPSRPPLDAKIFRGQVY :.:..:: .:::::::: :: :.::: :. .::::::: : : : :. .:::::: CCDS58 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGQVY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB4 SELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIF . .:: :: :. :: .: . ...::..::.:: :. .: ::.: :.::::: CCDS58 FKPQYHAEMCFLSWFCG-NQLPAYKCFQITWFVSWTPCPDCVAKLAEFLSEHPNVTLTIS 70 80 90 100 110 120 130 140 150 160 170 pF1KB4 VARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLP .:::::.:. ::..:: : : : :.: ::.:.:: .:: .:::.. . : :: .. CCDS58 AARLYYYWERDYRRALCRLSQA--GARVT--IMDYEEFAYCWENFVYNEGQQFMPWYKFD 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB4 KYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRR . : .:: : ::::. ::: ::::::::.: : :..:::::::::. : ::::..:. CCDS58 ENYAFLHRTLKEILRYLMDPDTFTFNFNNDPLVLRRRQTYLCYEVERLDNGTWVLMDQHM 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB4 GFLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWSPCFS--CAQEMAK :::::. :: : :::: : ::::::: :: :. CCDS58 GFLCNE-------------------------LDPAQIYRVTWFISWSPCFSWGCAGEVRA 240 250 260 270 300 310 320 330 340 350 pF1KB4 FISKNKHVSLCIFTARIYDDQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDTFVDHQGCP :...: :: : ::.::::: . .:.:. : .:::..:::::.::..:::::: .:::: CCDS58 FLQENTHVRLRIFAARIYDYDPLYKEALQMLRDAGAQVSIMTYDEFEYCWDTFVYRQGCP 280 290 300 310 320 330 360 370 380 pF1KB4 FQPWDGLDEHSQDLSGRLRAILQNQEN :::::::.:::: :::::::::::: : CCDS58 FQPWDGLEEHSQALSGRLRAILQNQGN 340 350 >>CCDS13983.1 APOBEC3C gene_id:27350|Hs108|chr22 (190 aa) initn: 478 init1: 275 opt: 513 Z-score: 646.3 bits: 127.3 E(32554): 1.1e-29 Smith-Waterman score: 513; 43.6% identity (67.2% similar) in 195 aa overlap (1-194:1-190) 10 20 30 40 50 pF1KB4 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVK-TKGPSRPPLDAKIFRGQVY :.:..:: .. :: :: ..: : . :: .:::. :. : : . .::.:: CCDS13 MNPQIRNPMKAMYPGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSVVSWKTGVFRNQVD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB4 SELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIF :: . : : :. :: : . .:.:::: ::::: :. ..: :::. .:.:::: CCDS13 SETHCHAERCFLSWFCD-DILSPNTKYQVTWYTSWSPCPDCAGEVAEFLARHSNVNLTIF 70 80 90 100 110 120 130 140 150 160 170 pF1KB4 VARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLP .:::::: : :::.:::: : .: ....::.:..:..:: .:::.. : :.::..: CCDS13 TARLYYFQYPCYQEGLRSLSQ--EG--VAVEIMDYEDFKYCWENFVYNDNEPFKPWKGLK 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB4 KYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRR . ::. : : :. CCDS13 TNFRLLKRRLRESLQ 180 190 >>CCDS46709.1 APOBEC3D gene_id:140564|Hs108|chr22 (386 aa) initn: 845 init1: 296 opt: 511 Z-score: 639.2 bits: 127.0 E(32554): 2.8e-29 Smith-Waterman score: 531; 40.2% identity (65.9% similar) in 229 aa overlap (157-380:165-386) 130 140 150 160 170 180 pF1KB4 WDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYSQRELFEPWNNLPKYYILLH : .:: .:: .. . : :: .. : :: CCDS46 LYYYRDRDWRWVLLRLHKAGARVKIMDYEDFAYCWENFVCNEGQPFMPWYKFDDNYASLH 140 150 160 170 180 190 190 200 210 220 230 240 pF1KB4 IMLGEILRHSMD---PPTFTFNFNNEPWVRGRHETYLCYEVERMHNDTWVLLNQRRGFLC : ::::. :. : : :.:.: . ::.:..::. .: .. . :. ..:: . CCDS46 RTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTMEVTKHHSAVF--RKRGVFR 200 210 220 230 240 250 250 260 270 280 290 300 pF1KB4 NQA-PHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWSPCFSCAQEMAKFISKN ::. :. : ::: :::. . :. . .:.:: .:::::: :: :.:.:.... CCDS46 NQVDPETHC-----HAERCFLSWFCDDILSPNTNYEVTWYTSWSPCPECAGEVAEFLARH 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 KHVSLCIFTARI-YDDQGRCQEGLRTLAEAGAKISIMTYSEFKHCWDTFVDHQGCPFQPW ..:.: :::::. : . :::: .:.. ::...:: :..: :: .:: . ::.:: CCDS46 SNVNLTIFTARLCYFWDTDYQEGLCSLSQEGASVKIMGYKDFVSCWKNFVYSDDEPFKPW 310 320 330 340 350 360 370 380 pF1KB4 DGLDEHSQDLSGRLRAILQNQEN ::. . . :. ::: ::: CCDS46 KGLQTNFRLLKRRLREILQ 370 380 >-- initn: 394 init1: 187 opt: 443 Z-score: 554.2 bits: 111.3 E(32554): 1.5e-24 Smith-Waterman score: 443; 44.4% identity (66.9% similar) in 169 aa overlap (1-156:1-164) 10 20 30 40 50 pF1KB4 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTK-GPSRPPLDAKIFRG--- :.:..:: .:::::::: :: :.::: :. .::::::: : : : :. .::: CCDS46 MNPQIRNPMERMYRDTFYDNFENEPILYGRSYTWLCYEVKIKRGRSNLLWDTGVFRGPVL 10 20 30 40 50 60 60 70 80 90 100 pF1KB4 ---------QVYSELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATF .:: ... : :: :. :: : : ......::..::.:: :. .. : CCDS46 PKRQSNHRQEVYFRFENHAEMCFLSWFCGNR-LPANRRFQITWFVSWNPCLPCVVKVTKF 70 80 90 100 110 110 120 130 140 150 160 pF1KB4 LAEDPKVTLTIFVARLYYFWDPDYQEALRSLCQKRDGPRATMKIMNYDEFQHCWSKFVYS ::: :.::::: .:::::. : :.. .: : .. : : .:::.:.. CCDS46 LAEHPNVTLTISAARLYYYRDRDWRWVLLRL--HKAGAR--VKIMDYEDFAYCWENFVCN 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB4 QRELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVERM CCDS46 EGQPFMPWYKFDDNYASLHRTLKEILRNPMEAMYPHIFYFHFKNLLKACGRNESWLCFTM 180 190 200 210 220 230 >>CCDS41747.1 AICDA gene_id:57379|Hs108|chr12 (198 aa) initn: 459 init1: 207 opt: 402 Z-score: 507.2 bits: 101.7 E(32554): 6.2e-22 Smith-Waterman score: 491; 44.3% identity (68.6% similar) in 185 aa overlap (197-379:6-180) 170 180 190 200 210 220 pF1KB4 SQRELFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRHETYLCYEVER :. : ..:.: :..::.:::::: :.: CCDS41 MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKR 10 20 30 230 240 250 260 270 280 pF1KB4 MHNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWS :. . .. :.: :. .: :.:: :: : : :: . :::: ::::: CCDS41 --RDSATSFSLDFGYLRNK--------NGCHVELLFLRYISDWDLDPGRCYRVTWFTSWS 40 50 60 70 80 290 300 310 320 330 340 pF1KB4 PCFSCAQEMAKFISKNKHVSLCIFTARIY--DDQGRCQEGLRTLAEAGAKISIMTYSEFK ::..::...: :. : ..:: :::::.: .:. :::: : .::..:.:::.... CCDS41 PCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYF 90 100 110 120 130 140 350 360 370 380 pF1KB4 HCWDTFVDHQGCPFQPWDGLDEHSQDLSGRLRAILQNQEN .::.:::... :. :.:: :.: :: .:: :: CCDS41 YCWNTFVENHERTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL 150 160 170 180 190 >>CCDS33649.1 APOBEC3F gene_id:200316|Hs108|chr22 (101 aa) initn: 397 init1: 397 opt: 397 Z-score: 505.4 bits: 100.3 E(32554): 7.9e-22 Smith-Waterman score: 397; 98.3% identity (98.3% similar) in 58 aa overlap (1-58:1-58) 10 20 30 40 50 60 pF1KB4 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTKGPSRPPLDAKIFRGQVYS ::::::::::::::::::::::::::::::::::::::::::::::: :::::::::: CCDS33 MKPHFRNTVERMYRDTFSYNFYNRPILSRRNTVWLCYEVKTKGPSRPRLDAKIFRGQVPR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 ELKYHPEMRFFHWFSKWRKLHRDQEYEVTWYISWSPCTKCTRDMATFLAEDPKVTLTIFV CCDS33 SFIRAPFQVLSSPFGQCAPPHGTAQVQWPPQLTAGREQGRP 70 80 90 100 >>CCDS13985.1 APOBEC3H gene_id:164668|Hs108|chr22 (183 aa) initn: 352 init1: 219 opt: 349 Z-score: 441.5 bits: 89.4 E(32554): 2.9e-18 Smith-Waterman score: 363; 33.9% identity (60.8% similar) in 189 aa overlap (201-381:8-183) 180 190 200 210 220 pF1KB4 LFEPWNNLPKYYILLHIMLGEILRHSMDPPTFTFNFNNEPWVRGRH---ETYLCYEVERM :: ..:::. .: . .. :::.. . CCDS13 MALLTAETFRLQFNNKRRLRRPYYPRKALLCYQLTPQ 10 20 30 230 240 250 260 270 280 pF1KB4 HNDTWVLLNQRRGFLCNQAPHKHGFLEGRHAELCFLDVIPFWKLDLDQDYRVTCFTSWSP ...: ::.. :. :::.::.. : :: : :.:::. .::: CCDS13 NGST-----PTRGYFENKKKC--------HAEICFINEIKSMGLDETQCYQVTCYLTWSP 40 50 60 70 80 290 300 310 320 330 340 pF1KB4 CFSCAQEMAKFISKNKHVSLCIFTARIYDDQGRCQE-GLRTLAEAGAKISIMTYSEFKHC : ::: :.. ::. . :..: ::..:.: . :. ::: : . . . .: . :: : CCDS13 CSSCAWELVDFIKAHDHLNLGIFASRLYYHWCKPQQKGLRLLCGSQVPVEVMGFPEFADC 90 100 110 120 130 140 350 360 370 380 pF1KB4 WDTFVDHQG----CPFQPWDGLDEHSQDLSGRLRAILQNQEN :..::::. :.. . ::..:. .. ::. : :. CCDS13 WENFVDHEKPLSFNPYKMLEELDKNSRAIKRRLERIKQS 150 160 170 180 384 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 03:12:07 2016 done: Fri Nov 4 03:12:07 2016 Total Scan time: 1.940 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]