FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB4658, 806 aa 1>>>pF1KB4658 806 - 806 aa - 806 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.9642+/-0.00129; mu= 11.9831+/- 0.076 mean_var=138.5597+/-28.919, 0's: 0 Z-trim(105.0): 209 B-trim: 72 in 1/49 Lambda= 0.108957 statistics sampled from 7922 (8182) to 7922 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.611), E-opt: 0.2 (0.251), width: 16 Scan time: 3.750 The best scores are: opt bits E(32554) CCDS13967.1 PLA2G6 gene_id:8398|Hs108|chr22 ( 806) 5438 867.7 0 CCDS33645.1 PLA2G6 gene_id:8398|Hs108|chr22 ( 752) 2671 432.8 1.1e-120 CCDS33355.2 ANKRD44 gene_id:91526|Hs108|chr2 ( 367) 387 73.5 7.3e-13 CCDS44734.1 ANKK1 gene_id:255239|Hs108|chr11 ( 765) 350 67.9 7.2e-11 CCDS46769.1 ANKRD28 gene_id:23243|Hs108|chr3 (1053) 352 68.3 7.3e-11 >>CCDS13967.1 PLA2G6 gene_id:8398|Hs108|chr22 (806 aa) initn: 5438 init1: 5438 opt: 5438 Z-score: 4630.8 bits: 867.7 E(32554): 0 Smith-Waterman score: 5438; 100.0% identity (100.0% similar) in 806 aa overlap (1-806:1-806) 10 20 30 40 50 60 pF1KB4 MQFFGRLVNTFSGVTNLFSNPFRVKEVAVADYTSSDRVREEGQLILFQNTPNRTWDCVLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MQFFGRLVNTFSGVTNLFSNPFRVKEVAVADYTSSDRVREEGQLILFQNTPNRTWDCVLV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 NPRNSQSGFRLFQLELEADALVNFHQYSSQLLPFYESSPQVLHTEVLQHLTDLIRNHPSW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 NPRNSQSGFRLFQLELEADALVNFHQYSSQLLPFYESSPQVLHTEVLQHLTDLIRNHPSW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 SVAHLAVELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SVAHLAVELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 VTDYKGETVFHYAVQGDNSQVLQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VTDYKGETVFHYAVQGDNSQVLQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 LCNARCNIMGPNGYPIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAKNAEMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LCNARCNIMGPNGYPIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAKNAEMA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 RMLLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 RMLLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 DNVEMIKALIVFGAEVDTPNDFGETPTFLASKIGRLVTRKAILTLLRTVGAEYCFPPIHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 DNVEMIKALIVFGAEVDTPNDFGETPTFLASKIGRLVTRKAILTLLRTVGAEYCFPPIHG 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB4 VPAEQGSAAPHHPFSLERAQPPPISLNNLELQDLMHISRARKPAFILGSMRDEKRTHDHL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VPAEQGSAAPHHPFSLERAQPPPISLNNLELQDLMHISRARKPAFILGSMRDEKRTHDHL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB4 LCLDGGGVKGLIIIQLLIAIEKASGVATKDLFDWVAGTSTGGILALAILHSKSMAYMRGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LCLDGGGVKGLIIIQLLIAIEKASGVATKDLFDWVAGTSTGGILALAILHSKSMAYMRGM 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB4 YFRMKDEVFRGSRPYESGPLEEFLKREFGEHTKMTDVRKPKVMLTGTLSDRQPAELHLFR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 YFRMKDEVFRGSRPYESGPLEEFLKREFGEHTKMTDVRKPKVMLTGTLSDRQPAELHLFR 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB4 NYDAPETVREPRFNQNVNLRPPAQPSDQLVWRAARSSGAAPTYFRPNGRFLDGGLLANNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 NYDAPETVREPRFNQNVNLRPPAQPSDQLVWRAARSSGAAPTYFRPNGRFLDGGLLANNP 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB4 TLDAMTEIHEYNQDLIRKGQANKVKKLSIVVSLGTGRSPQVPVTCVDVFRPSNPWELAKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TLDAMTEIHEYNQDLIRKGQANKVKKLSIVVSLGTGRSPQVPVTCVDVFRPSNPWELAKT 670 680 690 700 710 720 730 740 750 760 770 780 pF1KB4 VFGAKELGKMVVDCCTDPDGRAVDRARAWCEMVGIQYFRLNPQLGTDIMLDEVSDTVLVN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VFGAKELGKMVVDCCTDPDGRAVDRARAWCEMVGIQYFRLNPQLGTDIMLDEVSDTVLVN 730 740 750 760 770 780 790 800 pF1KB4 ALWETEVYIYEHREEFQKLIQLLLSP :::::::::::::::::::::::::: CCDS13 ALWETEVYIYEHREEFQKLIQLLLSP 790 800 >>CCDS33645.1 PLA2G6 gene_id:8398|Hs108|chr22 (752 aa) initn: 5051 init1: 2671 opt: 2671 Z-score: 2280.5 bits: 432.8 E(32554): 1.1e-120 Smith-Waterman score: 4947; 93.2% identity (93.3% similar) in 806 aa overlap (1-806:1-752) 10 20 30 40 50 60 pF1KB4 MQFFGRLVNTFSGVTNLFSNPFRVKEVAVADYTSSDRVREEGQLILFQNTPNRTWDCVLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MQFFGRLVNTFSGVTNLFSNPFRVKEVAVADYTSSDRVREEGQLILFQNTPNRTWDCVLV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 NPRNSQSGFRLFQLELEADALVNFHQYSSQLLPFYESSPQVLHTEVLQHLTDLIRNHPSW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 NPRNSQSGFRLFQLELEADALVNFHQYSSQLLPFYESSPQVLHTEVLQHLTDLIRNHPSW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 SVAHLAVELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SVAHLAVELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 VTDYKGETVFHYAVQGDNSQVLQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VTDYKGETVFHYAVQGDNSQVLQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 LCNARCNIMGPNGYPIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAKNAEMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LCNARCNIMGPNGYPIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAKNAEMA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 RMLLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RMLLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 DNVEMIKALIVFGAEVDTPNDFGETPTFLASKIGRLVTRKAILTLLRTVGAEYCFPPIHG ::::::::::::::::::::::::::::::::::: CCDS33 DNVEMIKALIVFGAEVDTPNDFGETPTFLASKIGR------------------------- 370 380 390 430 440 450 460 470 480 pF1KB4 VPAEQGSAAPHHPFSLERAQPPPISLNNLELQDLMHISRARKPAFILGSMRDEKRTHDHL .:::::::::::::::::::::::::::::: CCDS33 -----------------------------QLQDLMHISRARKPAFILGSMRDEKRTHDHL 400 410 420 490 500 510 520 530 540 pF1KB4 LCLDGGGVKGLIIIQLLIAIEKASGVATKDLFDWVAGTSTGGILALAILHSKSMAYMRGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LCLDGGGVKGLIIIQLLIAIEKASGVATKDLFDWVAGTSTGGILALAILHSKSMAYMRGM 430 440 450 460 470 480 550 560 570 580 590 600 pF1KB4 YFRMKDEVFRGSRPYESGPLEEFLKREFGEHTKMTDVRKPKVMLTGTLSDRQPAELHLFR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 YFRMKDEVFRGSRPYESGPLEEFLKREFGEHTKMTDVRKPKVMLTGTLSDRQPAELHLFR 490 500 510 520 530 540 610 620 630 640 650 660 pF1KB4 NYDAPETVREPRFNQNVNLRPPAQPSDQLVWRAARSSGAAPTYFRPNGRFLDGGLLANNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 NYDAPETVREPRFNQNVNLRPPAQPSDQLVWRAARSSGAAPTYFRPNGRFLDGGLLANNP 550 560 570 580 590 600 670 680 690 700 710 720 pF1KB4 TLDAMTEIHEYNQDLIRKGQANKVKKLSIVVSLGTGRSPQVPVTCVDVFRPSNPWELAKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 TLDAMTEIHEYNQDLIRKGQANKVKKLSIVVSLGTGRSPQVPVTCVDVFRPSNPWELAKT 610 620 630 640 650 660 730 740 750 760 770 780 pF1KB4 VFGAKELGKMVVDCCTDPDGRAVDRARAWCEMVGIQYFRLNPQLGTDIMLDEVSDTVLVN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VFGAKELGKMVVDCCTDPDGRAVDRARAWCEMVGIQYFRLNPQLGTDIMLDEVSDTVLVN 670 680 690 700 710 720 790 800 pF1KB4 ALWETEVYIYEHREEFQKLIQLLLSP :::::::::::::::::::::::::: CCDS33 ALWETEVYIYEHREEFQKLIQLLLSP 730 740 750 >>CCDS33355.2 ANKRD44 gene_id:91526|Hs108|chr2 (367 aa) initn: 331 init1: 212 opt: 387 Z-score: 344.4 bits: 73.5 E(32554): 7.3e-13 Smith-Waterman score: 387; 31.7% identity (59.3% similar) in 290 aa overlap (155-437:11-290) 130 140 150 160 170 180 pF1KB4 LAVELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMDVT-- :: : .:: : :. . : ::. CCDS33 MAVLKLTDQPPLVQAIFSGDPE---EIRMLIHKTEDVNTL 10 20 30 190 200 210 220 230 240 pF1KB4 DYKGETVFHYAVQGDNSQVLQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLLLC : . .: .: :. ......:: .. : .: .:. ::::: : ..: :.::. CCDS33 DSEKRTPLHVAAFLGDAEIIELLILSG-ARVNAKDNMWLTPLHRAVASRSEEAVQVLIKH 40 50 60 70 80 90 250 260 270 280 290 pF1KB4 NARCNIMGPNGY-PIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAK---NAE .: : : :.: : . :::.:: . :: .. .: : : . :: : ..: CCDS33 SADVNARDKNWQTPLHVAAANKAVKCAEVIIPLLSS-VNVSD-RGGRTALHHAALNGHVE 100 110 120 130 140 150 300 310 320 330 340 350 pF1KB4 MARMLLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAM :. .:: .: :.:. .. ::: :.. ...: . .:..:::.. . ..: :::: : CCDS33 MVNLLLAKGANINAFDKKDRRALHWAAYMGHLDVVALLINHGAEVTCKDKKGYTPLHAAA 160 170 180 190 200 210 360 370 380 390 400 410 pF1KB4 SKDNVEMIKALIVFGAEVDTPNDFGETPTFLASKIGRLVTRKAILTLLRTVGAEYCFPPI :. .....: :. .:.:.: : .:.: .: :. :... : ::. : CCDS33 SNGQINVVKHLLNLGVEIDEINVYGNTALHIACYNGQ----DAVVNELIDYGANVNQPNN 220 230 240 250 260 270 420 430 440 450 460 470 pF1KB4 HG-VPAEQGSAAPHHPFSLERAQPPPISLNNLELQDLMHISRARKPAFILGSMRDEKRTH .: .: . ..:. : . :: CCDS33 NGFTPLHFAAASTHGALCLELLVNNGADVNIQSKDGKSPLHMTAVHGRFTRSQTLIQNGG 280 290 300 310 320 330 >>CCDS44734.1 ANKK1 gene_id:255239|Hs108|chr11 (765 aa) initn: 363 init1: 202 opt: 350 Z-score: 308.6 bits: 67.9 E(32554): 7.2e-11 Smith-Waterman score: 350; 32.4% identity (58.5% similar) in 299 aa overlap (120-406:462-744) 90 100 110 120 130 140 pF1KB4 QLLPFYESSPQVLHTEVLQHLTDLIRNHPSWSVAHLAVELGIRECFHHSRIISCANCAEN :. :::.. .... .:.. . : CCDS44 LHFAAQNGDDGTARLLLDHGACVDAQEREGWTPLHLAAQNNFENV---ARLLVSRQADPN 440 450 460 470 480 150 160 170 180 190 200 pF1KB4 ---EEGCTPLHLACRKGDGEILVELVQYCHTQMDVTDYKGETVFHYAVQ-GDNSQVLQLL :: ::::.: : ::.:. ...:. . . .: .: ::. : . .:: CCDS44 LHEAEGKTPLHVAAYFGHVS-LVKLLTSQGAELDAQQRNLRTPLHLAVERGKVRAIQHLL 490 500 510 520 530 540 210 220 230 240 250 260 pF1KB4 GRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLLLCNARCNIMGPNGY-PIHSAMKFSQ .:: . ....: ::: : :: . ..:: .: .. .:. :.: : . CCDS44 KSGAVP--DALDQSGYGPLHTAAARGKYLICKMLLRYGASLELPTHQGWTPLHLA---AY 550 560 570 580 590 600 270 280 290 300 310 pF1KB4 KGCAEMIISMDSSQIHSKDPRYGA---SPLHWA-KNAEMARM--LLKRGCNVNSTSSAGN :: :.: . : :.. :: .::: : ...: : . ::. : . :.. ..: CCDS44 KGHLEIIHLLAES--HANMGALGAVNWTPLHLAARHGEEAVVSALLQCGADPNAAEQSGW 610 620 630 640 650 660 320 330 340 350 360 370 pF1KB4 TALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSKDNVEMIKALIVFGAEVDT : ::.::.:. : .: :: : ::. ::.. : :: ::: : :. ..:.:. ::..:. CCDS44 TPLHLAVQRSTFLSVINLLEHHANVHARNKVGWTPAHLAALKGNTAILKVLVEAGAQLDV 670 680 690 700 710 720 380 390 400 410 420 430 pF1KB4 PNDFGETPTFLASKIGRLVTRK-AILTLLRTVGAEYCFPPIHGVPAEQGSAAPHHPFSLE . . :: :: : .:: .:...: CCDS44 QDGVSCTPLQLA-----LRSRKQGIMSFLEGKEPSVATLGGSKPGAEMEI 730 740 750 760 >>CCDS46769.1 ANKRD28 gene_id:23243|Hs108|chr3 (1053 aa) initn: 1002 init1: 210 opt: 352 Z-score: 308.4 bits: 68.3 E(32554): 7.3e-11 Smith-Waterman score: 355; 26.4% identity (55.0% similar) in 371 aa overlap (103-461:185-548) 80 90 100 110 120 pF1KB4 QLELEADALVNFHQYSSQLLPFYESSPQVLHTEVLQHLTD-----LIRNHPSWSVAHLAV : ::.. :.. ... :.. : :. CCDS46 MVKLLLSRGANINAFDKKDRRAIHWAAYMGHIEVVKLLVSHGAEVTCKDKKSYTPLHAAA 160 170 180 190 200 210 130 140 150 160 170 180 pF1KB4 ELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMDVTDYKGE :. .. .. : : ::::.:: .:. .. ::.. : . .. . :: CCDS46 SSGMISVVKYLLDLGVDMNEPNAYGNTPLHVACYNGQDVVVNELID-CGAIVNQKNEKGF 220 230 240 250 260 270 190 200 210 220 230 240 pF1KB4 TVFHYAVQGDNSQV-LQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLLLCNARC : .:.:. . .. . :.:: :. : .:. ...: ::::.. :. .... .: CCDS46 TPLHFAAASTHGALCLELLVGNG-ADVNMKSKDGKTPLHMTALHGRFSRSQTIIQSGAVI 280 290 300 310 320 330 250 260 270 280 290 300 pF1KB4 NIMGPNGY-PIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAKNA---EMARM . :: :.: : ..... . .:. :. .: .: ::: : . . : CCDS46 DCEDKNGNTPLHIAARYGHELLINTLIT--SGADTAKRGIHGMFPLHLAALSGFSDCCRK 340 350 360 370 380 390 310 320 330 340 350 360 pF1KB4 LLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSKDN ::. : .... .. : : ::.:. . ..: .::. ::. . . . : .::: : .. : CCDS46 LLSSGFDIDTPDDFGRTCLHAAAAGGNLECLNLLLNTGADFNKKDKFGRSPLHYAAANCN 400 410 420 430 440 450 370 380 390 400 410 420 pF1KB4 VEMIKALIVFGAEVDTPNDFGETPTFLASKIGRLVTRKAILTLLRTVGAEYCFPPIHGVP . . ::. :: :. .. : :: :. . : . :::. :. . .: CCDS46 YQCLFALVGSGASVNDLDERGCTPLHYAATSD--TDGKCLEYLLRN-DANPGIRDKQGYN 460 470 480 490 500 430 440 450 460 470 480 pF1KB4 AEQGSAAPHHPFSLER-AQPPPIS-LNNLELQDLMHISRARKPAFILGSMRDEKRTHDHL : . ::: : . :. :. :.. : . :.. : : CCDS46 AVHYSAAYGHRLCLQLIASETPLDVLMETSGTDMLSDSDNRATISPLHLAAYHGHHQALE 510 520 530 540 550 560 490 500 510 520 530 540 pF1KB4 LCLDGGGVKGLIIIQLLIAIEKASGVATKDLFDWVAGTSTGGILALAILHSKSMAYMRGM CCDS46 VLVQSLLDLDVRNSSGRTPLDLAAFKGHVECVDVLINQGASILVKDYILKRTPIHAAATN 570 580 590 600 610 620 806 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 15:18:34 2016 done: Thu Nov 3 15:18:35 2016 Total Scan time: 3.750 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]