FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8843, 496 aa 1>>>pF1KB8843 496 - 496 aa - 496 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 12.0770+/-0.000955; mu= -10.7388+/- 0.057 mean_var=447.8430+/-89.944, 0's: 0 Z-trim(117.8): 20 B-trim: 0 in 0/55 Lambda= 0.060605 statistics sampled from 18567 (18582) to 18567 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.831), E-opt: 0.2 (0.571), width: 16 Scan time: 4.430 The best scores are: opt bits E(32554) CCDS72874.1 ANKRD34A gene_id:284615|Hs108|chr1 ( 496) 3417 312.7 5.9e-85 CCDS53965.1 ANKRD34C gene_id:390616|Hs108|chr15 ( 535) 823 85.9 1.2e-16 CCDS34194.1 ANKRD34B gene_id:340120|Hs108|chr5 ( 514) 678 73.2 7.5e-13 >>CCDS72874.1 ANKRD34A gene_id:284615|Hs108|chr1 (496 aa) initn: 3417 init1: 3417 opt: 3417 Z-score: 1638.7 bits: 312.7 E(32554): 5.9e-85 Smith-Waterman score: 3417; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496) 10 20 30 40 50 60 pF1KB8 MLHTEGHALLRAVGQGKLRLARLLLEGGAYVNEGDAQGETALMAACRARYDDPQNKARMV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 MLHTEGHALLRAVGQGKLRLARLLLEGGAYVNEGDAQGETALMAACRARYDDPQNKARMV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 RYLLEQGADPNIADRLGRTALMHACAGGGGAAVASLLLAHGADPSVRDHAGASALVHALD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 RYLLEQGADPNIADRLGRTALMHACAGGGGAAVASLLLAHGADPSVRDHAGASALVHALD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 RGDRETLATLLDACKAKGTEVIIITTDTSPSGTKKTRQYLNSPPSPGVEDPAPASPSPGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 RGDRETLATLLDACKAKGTEVIIITTDTSPSGTKKTRQYLNSPPSPGVEDPAPASPSPGF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 CTSPSEIQLQTAGGGGRGMLSPRAQEEEEKRDVFEFPLPKPPDDPSPSEPLPKPPRHPPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 CTSPSEIQLQTAGGGGRGMLSPRAQEEEEKRDVFEFPLPKPPDDPSPSEPLPKPPRHPPK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 PLKRLNSEPWGLVAPPQPVPPTEGRPGIERLTAEFNGLTLTGRPRLSRRHSTEGPEDPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 PLKRLNSEPWGLVAPPQPVPPTEGRPGIERLTAEFNGLTLTGRPRLSRRHSTEGPEDPPP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 WAEKVTSGGPLSRRNTAPEAQESGPPSGLRQKLSRMEPVELDTPGHLCPDSPESSRLSLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 WAEKVTSGGPLSRRNTAPEAQESGPPSGLRQKLSRMEPVELDTPGHLCPDSPESSRLSLE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 RRRYSASPLTLPPAGSAPSPRQSQESLPGAVSPLSGRRRSPGLLERRGSGTLLLDHISQT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 RRRYSASPLTLPPAGSAPSPRQSQESLPGAVSPLSGRRRSPGLLERRGSGTLLLDHISQT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB8 RPGFLPPLNVSPHPPIPDIRPQPGGRAPSLPAPPYAGAPGSPRTKRKLVRRHSMQTEQIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 RPGFLPPLNVSPHPPIPDIRPQPGGRAPSLPAPPYAGAPGSPRTKRKLVRRHSMQTEQIR 430 440 450 460 470 480 490 pF1KB8 LLGGFQSLGGPGEPGR :::::::::::::::: CCDS72 LLGGFQSLGGPGEPGR 490 >>CCDS53965.1 ANKRD34C gene_id:390616|Hs108|chr15 (535 aa) initn: 925 init1: 482 opt: 823 Z-score: 412.5 bits: 85.9 E(32554): 1.2e-16 Smith-Waterman score: 1205; 44.1% identity (65.3% similar) in 547 aa overlap (2-488:8-533) 10 20 30 40 50 pF1KB8 MLHTEGHALLRAVGQGKLRLARLLLEGGAYVNEGDAQGETALMAACRARYDDPQ :.:.:..::.:: :.:::.:::::::::.::.. .::::::.:: ... : : CCDS53 MMDDDTELRTDGNSLLKAVWLGRLRLTRLLLEGGAYINESNDKGETALMVACITKHVDQQ 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB8 N--KARMVRYLLEQGADPNIADRLGRTALMHACAGGGGAAVASLLLAHGADPSVRDHAGA . :..::.:::.. ::::: :. :.:::.::: .:. :.:::: .:::::..:..:: CCDS53 SISKSKMVKYLLDNRADPNIQDKSGKTALIHACIRRAGGEVVSLLLENGADPSLEDRTGA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB8 SALVHALDRGDRETLATLLDACKAKGTEVIIITTDTSPSGTKKTRQYLNSPPSPGVEDPA ::::.:.. :...: ::::::::: :::::::: : :::: :.:::: :::: ::: CCDS53 SALVYAINADDKDALKHLLDACKAKGKEVIIITTDKSSSGTKTTKQYLNVPPSPKVED-- 130 140 150 160 170 180 190 200 210 220 pF1KB8 PASPSPGFCTSPSEIQLQTAGGGGRGMLSPRAQEEEEKRDVFEFPLPKPP--------DD :: .:.:::.:.:.. : . :: ...:. : : . .: .. CCDS53 --RHSPPLCASPSDIELKALG-----LDSPLTEKED---DFFSLQAGHPSSCNTSKAVNE 180 190 200 210 220 230 240 250 260 270 pF1KB8 P-SPSEPLPKPPRHPPKPLKRLNSEPWGLVAPPQPVPPTE-----GRPGIERLTAEFNGL : ::.. . . : ::::.::::::.:: . :. : ... .. . CCDS53 PGSPTRKVSNLKRARLPQLKRLQSEPWGLIAPSVLAASTRQDETHGASTDNEVIKSISDI 230 240 250 260 270 280 280 290 300 310 pF1KB8 TLTGRPRLSRRHSTEGPED------------------PPPWAEKVTSG-GP---LSRRNT .. : ::: .: .. . : : .: .: :.::.: CCDS53 SFPKRGPLSRTNSIDSKDPTLFHTVTEQVLKIPVSSAPASWKAAYEKGQAPHPRLARRGT 290 300 310 320 330 340 320 330 340 350 360 pF1KB8 APEAQES---GP--PSGLRQ--KLSRMEP--VELDT-PGHLCPDSP-----ESSRLSLER : ::. :: ::.:.. .:. .: .:: :: :: : ::.. :.: CCDS53 LPVDQEKCGMGPSGPSALKEPASLKWLENDLYDLDIQPG---PDPPNSISLESGKGPLDR 350 360 370 380 390 400 370 380 390 400 410 420 pF1KB8 RRYSASPLTLPPAGSAPSPRQSQESLPGAVSPLSGRRRSPGLLERRGSGTLLLDHISQTR .. ..: :.: :: :.: ...: ..:: :.::: : :::::::::::::.::.:: CCDS53 KKLNSSHLSLFH-GS----RESLDTVP-STSPSSARRRPPHLLERRGSGTLLLDRISHTR 410 420 430 440 450 430 440 450 460 470 pF1KB8 PGFLPPLNVSPHPPIPDIRP--QPGGRAPSLPAPPYAGAPGSP-----RTKRKLVRRHSM :::::::::. .::::::: .:. : ::.:: :.:.::.::::: CCDS53 PGFLPPLNVNLNPPIPDIRSSSKPSCSLASGLKSMVPVAPSSPKRVDLRSKKKLLRRHSM 460 470 480 490 500 510 480 490 pF1KB8 QTEQIRLLGGFQSLGGPGEPGR : ::.. :. :. . CCDS53 QIEQMKQLSDFEEIMT 520 530 >>CCDS34194.1 ANKRD34B gene_id:340120|Hs108|chr5 (514 aa) initn: 626 init1: 419 opt: 678 Z-score: 344.2 bits: 73.2 E(32554): 7.5e-13 Smith-Waterman score: 944; 40.0% identity (60.7% similar) in 535 aa overlap (2-485:7-514) 10 20 30 40 50 pF1KB8 MLHTEGHALLRAVGQGKLRLARLLLEGGAYVNEGDAQGETALMAACRARYDDPQN . .::..:..:: :..:::.:::::::::.::.. .::: :: ::.... : :. CCDS34 MDEGMEISSEGNSLIKAVHQSRLRLTRLLLEGGAYINESNDRGETPLMIACKTKHVDHQS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB8 --KARMVRYLLEQGADPNIADRLGRTALMHACAGGGGAAVASLLLAHGADPSVRDHAGAS ::.::.::::..::::: :. :.::::::: .: :.:::: ::: :..::.. : CCDS34 VSKAKMVKYLLENNADPNIQDKSGKTALMHACLEKAGPEVVSLLLKSGADLSLQDHSSYS 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB8 ALVHALDRGDRETLATLLDACKAKGTEVIIITTDTSPSGTKKTRQYLNSPPSPGVEDPAP :::.:.. : ::: .::.:::::: ::::::: : : . :.:::: :: :. CCDS34 ALVYAINSEDTETLKVLLSACKAKGKEVIIITTAKLPCGKHTTKQYLNMPP---VD--ID 130 140 150 160 170 180 190 200 210 220 pF1KB8 ASPSPGFCTSPSEIQLQTAGGGGRGMLSPRAQEEEEKRDVFEFP---LPKPPDDP-SPSE . ::. ::.::::...::. :: .. : . .: : : :: .:. CCDS34 GCHSPATCTTPSEIDIKTAS-------SPLSHSSETELTLFGFKDLELAGSNDDTWDPGS 180 190 200 210 220 230 240 250 260 270 280 pF1KB8 PLPKPPRHPPKPLKRLNSEPWGLVAPPQPVPPT------EGRPGI---ERLTAEFNGLTL :. :: : : : .. :: . .:: . . : : :.:. . :::.: CCDS34 PVRKPALAPKGP-KLPHAPPW-VKSPPLLMHQNRVASLQEELQDITPEEELSYKTNGLAL 230 240 250 260 270 280 290 300 310 320 330 340 pF1KB8 TGRPRLSRRHSTEGPEDPPPWAEKVTSGGPLSRRNTAPEAQESGPPSGLRQKLSRMEPVE . :. ::.. .: . ... ::. . : . .. : :. .. ::. CCDS34 S--KRFITRHQSIDVKDTAHLLRAFDQAS--SRKMSYDEINCQSYLSEGNQQCIEV-PVD 290 300 310 320 330 340 350 360 370 pF1KB8 LDTPGHLCPDSPES---SRL-SLERRR------YSA-SPLT---LPPA---GSA------ : ::: .. : : :. ..: ::. : :. ::. :.: CCDS34 QD------PDSNQTIFASTLRSIVQKRNLGANHYSSDSQLSAGLTPPTSEDGKALIGKKK 350 360 370 380 390 380 390 400 410 420 430 pF1KB8 ---PSPRQSQES--LPGAVSPLSGRRRSPGLLERRGSGTLLLDH-ISQTRPGFLPPLNVS ::: : .:: : . : ::. ..:::::::.. ::: ..::: ::::::::. CCDS34 ILSPSPSQLSESKELLENIPPGPLSRRNHAVLERRGSGAFPLDHSVTQTRQGFLPPLNVN 400 410 420 430 440 450 440 450 460 470 480 pF1KB8 PHPPIPDIRPQPG-------GRAPSLPAPPYAGAPGSPRTKRKLVRRHSMQTEQIRLLGG :::: :: . :. .:. : : ..:. :.::.:.:::::. : . CCDS34 SHPPISDINVNNKICSLLSCGQKVLMPTVPI--FPKEFKSKKMLLRRQSLQTEQIKQLVN 460 470 480 490 500 510 490 pF1KB8 FQSLGGPGEPGR : CCDS34 F 496 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 16:03:22 2016 done: Fri Nov 4 16:03:23 2016 Total Scan time: 4.430 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]