FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA1503, 790 aa
  1>>>pF1KA1503 790 - 790 aa - 790 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.7474+/- 0.001; mu= 13.2135+/- 0.060
 mean_var=87.9179+/-17.826, 0's: 0 Z-trim(105.5): 26 B-trim: 5 in 1/50
 Lambda= 0.136784
 statistics sampled from 8418 (8437) to 8418 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.622), E-opt: 0.2 (0.259), width: 16
 Scan time: 3.060

The best scores are:                                       opt  bits E(32554)
CCDS32551.1 DNAH2 gene_id:146754|Hs108|chr17       (4427) 4244 848.2        0
CCDS76937.1 DNAH2 gene_id:146754|Hs108|chr17       ( 872) 2757 554.6 2.5e-157
CCDS9255.2 DNAH10 gene_id:196385|Hs108|chr12       (4471)  306  71.1  4.5e-11

>>CCDS32551.1 DNAH2 gene_id:146754|Hs108|chr17 (4427 aa)
 initn: 4244 init1: 4244 opt: 4244 Z-score: 4512.3 bits: 848.2 E(32554): 0
Smith-Waterman score: 4244; 100.0% identity (100.0% similar) in 635 aa overlap (1-635:1-635)

               10        20        30        40        50        60
pF1KA1 MSSKAEKKQRLSGRGSSQASWSGRATRAAVATQEQGNAPAVSEPELQAELPKEEPEPRLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MSSKAEKKQRLSGRGSSQASWSGRATRAAVATQEQGNAPAVSEPELQAELPKEEPEPRLE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 GPQAQSEESVEPEADVKPLFLSRAALTGLADAVWTQEHDAILEHFAQDPTESILTIFIDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GPQAQSEESVEPEADVKPLFLSRAALTGLADAVWTQEHDAILEHFAQDPTESILTIFIDP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 CFGLKLELGMPVQTQNQLVYFIRQAPVPITWENFEATVQFGTVRGPYIPALLRLLGGVFA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 CFGLKLELGMPVQTQNQLVYFIRQAPVPITWENFEATVQFGTVRGPYIPALLRLLGGVFA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 PQIFANTGWPESIRNHFASHLHKFLACLTDTRYKLEGHTVLYIPAEAMNMKPEMVIKDKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 PQIFANTGWPESIRNHFASHLHKFLACLTDTRYKLEGHTVLYIPAEAMNMKPEMVIKDKE
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 LVQRLETSMIHWTRQIKEMLSAQETVETGENLGPLEEIEFWRNRCMDLSGISKQLVKKGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 LVQRLETSMIHWTRQIKEMLSAQETVETGENLGPLEEIEFWRNRCMDLSGISKQLVKKGV
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA1 KHVESILHLAKSSYLAPFMKLAQQIQDGSRQAQSNLTFLSILKEPYQELAFMKPKDISSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 KHVESILHLAKSSYLAPFMKLAQQIQDGSRQAQSNLTFLSILKEPYQELAFMKPKDISSK
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KA1 LPKLISLIRIIWVNSPHYNTRERLTSLFRKVCDCQYHFARWEDGKQGPLPCFFGAQGPQI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 LPKLISLIRIIWVNSPHYNTRERLTSLFRKVCDCQYHFARWEDGKQGPLPCFFGAQGPQI
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KA1 TRNLLEIEDIFHKNLHTLRAVRGGILDVKNTCWHEDYNKFRAGIKDLEVMTQNLITSAFE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 TRNLLEIEDIFHKNLHTLRAVRGGILDVKNTCWHEDYNKFRAGIKDLEVMTQNLITSAFE
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KA1 LVRDVPHGVLLLDTFHRLASREAIKRTYDKKAVDLYMLFNSELALVNRERNKKWPDLEPY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 LVRDVPHGVLLLDTFHRLASREAIKRTYDKKAVDLYMLFNSELALVNRERNKKWPDLEPY
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KA1 VAQYSGKARWVHILRRRIDRVMTCLAGAHFLPRIGTGKESVHTYQQMVQAIDELVRKTFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 VAQYSGKARWVHILRRRIDRVMTCLAGAHFLPRIGTGKESVHTYQQMVQAIDELVRKTFQ
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KA1 EWTSSLDKDCIRRLDTPLLRISQEKAGMLDVNFDKYRSHLAPFPYTPLLQLSQEFHSHLL
       :::::::::::::::::::::::::::::::::::
CCDS32 EWTSSLDKDCIRRLDTPLLRISQEKAGMLDVNFDKSLLILFAEIDYWERLLFETPHYVVN
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KA1 TPLFIILSLSHTICLLSSFYFFFSSFIFVSPHLPPCYQHFNFTTYLKTQQNKTMIGQARW

CCDS32 VAERAEDLRILRENLLLVARDYNRIIAMLSPDEQALFKERIRLLDKKIHPGLKKLHWALK
              670       680       690       700       710       720

>>CCDS76937.1 DNAH2 gene_id:146754|Hs108|chr17 (872 aa)
 initn: 2757 init1: 2757 opt: 2757 Z-score: 2938.1 bits: 554.6 E(32554): 2.5e-157
Smith-Waterman score: 4837; 89.8% identity (89.9% similar) in 822 aa overlap (51-790:51-872)

               30        40        50        60        70        80
pF1KA1 WSGRATRAAVATQEQGNAPAVSEPELQAELPKEEPEPRLEGPQAQSEESVEPEADVKPLF
                                     ::::::::::::::::::::::::::::::
CCDS76 WSGRATRAAVATQEQGNAPAVSEPELQAELPKEEPEPRLEGPQAQSEESVEPEADVKPLF
               30        40        50        60        70        80

               90       100       110       120       130       140
pF1KA1 LSRAALTGLADAVWTQEHDAILEHFAQDPTESILTIFIDPCFGLKLELGMPVQTQNQLVY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LSRAALTGLADAVWTQEHDAILEHFAQDPTESILTIFIDPCFGLKLELGMPVQTQNQLVY
               90       100       110       120       130       140

              150       160       170       180       190       200
pF1KA1 FIRQAPVPITWENFEATVQFGTVRGPYIPALLRLLGGVFAPQIFANTGWPESIRNHFASH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 FIRQAPVPITWENFEATVQFGTVRGPYIPALLRLLGGVFAPQIFANTGWPESIRNHFASH
              150       160       170       180       190       200

              210       220       230       240       250       260
pF1KA1 LHKFLACLTDTRYKLEGHTVLYIPAEAMNMKPEMVIKDKELVQRLETSMIHWTRQIKEML
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LHKFLACLTDTRYKLEGHTVLYIPAEAMNMKPEMVIKDKELVQRLETSMIHWTRQIKEML
              210       220       230       240       250       260

              270       280       290       300       310       320
pF1KA1 SAQETVETGENLGPLEEIEFWRNRCMDLSGISKQLVKKGVKHVESILHLAKSSYLAPFMK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 SAQETVETGENLGPLEEIEFWRNRCMDLSGISKQLVKKGVKHVESILHLAKSSYLAPFMK
              270       280       290       300       310       320

              330       340       350       360       370       380
pF1KA1 LAQQIQDGSRQAQSNLTFLSILKEPYQELAFMKPKDISSKLPKLISLIRIIWVNSPHYNT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LAQQIQDGSRQAQSNLTFLSILKEPYQELAFMKPKDISSKLPKLISLIRIIWVNSPHYNT
              330       340       350       360       370       380

              390
pF1KA1 RERLTSLFRK--------------------------------------------------
       ::::::::::
CCDS76 RERLTSLFRKMSNEIIRLCCHAISLDRIFEGYVSSSKEDLQGCILCCHAWKDHYVQAVQM
              390       400       410       420       430       440

                                              400       410
pF1KA1 --------------------------------VCDCQYHFARWEDGKQGPLPCFFGAQGP
                                       ::::::::::::::::::::::::::::
CCDS76 HIQFSSRGWVLDQTSIFAQVDAFVQRCKDLIEVCDCQYHFARWEDGKQGPLPCFFGAQGP
              450       460       470       480       490       500

      420       430       440       450       460       470
pF1KA1 QITRNLLEIEDIFHKNLHTLRAVRGGILDVKNTCWHEDYNKFRAGIKDLEVMTQNLITSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 QITRNLLEIEDIFHKNLHTLRAVRGGILDVKNTCWHEDYNKFRAGIKDLEVMTQNLITSA
              510       520       530       540       550       560

      480       490       500       510       520       530
pF1KA1 FELVRDVPHGVLLLDTFHRLASREAIKRTYDKKAVDLYMLFNSELALVNRERNKKWPDLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 FELVRDVPHGVLLLDTFHRLASREAIKRTYDKKAVDLYMLFNSELALVNRERNKKWPDLE
              570       580       590       600       610       620

      540       550       560       570       580       590
pF1KA1 PYVAQYSGKARWVHILRRRIDRVMTCLAGAHFLPRIGTGKESVHTYQQMVQAIDELVRKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 PYVAQYSGKARWVHILRRRIDRVMTCLAGAHFLPRIGTGKESVHTYQQMVQAIDELVRKT
              630       640       650       660       670       680

      600       610       620       630       640       650
pF1KA1 FQEWTSSLDKDCIRRLDTPLLRISQEKAGMLDVNFDKYRSHLAPFPYTPLLQLSQEFHSH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 FQEWTSSLDKDCIRRLDTPLLRISQEKAGMLDVNFDKYRSHLAPFPYTPLLQLSQEFHSH
              690       700       710       720       730       740

      660       670       680       690       700       710
pF1KA1 LLTPLFIILSLSHTICLLSSFYFFFSSFIFVSPHLPPCYQHFNFTTYLKTQQNKTMIGQA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LLTPLFIILSLSHTICLLSSFYFFFSSFIFVSPHLPPCYQHFNFTTYLKTQQNKTMIGQA
              750       760       770       780       790       800

      720       730       740       750       760       770
pF1KA1 RWLTPVIPALWEAGVGASLEPRSLRTAWATWQNPVSAKNTKISWAWWHKPVVSATWEGEV
       ::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 RWLTPVIPALWEAEVGASLEPRSLRTAWATWQNPVSAKNTKISWAWWHKPVVSATWEGEV
              810       820       830       840       850       860

      780       790
pF1KA1 GGSPEPGRRRLQ
       ::::::::.:::
CCDS76 GGSPEPGRQRLQ
              870

>>CCDS9255.2 DNAH10 gene_id:196385|Hs108|chr12 (4471 aa)
 initn: 268 init1: 129 opt: 306 Z-score: 312.3 bits: 71.1 E(32554): 4.5e-11
Smith-Waterman score: 306; 28.0% identity (61.5% similar) in 200 aa overlap (193-391:196-394)

            170       180       190       200       210       220
pF1KA1 VRGPYIPALLRLLGGVFAPQIFANTGWPESIRNHFASHLHKFLACLTDTRYKLEGHTVLY
                                     ::..:  ...:: . .  :  .:::.  :
CCDS92 TSGEVSNSSEHESDLPPMPGEAVEYHSIQLIRDEFLMNVQKFASNIQRTMQQLEGEIKLE
         170       180       190       200       210       220

            230       240       250       260       270       280
pF1KA1 IPAEAMNMKPEMVIKDKELVQRLETSMIHWTRQIKEMLSAQETVETGENLGPLEEIEFWR
       .:  ... .   .  : : :. ::  .:.:  ::.  . ::   .: .. ::: ::::::
CCDS92 MPIISVEGEVSDLAADPETVDILEQCVINWLNQISTAVEAQ-LKKTPQGKGPLAEIEFWR
         230       240       250       260        270       280

            290       300       310       320       330       340
pF1KA1 NRCMDLSGISKQLVKKGVKHVESILHLAKSSYLAPFMKLAQQIQDGSRQAQSNLTFLSIL
       .:   ::.. .:     :..: .... . :  .: .. .  ..     .:..:. ::: .
CCDS92 ERNATLSALHEQTKLPIVRKVLDVIKESDSMLVANLQPVFTELFKFHTEASDNVRFLSTV
          290       300       310       320       330       340

            350        360       370       380       390       400
pF1KA1 KEPYQELAFMKPKDIS-SKLPKLISLIRIIWVNSPHYNTRERLTSLFRKVCDCQYHFARW
       .. .....  .   .  . .: ..: .:..:. : :::  ::.  :....
CCDS92 ERYFKNITHGSGFHVVLDTIPAMMSALRMVWIISRHYNKDERMIPLMERIAWEIAERVCR
          350       360       370       380       390       400

             410       420       430       440       450       460
pF1KA1 EDGKQGPLPCFFGAQGPQITRNLLEIEDIFHKNLHTLRAVRGGILDVKNTCWHEDYNKFR

CCDS92 VVNLRTLFKENRASAQSKTLEARNTLRLWKKAYFDTRAKIEASGREDRWEFDRKRLFERT
          410       420       430       440       450       460

790 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 20:05:59 2016 done: Thu Nov 3 20:06:00 2016
 Total Scan time: 3.060 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]