FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3219, 669 aa 1>>>pF1KB3219 669 - 669 aa - 669 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.0001+/-0.00108; mu= 12.1106+/- 0.064 mean_var=148.1580+/-30.100, 0's: 0 Z-trim(108.7): 180 B-trim: 58 in 1/50 Lambda= 0.105369 statistics sampled from 10172 (10371) to 10172 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.674), E-opt: 0.2 (0.319), width: 16 Scan time: 4.130 The best scores are: opt bits E(32554) CCDS12135.1 FEM1A gene_id:55527|Hs108|chr19 ( 669) 4470 692.0 7.1e-199 CCDS4118.1 FEM1C gene_id:56929|Hs108|chr5 ( 617) 1702 271.2 3.1e-72 CCDS10228.1 FEM1B gene_id:10116|Hs108|chr15 ( 627) 672 114.6 4.3e-25 >>CCDS12135.1 FEM1A gene_id:55527|Hs108|chr19 (669 aa) initn: 4470 init1: 4470 opt: 4470 Z-score: 3683.9 bits: 692.0 E(32554): 7.1e-199 Smith-Waterman score: 4470; 100.0% identity (100.0% similar) in 669 aa overlap (1-669:1-669) 10 20 30 40 50 60 pF1KB3 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVAGGGTPLLIAARYGHLDVVEYL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVAGGGTPLLIAARYGHLDVVEYL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 VDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRTNSTPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 VDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRTNSTPL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 RAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQVNRRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQVNRRS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 AKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLIQEQPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLIQEQPG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 QEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSREAAVEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 QEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSREAAVEA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 LELLGATYVDKKRDLLGALKHWRRAMELRHQGGEYLPKPEPPQLVLAYDYSREVNTTEEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LELLGATYVDKKRDLLGALKHWRRAMELRHQGGEYLPKPEPPQLVLAYDYSREVNTTEEL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 EALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIRLWKYALDMQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 EALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIRLWKYALDMQ 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB3 QSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKGVREVERALQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 QSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKGVREVERALQ 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB3 LPREPGDSAQFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRLLKCAPRGKNGFTPLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LPREPGDSAQFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRLLKCAPRGKNGFTPLH 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB3 MAVDKDTTNVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNTPLHIAAQNNCPAIMNAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MAVDKDTTNVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNTPLHIAAQNNCPAIMNAL 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB3 IEAGAHMDATNAFKKTAYELLDEKLLARGTMQPFNYVTLQCLAARALDKNKIPYKGFIPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 IEAGAHMDATNAFKKTAYELLDEKLLARGTMQPFNYVTLQCLAARALDKNKIPYKGFIPE 610 620 630 640 650 660 pF1KB3 DLEAFIELH ::::::::: CCDS12 DLEAFIELH >>CCDS4118.1 FEM1C gene_id:56929|Hs108|chr5 (617 aa) initn: 2238 init1: 969 opt: 1702 Z-score: 1410.3 bits: 271.2 E(32554): 3.1e-72 Smith-Waterman score: 2791; 64.8% identity (82.8% similar) in 670 aa overlap (1-669:1-616) 10 20 30 40 50 60 pF1KB3 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVAGGGTPLLIAARYGHLDVVEYL :::.:::.::::::::.:: :::...:.::.. : .: ..:.::::.::::::::.::.: CCDS41 MDLKTAVFNAARDGKLRLLTKLLASKSKEEVSSLISEKTNGATPLLMAARYGHLDMVEFL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 VDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRTNSTPL ...:.::.:.::::.::::::::::::::::::::: ::.::: .::::: :: :::::: CCDS41 LEQCSASIEVGGSVNFDGETIEGAPPLWAASAAGHLKVVQSLLNHGASVNNTTLTNSTPL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 RAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQVNRRS ::::::::::.:.::: ::.:::::.::::::::::::::::.:::.::::.::.:::.: CCDS41 RAACFDGHLEIVKYLV-EHKADLEVSNRHGHTCLMISCYKGHKEIAQYLLEKGADVNRKS 130 140 150 160 170 190 200 210 220 230 240 pF1KB3 AKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLIQEQPG .:::::::::::::::.:...:: :.::.:::::::::.::::::::::..: .. CCDS41 VKGNTALHDCAESGSLDIMKMLLMYCAKMEKDGYGMTPLLSASVTGHTNIVDFLTHH--- 180 190 200 210 220 230 250 260 270 280 290 300 pF1KB3 QEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSREAAVEA :: ::. ..: CCDS41 --------AQ-----------------------------------------TSKTERINA 240 310 320 330 340 350 pF1KB3 LELLGATYVDKKRDLLGALKHWRRAMELRHQG-GEYLPKPEPPQLVLAYDYSREVNTTEE :::::::.::::::::::::.:..::..:.. . . :: : :..::::..:::..:: CCDS41 LELLGATFVDKKRDLLGALKYWKKAMNMRYSDRTNIISKPVPQTLIMAYDYAKEVNSAEE 250 260 270 280 290 300 360 370 380 390 400 410 pF1KB3 LEALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIRLWKYALDM ::.::.:::::::::::::::::::::::::::::::::::::::::.::: :::::::: CCDS41 LEGLIADPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFKRCINLWKYALDM 310 320 330 340 350 360 420 430 440 450 460 470 pF1KB3 QQSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKGVREVERAL :::::.:::::::::.::::::::..::::: :: ::: . : ::::.: :.: :.:::. CCDS41 QQSNLDPLSPMTASSLLSFAELFSFMLQDRA-KGLLGTTVTFDDLMGILCKSVLEIERAI 370 380 390 400 410 420 480 490 500 510 520 530 pF1KB3 QLPREPGDSAQFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRLLKCAPRGKNGFTPL . . :.: :..:::.:::::. ::::: :: :.:.:.::.::.:: :::::.:.:: CCDS41 KQTQCPADPLQLNKALSIILHLICLLEKVPCTLEQDHFKKQTIYRFLKLHPRGKNNFSPL 430 440 450 460 470 480 540 550 560 570 580 590 pF1KB3 HMAVDKDTTNVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNTPLHIAAQNNCPAIMNA :.::::.:: :::::: .::::.:. .:..:::: . :: :.:.:::::: :: : ::: CCDS41 HLAVDKNTTCVGRYPVCKFPSLQVTAILIECGADVNVRDSDDNSPLHIAALNNHPDIMNL 490 500 510 520 530 540 600 610 620 630 640 650 pF1KB3 LIEAGAHMDATNAFKKTAYELLDEKLLARGTMQPFNYVTLQCLAARALDKNKIPYKGFIP ::..:::.:::: :.:: .::::: .:.. .::.:..::::::::.. ...: ::: :: CCDS41 LIKSGAHFDATNLHKQTASDLLDEKEIAKNLIQPINHTTLQCLAARVIVNHRIYYKGHIP 550 560 570 580 590 600 660 pF1KB3 EDLEAFIELH : ::.:. :: CCDS41 EKLETFVSLHR 610 >>CCDS10228.1 FEM1B gene_id:10116|Hs108|chr15 (627 aa) initn: 1114 init1: 349 opt: 672 Z-score: 564.0 bits: 114.6 E(32554): 4.3e-25 Smith-Waterman score: 1232; 35.8% identity (63.0% similar) in 689 aa overlap (7-669:8-627) 10 20 30 40 50 pF1KB3 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVA--GG--GTPLLIAARYGHLD ::.:: .::. : :: .::. .. : : :. :: .:::.:::: :: CCDS10 MEGLAGYVYKAASEGKVLTLAALLLNRSESDIRYLLGYVSQQGGQRSTPLIIAARNGHAK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB3 VVEYLVDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRT ::. :... .... :.:.::: .:.:: :: :..:::..::. :. .::.::.:: : CCDS10 VVRLLLEHYRVQTQQTGTVRFDGYVIDGATALWCAAGAGHFEVVKLLVSHGANVNHTTVT 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB3 NSTPLRAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQ ::::::::::::.:..:.::: :..:.. .::.. .:::::. :::: ...:::::: :. CCDS10 NSTPLRAACFDGRLDIVKYLV-ENNANISIANKYDNTCLMIAAYKGHTDVVRYLLEQRAD 130 140 150 160 170 180 190 200 210 220 230 pF1KB3 VNRRSAKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLI : .. : :::: ::.: ..:.. :. .: . .:.::::: .:. . ....:: :. CCDS10 PNAKAHCGATALHFAAEAGHIDIVKELIKWRAAIVVNGHGMTPLKVAAESCKADVVELLL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB3 QEQPGQEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSRE :. .: .:. CCDS10 -------------------------------------------------SHADC---DRR 240 300 310 320 330 340 350 pF1KB3 AAVEALELLGATYVDKKR--DLLGALKHWRRAMELRHQGGEYLPKPE--PPQLVLAYDYS . .::::::::.... .. :.. . .. :: : : :. . . : :: . :: CCDS10 SRIEALELLGASFANDRENYDIIKTYHYLYLAMLERFQDGDNILEKEVLPP--IHAYGNR 250 260 270 280 290 300 360 370 380 390 400 410 pF1KB3 REVNTTEELEALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIR : . .:::.. : : ..:..:..:::::: .. :.:. : ::::::::. .::.::. CCDS10 TECRNPQELESIRQDRDALHMEGLIVRERILGADNIDVSHPIIYRGAVYADNMEFEQCIK 310 320 330 340 350 360 420 430 440 450 460 470 pF1KB3 LWKYALDMQQSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKG :: .:: ..:.. . : ...: ::..:: ... :. . :. :: . CCDS10 LWLHALHLRQKG----NRNTHKDLLRFAQVFSQMIH-------LNETVKAPDIECVLRCS 370 380 390 400 410 480 490 500 510 520 pF1KB3 VREVERALQLPREPGDSA------QFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRL : :.:.... .. .:. .. : .:.:. . :..:. .. .. .: : CCDS10 VLEIEQSMNRVKNISDADVHNAMDNYECNLYTFLYLVCISTKTQCSEEDQCKINKQIYNL 420 430 440 450 460 470 530 540 550 560 570 580 pF1KB3 LKCAPRGKNGFTPLHMAVDKDTT--NVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNT .. :: ..::: ::.::...: . : ::. :.:.::::::. .. : ..:. CCDS10 IHLDPRTREGFTLLHLAVNSNTPVDDFHTNDVCSFPNALVTKLLLDCGAEVNAVDNEGNS 480 490 500 510 520 530 590 600 610 620 630 pF1KB3 PLHIAAQNNCP--------AIMNALIEAGAHMDATNAFKKTAYELLDEKL--LARGTMQP ::: .: : : .:. .:.::::: : :: .:: ::.. ... .. CCDS10 ALHIIVQYNRPISDFLTLHSIIISLVEAGAHTDMTNKQNKTP---LDKSTTGVSEILLKT 540 550 560 570 580 590 640 650 660 pF1KB3 FNYVTLQCLAARALDKNKIPYKGFIPEDLEAFIELH ..:.::::::. : : :. ::. :: :. .: CCDS10 QMKMSLKCLAARAVRANDINYQDQIPRTLEEFVGFH 600 610 620 669 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 12:31:38 2016 done: Thu Nov 3 12:31:39 2016 Total Scan time: 4.130 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]