FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0470, 301 aa
  1>>>pF1KE0470 301 - 301 aa - 301 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.8899+/-0.000861; mu= 18.2347+/- 0.052
 mean_var=64.4107+/-12.736, 0's: 0 Z-trim(106.2): 22 B-trim: 0 in 0/50
 Lambda= 0.159807
 statistics sampled from 8819 (8832) to 8819 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.641), E-opt: 0.2 (0.271), width: 16
 Scan time: 2.350

The best scores are:                                       opt  bits E(32554)
CCDS35207.1 GPM6B gene_id:2824|Hs108|chrX          ( 305) 2061 483.7 7.3e-137
CCDS35206.1 GPM6B gene_id:2824|Hs108|chrX          ( 328) 1921 451.5 4e-127
CCDS48084.1 GPM6B gene_id:2824|Hs108|chrX          ( 246) 1675 394.7 3.8e-110
CCDS14158.1 GPM6B gene_id:2824|Hs108|chrX          ( 265) 1675 394.7 4e-110
CCDS14514.1 PLP1 gene_id:5354|Hs108|chrX           ( 242)  979 234.2 7.6e-62
CCDS3824.1 GPM6A gene_id:2823|Hs108|chr4           ( 278)  855 205.6 3.4e-53
CCDS54822.1 GPM6A gene_id:2823|Hs108|chr4          ( 267)  854 205.4 3.9e-53
CCDS58936.1 GPM6A gene_id:2823|Hs108|chr4          ( 271)  854 205.4 3.9e-53
CCDS14513.1 PLP1 gene_id:5354|Hs108|chrX           ( 277)  496 122.9 2.8e-28

>>CCDS35207.1 GPM6B gene_id:2824|Hs108|chrX (305 aa)
 initn: 2061 init1: 2061 opt: 2061 Z-score: 2570.6 bits: 483.7 E(32554): 7.3e-137
Smith-Waterman score: 2061; 100.0% identity (100.0% similar) in 301 aa overlap (1-301:5-305)

                   10        20        30        40        50
pF1KE0     METAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
           ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MKPAMETAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
               10        20        30        40        50        60

         60        70        80        90       100       110
pF1KE0 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
               70        80        90       100       110       120

        120       130       140       150       160       170
pF1KE0 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
              130       140       150       160       170       180

        180       190       200       210       220       230
pF1KE0 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
              190       200       210       220       230       240

        240       250       260       270       280       290
pF1KE0 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
              250       260       270       280       290       300

        300
pF1KE0 CCTKF
       :::::
CCDS35 CCTKF

>>CCDS35206.1 GPM6B gene_id:2824|Hs108|chrX (328 aa)
 initn: 1919 init1: 1919 opt: 1921 Z-score: 2395.7 bits: 451.5 E(32554): 4e-127
Smith-Waterman score: 1921; 95.6% identity (97.6% similar) in 294 aa overlap (1-294:5-298)

                   10        20        30        40        50
pF1KE0     METAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
           ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MKPAMETAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
               10        20        30        40        50        60

         60        70        80        90       100       110
pF1KE0 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
               70        80        90       100       110       120

        120       130       140       150       160       170
pF1KE0 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
              130       140       150       160       170       180

        180       190       200       210       220       230
pF1KE0 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
              190       200       210       220       230       240

        240       250       260       270       280       290
pF1KE0 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
       :::::::::::::::::::::::::::::::::::::::. ..:  . :.: ::  :.
CCDS35 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALIHFLMILSSNWAYLKDASKMQ
              250       260       270       280       290       300

        300
pF1KE0 CCTKF

CCDS35 AYQDIKAKEEQELQDIQSRSKEQLNSYT
              310       320

>>CCDS48084.1 GPM6B gene_id:2824|Hs108|chrX (246 aa)
 initn: 1675 init1: 1675 opt: 1675 Z-score: 2091.0 bits: 394.7 E(32554): 3.8e-110
Smith-Waterman score: 1675; 100.0% identity (100.0% similar) in 245 aa overlap (57-301:2-246)

         30        40        50        60        70        80
pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL
                                     ::::::::::::::::::::::::::::::
CCDS48                              MGCFECCIKCLGGVPYASLVATILCFSGVAL
                                            10        20        30

         90       100       110       120       130       140
pF1KE0 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF
              40        50        60        70        80        90

        150       160       170       180       190       200
pF1KE0 YTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEV
             100       110       120       130       140       150

        210       220       230       240       250       260
pF1KE0 IKSPQTNGTTGVEQICVDIRQYGIIPWNAFPGKICGSALENICNTNEFYMSYHLFIVACA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 IKSPQTNGTTGVEQICVDIRQYGIIPWNAFPGKICGSALENICNTNEFYMSYHLFIVACA
             160       170       180       190       200       210

        270       280       290       300
pF1KE0 GAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
       :::::::::::::::::::::::::::::::::::
CCDS48 GAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
             220       230       240

>>CCDS14158.1 GPM6B gene_id:2824|Hs108|chrX (265 aa)
 initn: 1757 init1: 1675 opt: 1675 Z-score: 2090.5 bits: 394.7 E(32554): 4e-110
Smith-Waterman score: 1681; 86.7% identity (86.7% similar) in 301 aa overlap (1-301:5-265)

                   10        20        30        40        50
pF1KE0     METAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
           ::::::::::::::::
CCDS14 MKPAMETAAEENTEQSQERK----------------------------------------
               10        20

         60        70        80        90       100       110
pF1KE0
GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL 30 40 50 60 70 80 120 130 140 150 160 170 pF1KE0 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT 90 100 110 120 130 140 180 190 200 210 220 230 pF1KE0 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF 150 160 170 180 190 200 240 250 260 270 280 290 pF1KE0 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED 210 220 230 240 250 260 300 pF1KE0 CCTKF ::::: CCDS14 CCTKF >>CCDS14514.1 PLP1 gene_id:5354|Hs108|chrX (242 aa) initn: 946 init1: 651 opt: 979 Z-score: 1223.8 bits: 234.2 E(32554): 7.6e-62 Smith-Waterman score: 979; 57.1% identity (83.2% similar) in 238 aa overlap (57-294:2-238) 30 40 50 60 70 80 pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL : .::: .:: :.:.:::::: ::: :::: CCDS14 MGLLECCARCLVGAPFASLVATGLCFFGVAL 10 20 30 90 100 110 120 130 140 pF1KE0 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF :::::: ::.:: ..: .:: : .:. : .::. .:::::: :::::::: .:::::: CCDS14 FCGCGHEALTGTEKLIETYFSKNYQDYEYLINVIHAFQYVIYGTASFFFLYGALLLAEGF 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE0 YTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEV :::.::... :..::: ::. .:. :: .::.: :.:: ::. :::::....: :.::. CCDS14 YTTGAVRQIFGDYKTTICGKGLSATFVGITYALTVVWLLVFACSAVPVYIYFNTWTTCQS 100 110 120 130 140 150 210 220 230 240 250 260 pF1KE0 IKSPQTNGTTGVEQICVDIRQYGIIPWNAFPGKICGSALENICNTNEFYMSYHLFIVACA : : .. .... ..:.: :.::..::::::::.::: : .::.: :: :..::::.: . 
CCDS14 IAFP-SKTSASIGSLCADARMYGVLPWNAFPGKVCGSNLLSICKTAEFQMTFHLFIAAFV 160 170 180 190 200 210 270 280 290 300 pF1KE0 GAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF ::.::...:: .:.:.:::.::::. .: CCDS14 GAAATLVSLLTFMIAATYNFAVLKLMGRGTKF 220 230 240 >>CCDS3824.1 GPM6A gene_id:2823|Hs108|chr4 (278 aa) initn: 800 init1: 389 opt: 855 Z-score: 1068.5 bits: 205.6 E(32554): 3.4e-53 Smith-Waterman score: 855; 51.2% identity (80.7% similar) in 244 aa overlap (54-290:10-243) 30 40 50 60 70 80 pF1KE0 EIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSG .. ::::::::::::.:::::.:::: ..: CCDS38 MEENMEEGQTQKGCFECCIKCLGGIPYASLIATILLYAG 10 20 30 90 100 110 120 130 140 pF1KE0 VALFCGCGHVALAGTVAILEQHF--STNASDHALLSEVIQLMQYVIYGIASFFFLYGIIL ::::::::: ::.::: ::. .: . .:.: . .:....:::::::. ::.:::.: CCDS38 VALFCGCGHEALSGTVNILQTYFEMARTAGDTLDVFTMIDIFKYVIYGIAAAFFVYGILL 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE0 LAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIW ..:::.::.:.:.:.:.:: :.::::.:. :..:::.. .::::: .:...::.:..:.: CCDS38 MVEGFFTTGAIKDLYGDFKITTCGRCVSAWFIMLTYLFMLAWLGVTAFTSLPVYMYFNLW 100 110 120 130 140 150 210 220 230 240 250 pF1KE0 STCEVIKSPQTNGTTGVE--QICVDIRQYGIIPWNAFPGKICGSALEN---ICNTNEFYM . :. .:: :: ..:.:.::.::. . ::: .. :: .:...:. : CCDS38 TICR--------NTTLVEGANLCLDLRQFGIVTIGE-EKKIC-TVSENFLRMCESTELNM 160 170 180 190 200 260 270 280 290 300 pF1KE0 SYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF ..:::::: :::::.:::.. :.:. . 
:.: .: CCDS38 TFHLFIVALAGAGAAVIAMVHYLMVLSANWAYVKDACRMQKYEDIKSKEEQELHDIHSTR 210 220 230 240 250 260 CCDS38 SKERLNAYT 270 >>CCDS54822.1 GPM6A gene_id:2823|Hs108|chr4 (267 aa) initn: 800 init1: 389 opt: 854 Z-score: 1067.5 bits: 205.4 E(32554): 3.9e-53 Smith-Waterman score: 854; 51.9% identity (80.9% similar) in 241 aa overlap (57-290:2-232) 30 40 50 60 70 80 pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL ::::::::::::.:::::.:::: ..:::: CCDS54 MGCFECCIKCLGGIPYASLIATILLYAGVAL 10 20 30 90 100 110 120 130 140 pF1KE0 FCGCGHVALAGTVAILEQHF--STNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAE :::::: ::.::: ::. .: . .:.: . .:....:::::::. ::.:::.:..: CCDS54 FCGCGHEALSGTVNILQTYFEMARTAGDTLDVFTMIDIFKYVIYGIAAAFFVYGILLMVE 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE0 GFYTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTC ::.::.:.:.:.:.:: :.::::.:. :..:::.. .::::: .:...::.:..:.:. : CCDS54 GFFTTGAIKDLYGDFKITTCGRCVSAWFIMLTYLFMLAWLGVTAFTSLPVYMYFNLWTIC 100 110 120 130 140 150 210 220 230 240 250 pF1KE0 EVIKSPQTNGTTGVE--QICVDIRQYGIIPWNAFPGKICGSALEN---ICNTNEFYMSYH . .:: :: ..:.:.::.::. . ::: .. :: .:...:. :..: CCDS54 R--------NTTLVEGANLCLDLRQFGIVTIGE-EKKIC-TVSENFLRMCESTELNMTFH 160 170 180 190 200 260 270 280 290 300 pF1KE0 LFIVACAGAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF ::::: :::::.:::.. :.:. . :.: .: CCDS54 LFIVALAGAGAAVIAMVHYLMVLSANWAYVKDACRMQKYEDIKSKEEQELHDIHSTRSKE 210 220 230 240 250 260 CCDS54 RLNAYT >>CCDS58936.1 GPM6A gene_id:2823|Hs108|chr4 (271 aa) initn: 800 init1: 389 opt: 854 Z-score: 1067.4 bits: 205.4 E(32554): 3.9e-53 Smith-Waterman score: 854; 51.9% identity (80.9% similar) in 241 aa overlap (57-290:6-236) 30 40 50 60 70 80 pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL ::::::::::::.:::::.:::: ..:::: CCDS58 MTDLEGCFECCIKCLGGIPYASLIATILLYAGVAL 10 20 30 90 100 110 120 130 140 pF1KE0 FCGCGHVALAGTVAILEQHF--STNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAE :::::: ::.::: ::. .: . .:.: . .:....:::::::. 
::.:::.:..: CCDS58 FCGCGHEALSGTVNILQTYFEMARTAGDTLDVFTMIDIFKYVIYGIAAAFFVYGILLMVE 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE0 GFYTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTC ::.::.:.:.:.:.:: :.::::.:. :..:::.. .::::: .:...::.:..:.:. : CCDS58 GFFTTGAIKDLYGDFKITTCGRCVSAWFIMLTYLFMLAWLGVTAFTSLPVYMYFNLWTIC 100 110 120 130 140 150 210 220 230 240 250 pF1KE0 EVIKSPQTNGTTGVE--QICVDIRQYGIIPWNAFPGKICGSALEN---ICNTNEFYMSYH . .:: :: ..:.:.::.::. . ::: .. :: .:...:. :..: CCDS58 R--------NTTLVEGANLCLDLRQFGIVTIGE-EKKIC-TVSENFLRMCESTELNMTFH 160 170 180 190 200 260 270 280 290 300 pF1KE0 LFIVACAGAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF ::::: :::::.:::.. :.:. . :.: .: CCDS58 LFIVALAGAGAAVIAMVHYLMVLSANWAYVKDACRMQKYEDIKSKEEQELHDIHSTRSKE 210 220 230 240 250 260 CCDS58 RLNAYT 270 >>CCDS14513.1 PLP1 gene_id:5354|Hs108|chrX (277 aa) initn: 926 init1: 489 opt: 496 Z-score: 621.2 bits: 122.9 E(32554): 2.8e-28 Smith-Waterman score: 899; 49.8% identity (72.5% similar) in 273 aa overlap (57-294:2-273) 30 40 50 60 70 80 pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL : .::: .:: :.:.:::::: ::: :::: CCDS14 MGLLECCARCLVGAPFASLVATGLCFFGVAL 10 20 30 90 100 110 120 130 140 pF1KE0 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF :::::: ::.:: ..: .:: : .:. : .::. .:::::: :::::::: .:::::: CCDS14 FCGCGHEALTGTEKLIETYFSKNYQDYEYLINVIHAFQYVIYGTASFFFLYGALLLAEGF 40 50 60 70 80 90 150 160 170 pF1KE0 YTTSAVKELHGEFKTTACGRCISGM----------------------------------- :::.::... :..::: ::. .:. CCDS14 YTTGAVRQIFGDYKTTICGKGLSATVTGGQKGRGSRGQHQAHSLERVCHCLGKWLGHPDK 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE0 FVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGII :: .::.: :.:: ::. :::::....: :.::. : : .. .... ..:.: :.::.. CCDS14 FVGITYALTVVWLLVFACSAVPVYIYFNTWTTCQSIAFP-SKTSASIGSLCADARMYGVL 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE0 PWNAFPGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKF ::::::::.::: : .::.: :: :..::::.: .::.::...:: .:.:.:::.::::. 
CCDS14 PWNAFPGKVCGSNLLSICKTAEFQMTFHLFIAAFVGAAATLVSLLTFMIAATYNFAVLKL
              220       230       240       250       260       270

             300
pF1KE0 KSREDCCTKF
        .:
CCDS14 MGRGTKF

301 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 06:17:55 2016 done: Thu Nov 3 06:17:55 2016
 Total Scan time: 2.350 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]