FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1864, 385 aa 1>>>pF1KE1864 385 - 385 aa - 385 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9608+/-0.000833; mu= 19.0134+/- 0.050 mean_var=74.6230+/-15.067, 0's: 0 Z-trim(107.3): 75 B-trim: 5 in 2/49 Lambda= 0.148470 statistics sampled from 9445 (9524) to 9445 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.674), E-opt: 0.2 (0.293), width: 16 Scan time: 2.470 The best scores are: opt bits E(32554) CCDS56412.1 MICA gene_id:100507436|Hs108|chr6 ( 332) 2152 470.3 1.1e-132 CCDS43449.1 MICB gene_id:4277|Hs108|chr6 ( 383) 2123 464.1 9.3e-131 CCDS75423.1 MICB gene_id:4277|Hs108|chr6 ( 351) 1953 427.7 7.9e-120 CCDS75421.1 MICA gene_id:100507436|Hs108|chr6 ( 235) 1469 323.9 9.5e-89 CCDS75422.1 MICB gene_id:4277|Hs108|chr6 ( 340) 1286 284.8 7.9e-77 CCDS34394.1 B gene_id:3106|Hs108|chr6 ( 362) 518 120.3 2.7e-27 CCDS34393.1 C gene_id:3107|Hs108|chr6 ( 366) 515 119.7 4.3e-27 CCDS34373.1 A gene_id:3105|Hs108|chr6 ( 365) 505 117.5 1.9e-26 CCDS34379.1 E gene_id:3133|Hs108|chr6 ( 358) 502 116.9 2.9e-26 CCDS4578.1 HFE gene_id:3077|Hs108|chr6 ( 348) 499 116.2 4.5e-26 CCDS75412.1 HFE gene_id:3077|Hs108|chr6 ( 337) 494 115.1 9.2e-26 CCDS43438.1 F gene_id:3134|Hs108|chr6 ( 346) 493 114.9 1.1e-25 CCDS43437.1 F gene_id:3134|Hs108|chr6 ( 442) 493 115.0 1.3e-25 CCDS4668.1 G gene_id:3135|Hs108|chr6 ( 338) 491 114.5 1.4e-25 CCDS47387.1 HFE gene_id:3077|Hs108|chr6 ( 325) 484 113.0 3.9e-25 CCDS1342.1 MR1 gene_id:3140|Hs108|chr1 ( 341) 459 107.7 1.7e-23 CCDS4580.1 HFE gene_id:3077|Hs108|chr6 ( 260) 407 96.4 3.1e-20 CCDS5680.1 AZGP1 gene_id:563|Hs108|chr7 ( 298) 393 93.5 2.7e-19 CCDS12770.1 FCGRT gene_id:2217|Hs108|chr19 ( 365) 346 83.5 3.4e-16 CCDS4581.1 HFE gene_id:3077|Hs108|chr6 ( 168) 292 71.6 5.8e-13 CCDS4579.1 HFE gene_id:3077|Hs108|chr6 ( 256) 294 72.2 5.9e-13 CCDS53442.1 MR1 gene_id:3140|Hs108|chr1 ( 296) 289 71.2 1.4e-12 CCDS54975.1 HFE gene_id:3077|Hs108|chr6 ( 246) 268 66.6 2.7e-11 CCDS54974.1 HFE gene_id:3077|Hs108|chr6 ( 334) 268 66.7 3.4e-11 CCDS47386.1 HFE gene_id:3077|Hs108|chr6 ( 242) 263 65.5 5.6e-11 >>CCDS56412.1 MICA gene_id:100507436|Hs108|chr6 (332 aa) initn: 2152 init1: 2152 opt: 2152 Z-score: 2495.4 bits: 470.3 E(32554): 1.1e-132 Smith-Waterman score: 2152; 98.4% identity (99.4% similar) in 317 aa overlap (1-317:1-317) 10 20 30 40 50 60 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFLRYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFLRYD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RQKCRAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIKDQKEGLHSLQEIRVCE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 RQKCRAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIKDQKEGLHSLQEIRVCE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 IHEDNSTRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLKEDAMKTKTHY ::::::::::::::::::::::::.::::::::::::::::::::::::::::::::::: CCDS56 IHEDNSTRSSQHFYYDGELFLSQNLETEEWTVPQSSRAQTLAMNVRNFLKEDAMKTKTHY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 HAMHADCLQELRRYLESSVVLRRRVPPMVNVTRSEASEGNITVTCRASSFYPRNITLTWR :::::::::::::::::.::::: ::::::::::::::::::::::::::::::: :::: CCDS56 HAMHADCLQELRRYLESGVVLRRTVPPMVNVTRSEASEGNITVTCRASSFYPRNIILTWR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 QDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGNHSTHPVPSGKV :::::::::::::::::::::::::::::::::.:::::::::::::::::::::::::: CCDS56 QDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICRGEEQRFTCYMEHSGNHSTHPVPSGKV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 LVLQSHWQTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSLQVLDQHPVGTSD ::::::::::::::::: CCDS56 LVLQSHWQTFHVSAVAAGCCYFCYYYFLCPLL 310 320 330 >>CCDS43449.1 MICB gene_id:4277|Hs108|chr6 (383 aa) initn: 2123 init1: 1768 opt: 2123 Z-score: 2461.0 bits: 464.1 E(32554): 9.3e-131 Smith-Waterman score: 2123; 83.6% identity (91.7% similar) in 384 aa overlap (1-384:1-382) 10 20 30 40 50 60 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFLRYD :::: :.:.:: ::::::.:::::::::::: ::: ::::::::::: ::::::::::: CCDS43 MGLGRVLLFLAVAFPFAPPAAAAEPHSLRYNLMVLSQDGSVQSGFLAEGHLDGQPFLRYD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RQKCRAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIKDQKEGLHSLQEIRVCE ::: ::::::::::.::: :::: ::.::: ::.::: ::.:::::: :::::::::::: CCDS43 RQKRRAKPQGQWAENVLGAKTWDTETEDLTENGQDLRRTLTHIKDQKGGLHSLQEIRVCE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 IHEDNSTRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLKEDAMKTKTHY ::::.:::.:.:::::::::::::.::.: ::::::::::::::: :: ::::::::::: CCDS43 IHEDSSTRGSRHFYYDGELFLSQNLETQESTVPQSSRAQTLAMNVTNFWKEDAMKTKTHY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 HAMHADCLQELRRYLESSVVLRRRVPPMVNVTRSEASEGNITVTCRASSFYPRNITLTWR .::.:::::.:.:::.:.:..:: :::::::: ::.:::::::::::::::::::::::: CCDS43 RAMQADCLQKLQRYLKSGVAIRRTVPPMVNVTCSEVSEGNITVTCRASSFYPRNITLTWR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 QDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGNHSTHPVPSGKV ::::::::.::::::::::::::::::::::: :::::::::::::::::.::::::::. CCDS43 QDGVSLSHNTQQWGDVLPDGNGTYQTWVATRIRQGEEQRFTCYMEHSGNHGTHPVPSGKA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 LVLQSHWQTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSLQVLDQHPVGTSD :::::. : :.:: . .:::. : :::::::::::::::::::::::::::.: CCDS43 LVLQSQRTDFPY--VSAAMPCFVIIIILCVPCCKKKTSAAEGPELVSLQVLDQHPVGTGD 310 320 330 340 350 370 380 pF1KE1 HRDATQLGFQPLMSALGSTGSTEGA ::::.:::::::::: :::::::: CCDS43 HRDAAQLGFQPLMSATGSTGSTEGT 360 370 380 >>CCDS75423.1 MICB gene_id:4277|Hs108|chr6 (351 aa) initn: 1953 init1: 1598 opt: 1953 Z-score: 2264.7 bits: 427.7 E(32554): 7.9e-120 Smith-Waterman score: 1953; 84.0% identity (92.0% similar) in 351 aa overlap (34-384:2-350) 10 20 30 40 50 60 pF1KE1 GPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFLRYDRQK ::: ::::::::::: :::::::::::::: CCDS75 MVLSQDGSVQSGFLAEGHLDGQPFLRYDRQK 10 20 30 70 80 90 100 110 120 pF1KE1 CRAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIKDQKEGLHSLQEIRVCEIHE ::::::::::.::: :::: ::.::: ::.::: ::.:::::: ::::::::::::::: CCDS75 RRAKPQGQWAENVLGAKTWDTETEDLTENGQDLRRTLTHIKDQKGGLHSLQEIRVCEIHE 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE1 DNSTRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLKEDAMKTKTHYHAM :.:::.:.:::::::::::::.::.: ::::::::::::::: :: :::::::::::.:: CCDS75 DSSTRGSRHFYYDGELFLSQNLETQESTVPQSSRAQTLAMNVTNFWKEDAMKTKTHYRAM 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE1 HADCLQELRRYLESSVVLRRRVPPMVNVTRSEASEGNITVTCRASSFYPRNITLTWRQDG .:::::.:.:::.:.:..:: :::::::: ::.::::::::::::::::::::::::::: CCDS75 QADCLQKLQRYLKSGVAIRRTVPPMVNVTCSEVSEGNITVTCRASSFYPRNITLTWRQDG 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE1 VSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGNHSTHPVPSGKVLVL :::::.::::::::::::::::::::::: :::::::::::::::::.::::::::.::: CCDS75 VSLSHNTQQWGDVLPDGNGTYQTWVATRIRQGEEQRFTCYMEHSGNHGTHPVPSGKALVL 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE1 QSHWQTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSLQVLDQHPVGTSDHRD ::. : :.:: . .:::. : :::::::::::::::::::::::::::.:::: CCDS75 QSQRTDFPY--VSAAMPCFVIIIILCVPCCKKKTSAAEGPELVSLQVLDQHPVGTGDHRD 280 290 300 310 320 370 380 pF1KE1 ATQLGFQPLMSALGSTGSTEGA :.:::::::::: :::::::: CCDS75 AAQLGFQPLMSATGSTGSTEGT 330 340 350 >>CCDS75421.1 MICA gene_id:100507436|Hs108|chr6 (235 aa) initn: 1469 init1: 1469 opt: 1469 Z-score: 1706.8 bits: 323.9 E(32554): 9.5e-89 Smith-Waterman score: 1469; 97.7% identity (99.1% similar) in 220 aa overlap (98-317:1-220) 70 80 90 100 110 120 pF1KE1 PQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIKDQKEGLHSLQEIRVCEIHEDNST :::::::::::::::::::::::::::::: CCDS75 MTLAHIKDQKEGLHSLQEIRVCEIHEDNST 10 20 30 130 140 150 160 170 180 pF1KE1 RSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLKEDAMKTKTHYHAMHADC :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: CCDS75 RSSQHFYYDGELFLSQNLETEEWTVPQSSRAQTLAMNVRNFLKEDAMKTKTHYHAMHADC 40 50 60 70 80 90 190 200 210 220 230 240 pF1KE1 LQELRRYLESSVVLRRRVPPMVNVTRSEASEGNITVTCRASSFYPRNITLTWRQDGVSLS ::::::::::.::::: ::::::::::::::::::::::::::::::: ::::::::::: CCDS75 LQELRRYLESGVVLRRTVPPMVNVTRSEASEGNITVTCRASSFYPRNIILTWRQDGVSLS 100 110 120 130 140 150 250 260 270 280 290 300 pF1KE1 HDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGNHSTHPVPSGKVLVLQSHW ::::::::::::::::::::::::::.::::::::::::::::::::::::::::::::: CCDS75 HDTQQWGDVLPDGNGTYQTWVATRICRGEEQRFTCYMEHSGNHSTHPVPSGKVLVLQSHW 160 170 180 190 200 210 310 320 330 340 350 360 pF1KE1 QTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSLQVLDQHPVGTSDHRDATQL :::::::::: CCDS75 QTFHVSAVAAGCCYFCYYYFLCPLL 220 230 >>CCDS75422.1 MICB gene_id:4277|Hs108|chr6 (340 aa) initn: 1863 init1: 931 opt: 1286 Z-score: 1492.8 bits: 284.8 E(32554): 7.9e-77 Smith-Waterman score: 1779; 74.2% identity (81.0% similar) in 384 aa overlap (1-384:1-339) 10 20 30 40 50 60 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFLRYD :::: :.:.:: ::::::.:::::::::::: ::: ::::::::::: ::::::::::: CCDS75 MGLGRVLLFLAVAFPFAPPAAAAEPHSLRYNLMVLSQDGSVQSGFLAEGHLDGQPFLRYD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RQKCRAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIKDQKEGLHSLQEIRVCE ::: ::::::::::.::: :::: ::.::: ::.::: ::.:::::: : CCDS75 RQKRRAKPQGQWAENVLGAKTWDTETEDLTENGQDLRRTLTHIKDQK-G----------- 70 80 90 100 130 140 150 160 170 180 pF1KE1 IHEDNSTRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLKEDAMKTKTHY :::::::::::::: :: ::::::::::: CCDS75 -------------------------------VPQSSRAQTLAMNVTNFWKEDAMKTKTHY 110 120 130 190 200 210 220 230 240 pF1KE1 HAMHADCLQELRRYLESSVVLRRRVPPMVNVTRSEASEGNITVTCRASSFYPRNITLTWR .::.:::::.:.:::.:.:..:: :::::::: ::.:::::::::::::::::::::::: CCDS75 RAMQADCLQKLQRYLKSGVAIRRTVPPMVNVTCSEVSEGNITVTCRASSFYPRNITLTWR 140 150 160 170 180 190 250 260 270 280 290 300 pF1KE1 QDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGNHSTHPVPSGKV ::::::::.::::::::::::::::::::::: :::::::::::::::::.::::::::. CCDS75 QDGVSLSHNTQQWGDVLPDGNGTYQTWVATRIRQGEEQRFTCYMEHSGNHGTHPVPSGKA 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE1 LVLQSHWQTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSLQVLDQHPVGTSD :::::. : :.:: . .:::. : :::::::::::::::::::::::::::.: CCDS75 LVLQSQRTDFPY--VSAAMPCFVIIIILCVPCCKKKTSAAEGPELVSLQVLDQHPVGTGD 260 270 280 290 300 310 370 380 pF1KE1 HRDATQLGFQPLMSALGSTGSTEGA ::::.:::::::::: :::::::: CCDS75 HRDAAQLGFQPLMSATGSTGSTEGT 320 330 340 >>CCDS34394.1 B gene_id:3106|Hs108|chr6 (362 aa) initn: 295 init1: 137 opt: 518 Z-score: 603.4 bits: 120.3 E(32554): 2.7e-27 Smith-Waterman score: 518; 30.1% identity (63.6% similar) in 349 aa overlap (6-342:9-341) 10 20 30 40 50 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFL :.:::.. . .. :.. ::.:: : .: : . :.. ..: :. CCDS34 MLVMAPRTVLLLLSAALALTETWAGS--HSMRYFYTSVSRPGRGEPRFISVGYVDDTQFV 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 RYDRQKC--RAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIK---DQKE-GLH :.: . : .:.. : :. : . :::.:. .... : .: ... .:.: : : CCDS34 RFDSDAASPREEPRAPWIEQE-GPEYWDRNTQIYKAQAQTDRESLRNLRGYYNQSEAGSH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SLQEIRVCEIHEDNST-RSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLK .:: . :.. :. :. ... :::. ... : . . ::. ... :: .. :.. CCDS34 TLQSMYGCDVGPDGRLLRGHDQYAYDGKDYIALNEDLRSWTAADTA-AQ---ITQRKW-- 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 EDAMKTKTHYHAMHADCLQELRRYLESSV-VLRRRVPPMVNVTRSEASEGNITVTCRASS : : ... . ....:.. ::::::.. :.: :: ..::. :. . :. : : . CCDS34 EAAREAEQRRAYLEGECVEWLRRYLENGKDKLERADPPKTHVTHHPISDHEATLRCWALG 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 FYPRNITLTWRQDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGN ::: .:::::..:: . ..::. .. : :. :.: :.:. . .:::::.::...: : CCDS34 FYPAEITLTWQRDGEDQTQDTELV-ETRPAGDRTFQKWAAVVVPSGEEQRYTCHVQHEG- 240 250 260 270 280 290 300 310 320 330 340 pF1KE1 HSTHPVPSGKVLVLQ-SHWQTFHVSAVAAAAAAIFVIIIFYVRC---CKKKTSAAEGPEL .:. .: . : .: . ...:. :.. :..: : :..:.:...: CCDS34 -----LPKPLTLRWEPSSQSTVPIVGIVAGLAVLAVVVIGAVVAAVMCRRKSSGGKGGSY 290 300 310 320 330 340 350 360 370 380 pF1KE1 VSLQVLDQHPVGTSDHRDATQLGFQPLMSALGSTGSTEGA CCDS34 SQAACSDSAQGSDVSLTA 350 360 >>CCDS34393.1 C gene_id:3107|Hs108|chr6 (366 aa) initn: 361 init1: 138 opt: 515 Z-score: 599.8 bits: 119.7 E(32554): 4.3e-27 Smith-Waterman score: 515; 30.0% identity (63.7% similar) in 350 aa overlap (6-342:9-342) 10 20 30 40 50 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFL ..:::.: . .. : . ::.:: :..: : . :.. ..: :. CCDS34 MRVMAPRALLLLLSGGLALTETWACS--HSMRYFDTAVSRPGRGEPRFISVGYVDDTQFV 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 RYDRQKC--RAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIK---DQKE-GLH :.: . :..:.. :.:. : . :::::. ... :..: ... .:.: : : CCDS34 RFDSDAASPRGEPRAPWVEQE-GPEYWDRETQKYKRQAQADRVSLRNLRGYYNQSEDGSH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SLQEIRVCEIHEDNST-RSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLK .::.. :.. :. :. .. :::. ... : . . ::. :.: :. .. : CCDS34 TLQRMSGCDLGPDGRLLRGYDQSAYDGKDYIALNEDLRSWTA-----ADTAAQITQR--K 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 EDAMKTKTHYHA-MHADCLQELRRYLESSV-VLRRRVPPMVNVTRSEASEGNITVTCRAS .: .. . .: ... :.. ::::::.. .:.: :: ..::. :. . :. : : CCDS34 LEAARAAEQLRAYLEGTCVEWLRRYLENGKETLQRAEPPKTHVTHHPLSDHEATLRCWAL 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 SFYPRNITLTWRQDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSG .::: .:::::..:: . ..::. .. : :.::.: :.:. . .:.:::.::.:.: : CCDS34 GFYPAEITLTWQRDGEDQTQDTELV-ETRPAGDGTFQKWAAVVVPSGQEQRYTCHMQHEG 240 250 260 270 280 290 300 310 320 330 340 pF1KE1 NHSTHPVPSGKVLVLQSHWQTFHVSAVAAAAAAIFVIIIF----YVRCCKKKTSAAEGPE . .:. . : :. . ...:. :.. :. .. . :..:.:...: CCDS34 LQ--EPLT---LSWEPSSQPTIPIMGIVAGLAVLVVLAVLGAVVTAMMCRRKSSGGKGGS 290 300 310 320 330 340 350 360 370 380 pF1KE1 LVSLQVLDQHPVGTSDHRDATQLGFQPLMSALGSTGSTEGA CCDS34 CSQAACSNSAQGSDESLITCKA 350 360 >>CCDS34373.1 A gene_id:3105|Hs108|chr6 (365 aa) initn: 353 init1: 136 opt: 505 Z-score: 588.3 bits: 117.5 E(32554): 1.9e-26 Smith-Waterman score: 505; 31.6% identity (63.6% similar) in 291 aa overlap (6-288:9-289) 10 20 30 40 50 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFL ..:::.: . .. :.. ::.:: .: .: : . :.: ..: :. CCDS34 MAVMAPRTLLLLLSGALALTQTWAGS--HSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFV 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 RYDRQKC--RAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIK---DQKE-GLH :.: . : .:.. : :. : . ::.:::.. .... :. :. .. .:.: : : CCDS34 RFDSDAASQRMEPRAPWIEQE-GPEYWDQETRNVKAQSQTDRVDLGTLRGYYNQSEAGSH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SLQEIRVCEIHEDNS-TRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLK ..: . :.. :. :. .. :::. ... : . . :: : .: .. . CCDS34 TIQIMYGCDVGSDGRFLRGYRQDAYDGKDYIALNEDLRSWT------AADMAAQITKRKW 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 EDAMKTKTHYHAMHADCLQELRRYLESSV-VLRRRVPPMVNVTRSEASEGNITVTCRASS : : ... . . :.. ::::::.. .:.: :: ...:. :. . :. : : . CCDS34 EAAHEAEQLRAYLDGTCVEWLRRYLENGKETLQRTDPPKTHMTHHPISDHEATLRCWALG 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 FYPRNITLTWRQDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGN ::: .:::::..:: . ..::. .. : :.::.: :.:. . .:::::.::...: : CCDS34 FYPAEITLTWQRDGEDQTQDTELV-ETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGL 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE1 HSTHPVPSGKVLVLQSHWQTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSLQ CCDS34 PKPLTLRWELSSQPTIPIVGIIAGLVLLGAVITGAVVAAVMWRRKSSDRKGGSYTQAASS 300 310 320 330 340 350 >>CCDS34379.1 E gene_id:3133|Hs108|chr6 (358 aa) initn: 388 init1: 142 opt: 502 Z-score: 584.9 bits: 116.9 E(32554): 2.9e-26 Smith-Waterman score: 502; 33.3% identity (63.9% similar) in 294 aa overlap (4-288:4-286) 10 20 30 40 50 60 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFLRYD : ..:::. . .. :.. :::.: : .: : . :.. ..: :.:.: CCDS34 MVDGTLLLLLSEALALTQTWAGS--HSLKYFHTSVSRPGRGEPRFISVGYVDDTQFVRFD 10 20 30 40 50 70 80 90 100 110 pF1KE1 RQKC--RAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAHIK---DQKE-GLHSLQ . : :.. : :. :.. ::::::. ... .:..: .. .:.: : :.:: CCDS34 NDAASPRMVPRAPWMEQE-GSEYWDRETRSARDTAQIFRVNLRTLRGYYNQSEAGSHTLQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 EIRVCEIHEDNS-TRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLKEDA .. ::. :. :. ..: :::. .:. : . . :: : : .. . ..:: CCDS34 WMHGCELGPDGRFLRGYEQFAYDGKDYLTLNEDLRSWT------AVDTAAQISEQKSNDA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 MKTKTHYHAMHAD-CLQELRRYLESSV-VLRRRVPPMVNVTRSEASEGNITVTCRASSFY ... : .:. : :.. :..:::.. .: . :: ..::. :. . :. : : .:: CCDS34 SEAE-HQRAYLEDTCVEWLHKYLEKGKETLLHLEPPKTHVTHHPISDHEATLRCWALGFY 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 PRNITLTWRQDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGNHS : .:::::.::: . ..::. .. : :.::.: :.:. . .:::::.::...: : CCDS34 PAEITLTWQQDGEGHTQDTELV-ETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPE 240 250 260 270 280 300 310 320 330 340 350 pF1KE1 THPVPSGKVLVLQSHWQTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSLQVL CCDS34 PVTLRWKPASQPTIPIVGIIAGLVLLGSVVSGAVVAAVIWRKKSSGGKGGSYSKAEWSDS 290 300 310 320 330 340 >>CCDS4578.1 HFE gene_id:3077|Hs108|chr6 (348 aa) initn: 424 init1: 251 opt: 499 Z-score: 581.6 bits: 116.2 E(32554): 4.5e-26 Smith-Waterman score: 499; 29.6% identity (59.5% similar) in 348 aa overlap (5-342:7-340) 10 20 30 40 50 pF1KE1 MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSGFLAEVHLDGQPFLR :..::: . . : . :::.: . : . : : : ..: : :. CCDS45 MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 YDRQKCRAKPQGQWAEDVLGNKTWDRETRDLTGNGK----DLRMTLAHIKDQKEGLHSLQ ::... :..:. :. . .... : . ...: : . :. . . . .::. :.:: CCDS45 YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKES-HTLQ 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 EIRVCEIHEDNSTRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQTLAMNVRNFLKEDAM : ::..:::::.. .. :::. : .: .: . :. : .. .. . CCDS45 VILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRA-----AEPRAWPTKLEWERHKI 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 KTKTHYHAMHADCLQELRRYLE-SSVVLRRRVPPMVNVTRSEASEGNITVTCRASSFYPR ... . .. :: .:.. :: . :: ..:::.:.::. ... . :. ::: ..::. CCDS45 RARQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTH-HVTSSVTTLRCRALNYYPQ 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 NITLTWRQDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGEEQRFTCYMEHSGNHST- :::. : .: .. . ::::.:.:::: :.. . :::::.:: .:: : . CCDS45 NITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPL 240 250 260 270 280 290 300 310 320 330 340 pF1KE1 ----HPVPSGKVLVLQSHWQTFHVSAVAAAAAAIFVIIIFYVRCCKKKTSAAEGPELVSL .: ::: .::. .:..:. .. .:. :.: . .. . .: : CCDS45 IVIWEPSPSG-TLVIGV------ISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAE 300 310 320 330 340 350 360 370 380 pF1KE1 QVLDQHPVGTSDHRDATQLGFQPLMSALGSTGSTEGA CCDS45 RE 385 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 21:27:20 2016 done: Sun Nov 6 21:27:20 2016 Total Scan time: 2.470 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]