FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2663, 491 aa
  1>>>pF1KE2663 491 - 491 aa - 491 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7899+/-0.000889; mu= 16.1700+/- 0.054
 mean_var=80.4507+/-15.668, 0's: 0 Z-trim(107.4): 16 B-trim: 0 in 0/50
 Lambda= 0.142991
 statistics sampled from 9544 (9554) to 9544 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.662), E-opt: 0.2 (0.293), width: 16
 Scan time: 2.570

The best scores are:                                       opt  bits E(32554)
CCDS5140.1 TMEM200A gene_id:114801|Hs108|chr6      ( 491) 3177 665.1 4.7e-191
CCDS45825.1 TMEM200C gene_id:645369|Hs108|chr18    ( 621)  335  78.9 1.7e-14
CCDS30658.1 TMEM200B gene_id:399474|Hs108|chr1     ( 307)  313  74.2 2.3e-13

>>CCDS5140.1 TMEM200A gene_id:114801|Hs108|chr6 (491 aa)
 initn: 3177 init1: 3177 opt: 3177 Z-score: 3543.6 bits: 665.1 E(32554): 4.7e-191
Smith-Waterman score: 3177; 100.0% identity (100.0% similar) in 491 aa overlap (1-491:1-491)

               10        20        30        40        50        60
pF1KE2 MIATGGVITGLAALKRQDSARSQQHVNLSPSPATQEKKPIRRRPRADVVVVRGKIRLYSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 MIATGGVITGLAALKRQDSARSQQHVNLSPSPATQEKKPIRRRPRADVVVVRGKIRLYSP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 SGFFLILGVLISIIGIAMAVLGYWPQKEHFIDAETTLSTNETQVIRNEGGVVVRFFEQHL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 SGFFLILGVLISIIGIAMAVLGYWPQKEHFIDAETTLSTNETQVIRNEGGVVVRFFEQHL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 HSDKMKMLGPFTMGIGIFIFICANAILHENRDKETKIIHMRDIYSTVIDIHTLRIKEQRQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 HSDKMKMLGPFTMGIGIFIFICANAILHENRDKETKIIHMRDIYSTVIDIHTLRIKEQRQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 MNGMYTGLMGETEVKQNGSSCASRLAANTIASFSGFRSSFRMDSSVEEDELMLNEGKSSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 MNGMYTGLMGETEVKQNGSSCASRLAANTIASFSGFRSSFRMDSSVEEDELMLNEGKSSG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE2 HLMPPLLSDSSVSVFGLYPPPSKTTDDKTSGSKKCETKSIVSSSISAFTLPVIKLNNCVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 HLMPPLLSDSSVSVFGLYPPPSKTTDDKTSGSKKCETKSIVSSSISAFTLPVIKLNNCVI
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE2 DEPSIDNITEDADNLKSRSRNLSMDSLVVPLPNTSESFQPVSTVLPRNNSIGESLSSQYK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 DEPSIDNITEDADNLKSRSRNLSMDSLVVPLPNTSESFQPVSTVLPRNNSIGESLSSQYK
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE2 SSMALGPGAGQLLSPGAARRQFGSNTSLHLLSSHSKSLDLDRGPSTLTVQAEQRKHPSWP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 SSMALGPGAGQLLSPGAARRQFGSNTSLHLLSSHSKSLDLDRGPSTLTVQAEQRKHPSWP
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE2 RLDRNNSKGYMKLENKEDPMDRLLVPQVAIKKDFTNKEKLLMISRSHNNLSFEHDEFLSN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS51 RLDRNNSKGYMKLENKEDPMDRLLVPQVAIKKDFTNKEKLLMISRSHNNLSFEHDEFLSN
              430       440       450       460       470       480

              490
pF1KE2 NLKRGTSETRF
       :::::::::::
CCDS51 NLKRGTSETRF
              490

>>CCDS45825.1 TMEM200C gene_id:645369|Hs108|chr18 (621 aa)
 initn: 667 init1: 316 opt: 335 Z-score: 373.5 bits: 78.9 E(32554): 1.7e-14
Smith-Waterman score: 453; 33.2% identity (55.9% similar) in 304 aa overlap (19-272:11-299)

10 20 30 40 50 60
pF1KE2 MIATGGVITGLAALKRQDSARSQQHVNLSPSPATQEKKPIRRRPRADVVVVRGKIRLYSP
:::.:. . :: ..:. ..: . :::::.::..: :
CCDS45 MIATGGLLRISARKQDPLR-PPSQIPKRKRKAKKRRKNDVVVVKGKLKLCSI
10 20 30 40 50

70 80 90 100
pF1KE2 SGFFLILGVLISIIGIAMAVLGYWPQ-----KE-----------HFI----DAETTLSTN
::.. . :.:. ..::::::.::::. .: : . .. .. : :
CCDS45 SGLIALCGILVLLVGIAMAVVGYWPKATGTNREGGKQLPPAGSSHRVPTTANSSSSGSKN
60 70 80 90 100 110

110 120 130
pF1KE2 ETQV-IRNEGGV----------------------------VVRFFEQHLHSDKMKMLGPF
... : ::: :.: .:::::.:..::.
CCDS45 RSRSHPRAPGGVNSSSAGAPRSTPPARAASPSSSSTSVGFFFRIFSGYLHSDKLKVFGPL
120 130 140 150 160 170

140 150 160 170 180 190
pF1KE2 TMGIGIFIFICANAILHENRDKETKIIHMRDIYSTVIDIHTLRIKEQRQMNGMYTGLMGE
::::::.::::::.:::::::.::::..::.::::::.:.:: :. . .. .
CCDS45 IMGIGIFLFICANAVLHENRDKKTKIINLRDLYSTVIDVHSLRAKD------LAAAAAAA
180 190 200 210 220

200 210 220 230 240 250
pF1KE2 TEVKQNGSSCASRLAANTIASFSGFRSSFRMDSSVEEDELMLNEGKSSGHLMPPLLSDSS
. . ..:: : : ..:: : :. : :. : .: .. ..
CCDS45 AAAAASSSSSAPAAAPPGAIPLNGFLS------YVQSRGLELKPGGCGGS--GDAFGAAA
230 240 250 260 270

260 270 280 290 300 310
pF1KE2 VSVFGLYPP-PSKTTDDKTSGSKKCETKSIVSSSISAFTLPVIKLNNCVIDEPSIDNITE
. . : .:: :. . . :.
CCDS45 MLAKGSWPPHPAAPSGGRPRGAASPPDLASSPRCPREPPSLAEAVYSVYRERSGVAGSRR
280 290 300 310 320 330

>>CCDS30658.1 TMEM200B gene_id:399474|Hs108|chr1 (307 aa)
 initn: 431 init1: 165 opt: 313 Z-score: 353.5 bits: 74.2 E(32554): 2.3e-13
Smith-Waterman score: 313; 41.2% identity (70.6% similar) in 119 aa overlap (41-156:30-145)

20 30 40 50 60
pF1KE2 LAALKRQDSARSQQHVNLSPSPATQEKKPIRRRPRA--DVVVVRGKIRLYSPSGFFLILG
:::::. . . ::...:: :::: : ::
CCDS30 MTAGSPEECGEVRRSPEGRVSRLGRRLGRRRRPRSPPEPLRVRARLRLRSPSGAFAALG
10 20 30 40 50

70 80 90 100 110 120
pF1KE2 VLISIIGIAMAVLGYWPQKEHFIDAETT-LSTNETQVIRNEGGVVVRFFEQHLHSDKMKM
.:. ..:...:: ::::.. .... :. . . .: :: : : .....
CCDS30 ALVVLVGMGIAVAGYWPHRAGAPGSRAANASSPQMSELRREGRGGGRAHGPH---ERLRL
60 70 80 90 100 110

130 140 150 160 170 180
pF1KE2 LGPFTMGIGIFIFICANAILHENRDKETKIIHMRDIYSTVIDIHTLRIKEQRQMNGMYTG
::: ::.:.:.:::::..:.:::: ::.
CCDS30 LGPVIMGVGLFVFICANTLLYENRDLETRRLRQGVLRAQALRPPDGPGWDCALLPSPGPR
120 130 140 150 160 170

491 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Jan 20 09:30:13 2017 done: Fri Jan 20 09:30:13 2017
 Total Scan time: 2.570 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]