FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9424, 361 aa
  1>>>pF1KE9424 361 - 361 aa - 361 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5408+/-0.00105; mu= 14.8647+/- 0.062
 mean_var=99.3781+/-27.889, 0's: 0 Z-trim(104.0): 227 B-trim: 1000 in 2/48
 Lambda= 0.128656
 statistics sampled from 7394 (7685) to 7394 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.585), E-opt: 0.2 (0.236), width: 16
 Scan time: 2.690

The best scores are:                                    opt  bits E(32554)
CCDS55720.1 FFAR4 gene_id:338557|Hs108|chr10   ( 361) 2378 452.5 2.6e-127
CCDS31248.1 FFAR4 gene_id:338557|Hs108|chr10   ( 377) 1521 293.4 2.1e-79
CCDS3791.1 NPY2R gene_id:4887|Hs108|chr4       ( 381)  340  74.2 2e-13
CCDS4321.1 NMUR2 gene_id:56923|Hs108|chr5      ( 415)  306  67.9 1.7e-11
CCDS7606.1 PRLHR gene_id:2834|Hs108|chr10      ( 370)  297  66.2 5e-11
CCDS3719.1 QRFPR gene_id:84109|Hs108|chr4      ( 431)  294  65.7 8.2e-11

>>CCDS55720.1 FFAR4 gene_id:338557|Hs108|chr10  (361 aa)
 initn: 2378 init1: 2378 opt: 2378 Z-score: 2399.0 bits: 452.5 E(32554): 2.6e-127
Smith-Waterman score: 2378; 100.0% identity (100.0% similar) in 361 aa overlap (1-361:1-361)

               10        20        30        40        50        60
pF1KE9 MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE9 PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE9 LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE9 SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS
              310       320       330       340       350       360


pF1KE9 G
       :
CCDS55 G


>>CCDS31248.1 FFAR4 gene_id:338557|Hs108|chr10  (377 aa)
 initn: 1521 init1: 1521 opt: 1521 Z-score: 1539.1 bits: 293.4 E(32554): 2.1e-79
Smith-Waterman score: 2336; 95.8% identity (95.8% similar) in 377 aa overlap (1-361:1-377)

               10        20        30        40        50        60
pF1KE9 MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
              130       140       150       160       170       180

              190       200       210       220       230
pF1KE9 PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQ--------
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQTSEHLLDA
              190       200       210       220       230       240

                    240       250       260       270       280
pF1KE9 --------ITKASRKRLTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITI
               ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 RAVVTHSEITKASRKRLTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITI
              250       260       270       280       290       300

          290       300       310       320       330       340
pF1KE9 LLILIQNFKQDLVIWPSLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 LLILIQNFKQDLVIWPSLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGA
              310       320       330       340       350       360

          350       360
pF1KE9 ILTDTSVKRNDLSIISG
       :::::::::::::::::
CCDS31 ILTDTSVKRNDLSIISG
              370

>>CCDS3791.1 NPY2R gene_id:4887|Hs108|chr4  (381 aa)
 initn: 249 init1: 127 opt: 340 Z-score: 354.4 bits: 74.2 E(32554): 2e-13
Smith-Waterman score: 348; 29.1% identity (57.3% similar) in 323 aa overlap (46-351:52-355)

20 30 40 50 60 70 pF1KE9 SLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIF-AVSLLG----NVCALVLVARRRR ::.: . .. ::: .. :.. . CCDS37 YGPQTTPRGELVPDPEPELIDSTKLIEVQVVLILAYCSIILLGVIGNSLVIHVVIKFKSM 30 40 50 60 70 80 80 90 100 110 120 pF1KE9 RGATACLVLNLFCADLLFIS-AIPLVLAVRWTEAWLLGPVACHLLFYVMTLSGSVTILTL : .: .. :: :::: . .:..:. : .::: :::. :.. :. .:. .:: CCDS37 RTVTNFFIANLAVADLLVNTLCLPFTLTYTLMGEWKMGPVLCHLVPYAQGLAVQVSTITL 90 100 110 120 130 140 130 140 150 160 170 180 pF1KE9 AAVSLERMVCIV-HLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRV-VPQRLPGA ....:.: ::: ::. . ..: ....: :: ::. : :: .: . . . .: CCDS37 TVIALDRHRCIVYHLESKI---SKRISFLIIGLAWGISALLASPLAIFREYSLIEIIP-- 150 160 170 180 190 190 200 210 220 230 240 pF1KE9 DQEISICTLIWP----TIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKRLTV : :: :: :: .: : . ...: . . ...: .: .::..: . : .... CCDS37 DFEIVACTEKWPGEEKSIYGTV-YSLSSLLILYVLPLGIISFSYTRIWSKLK---NHVSP 200 210 220 230 240 250 250 260 270 280 290 300 pF1KE9 SLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWPSLF . : .. :: : :. .. : ..: : . : :. : . :.. :: . 
.: CCDS37 GAANDHYHQRR--QKTTKM---LVCVVVVFAVSWLPLHAFQLAVDIDSQVLDLKEYKLIF 260 270 280 290 300 310 320 330 340 350 pF1KE9 --FWVVAF--TFANSALNPILYNMTLCRNEWKKIFCCFWFPEK-GAILTDTSVKRNDLSI : ..:. ::: ::.::. . : : .. : .. :: ...:: CCDS37 TVFHIIAMCSTFA----NPLLYGW-MNSNYRKAFLSAFRCEQRLDAIHSEVSVTFKAKKN 310 320 330 340 350 360 360 pF1KE9 ISG CCDS37 LEVRKNSGPNDSFTEATNV 370 380

>>CCDS4321.1 NMUR2 gene_id:56923|Hs108|chr5  (415 aa)
 initn: 210 init1: 142 opt: 306 Z-score: 319.8 bits: 67.9 E(32554): 1.7e-11
Smith-Waterman score: 306; 27.9% identity (59.2% similar) in 326 aa overlap (13-323:21-329)

10 20 30 40 50 pF1KE9 MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFA :... .... . :. . .: .. ..: : : ::. CCDS43 MSGMEKLQNASWIYQQKLEDPFQKHLNSTEEYLAFLCGPRRSHFFLPVSV---VYVPIFV 10 20 30 40 50 60 70 80 90 100 pF1KE9 VSLLGNV--CALVLVARRRRRGATACLVLNLFCADLL-FISAIPLVLAVRWTE-AWLLGP :...::: : ::.. .. . : ...: .::: .. ..:: . : . .:.:: CCDS43 VGVIGNVLVC-LVILQHQAMKTPTNYYLFSLAVSDLLVLLLGMPLEVYEMWRNYPFLFGP 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE9 VACHLLFYVMTLSGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAV :.:.. .. ..::....::.::.: :.: :. :: .:...::.:.. CCDS43 VGCYFKTALFETVCFASILSITTVSVERYVAILHPFRAKLQSTRRRALRILGIVWGFSVL 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE9 AALPLC----VFFRVVPQR--LPGADQEISICTLIWPT-IPGEISWDVSFVTLNFLVPGL .:: . :. :. .::. . ::.: : : . : .:: : .:.: CCDS43 FSLPNTSIHGIKFHYFPNGSLVPGS----ATCTVIKPMWIYNFIIQVTSF--LFYLLPMT 180 190 200 210 220 230 230 240 250 260 270 pF1KE9 VIVISYSKILQITKASRKRLTVSLAYSESHQIRVSQQDFR--LFRTLFLLMVSFFIMWSP :: . : . : : . :: .:.. :. : . . ::.:.. : : :.: CCDS43 VISVLYYLM-----ALRLKKDKSLEADEGNA--NIQRPCRKSVNKMLFVLVLVFAICWAP 240 250 260 270 280 280 290 300 310 320 330 pF1KE9 IIITILLI-LIQNFKQDLVIWPSLFFWVVA-FTFANSALNPILYNMTLCRNEWKKIFCCF . : :.. ........:. .: : . : . .::.:::.::. 
CCDS43 FHIDRLFFSFVEEWSESLAAVFNLVHVVSGVFFYLSSAVNPIIYNLLSRRFQAAFQNVIS 290 300 310 320 330 340 340 350 360 pF1KE9 WFPEKGAILTDTSVKRNDLSIISG CCDS43 SFHKQWHSQHDPQLPPAQRNIFLTECHFVELTEDIGPQFPCQSSMHNSHLPAALSSEQMS 350 360 370 380 390 400

>>CCDS7606.1 PRLHR gene_id:2834|Hs108|chr10  (370 aa)
 initn: 202 init1: 97 opt: 297 Z-score: 311.4 bits: 66.2 E(32554): 5e-11
Smith-Waterman score: 338; 27.5% identity (56.7% similar) in 353 aa overlap (2-344:24-358)

10 20 30 pF1KE9 MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLV .: : ..: :. :. :.... :.: CCDS76 MASSTTRGPRVSDLFSGLPPAVTTPANQSAEASAGNGSVAGADAPAVTPFQSLQLVHQLK 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE9 LAAVETTVLVLIFAVSLLGNVCALVLVARRRRR--GATACLVLNLFCADLLFISA-IPLV : .:.. :.:.:: : :::: : :: ..: :. :: .:.:. .: .::. CCDS76 GLIVLLYSVVVV--VGLVGN-CLLVLVIARVRRLHNVTNFLIGNLALSDVLMCTACVPLT 70 80 90 100 110 100 110 120 130 140 150 pF1KE9 LAVRWT-EAWLLGPVACHLLFYVMTLSGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRA :: . ..:..: :::.:... .. :...::......:.: .:: : : . : CCDS76 LAYAFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRYVVLVHPLR--RRISLRL 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE9 RAVLLALIWGYSAVAALPLCVFFRVVPQRLPGADQEISICTLIWPTIPGE---ISWDVSF : . ::. ::: ::: : : . : ... .: .: . . .: . . CCDS76 SAYAVLAIWALSAVLALPAAVHTYHVELK-P---HDVRLCEEFWGSQERQRQLYAWGLLL 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE9 VTLNFLVPGLVIVISYSKILQITKASRKRLTVS-LAYSESHQIRVSQQDFRLFRTLFLLM :: .:.: :::..:: .... :.:.. . .. :.. :. .. : : : ... CCDS76 VT--YLLPLLVILLSY---VRVSVKLRNRVVPGCVTQSQADWDRARRR--RTFCLLVVIV 240 250 260 270 280 280 290 300 310 320 pF1KE9 VSFFIMWSPIIITILLILIQNFKQDLVIWPSLFFWVVAFTFANSALNPILYNMTL--CRN : : . : :. . :: .. : . . . ...... ::..: :. CCDS76 VVFAVCWLPLHVFNLLRDLDPHAIDPYAFGLVQLLCHWLAMSSACYNPFIYAWLHDSFRE 290 300 310 320 330 340 330 340 350 360 pF1KE9 EWKKIFCCFWFPEKGAILTDTSVKRNDLSIISG : .:.. 
: :.: : CCDS76 ELRKLLVA-W-PRKIAPHGQNMTVSVVI 350 360 370

>>CCDS3719.1 QRFPR gene_id:84109|Hs108|chr4  (431 aa)
 initn: 222 init1: 146 opt: 294 Z-score: 307.5 bits: 65.7 E(32554): 8.2e-11
Smith-Waterman score: 294; 26.2% identity (58.5% similar) in 313 aa overlap (36-336:43-350)

10 20 30 40 50 60 pF1KE9 ARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVCALVLV : :: : : : ::::..:.::. .. .: CCDS37 RLLRDHNLTREQFIALYRLRPLVYTPELPGRAKLALVLTGV--LIFALALFGNALVFYVV 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE9 ARRRR-RGATACLVLNLFCADLLF-ISAIPLVLAVRWTEAWLLGPVACHLLFYVMTLSGS .: . : .: .. .: .:::. . ::... .. :: : :... .:.. . CCDS37 TRSKAMRTVTNIFICSLALSDLLITFFCIPVTMLQNISDNWLGGAFICKMVPFVQSTAVV 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE9 VTILTLAAVSLERMVCIVH-LQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVVPQ . :::.. ...:: .:: .. . .::: . .:...: ..... :. . . CCDS37 TEILTMTCIAVERHQGLVHPFKMKWQYTNRRAFT-MLGVVWLVAVIVGSPM-WHVQQLEI 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 RLPGADQEISICTLIWPTIPGEISWDVSFV-TLNFLVPGLVIVISYSKILQITKASRKRL . .. :: : : : . . ..:. .. ::.: .:..: :::: .::. CCDS37 KYDFLYEKEHICCLEEWTSPVHQKIYTTFILVILFLLPLMVMLILYSKI-GYELWIKKRV 190 200 210 220 230 240 250 260 270 280 290 pF1KE9 TVSLAYSESHQIRVSQQDFRLFRTLFLLM--VSFF-IMWSPIIITILLILIQNFKQ--DL . . : ..:. . :...... :..: . :.:. .. ..: .::.. : CCDS37 GDGSVLRTIHGKEMSKIARKKKRAVIMMVTVVALFAVCWAPFHVVHMMIEYSNFEKEYDD 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE9 VIWPSLFFWVVAFTFANSALNPILY---NMTLCRNEWKKIFCCFWFPEKGAILTDTSVKR : .: : . :.:: :::.: : .. .: . . : CCDS37 VTIKMIFAIVQIIGFSNSICNPIVYAFMNENFKKNVLSAVCYCIVNKTFSPAQRHGNSGI 310 320 330 340 350 360 360 pF1KE9 NDLSIISG CCDS37 TMMRKKAKFSLRENPVEETKGEAFSDGNIEVKLCEQTEEKKKLKRHLALFRSELAENSPL 370 380 390 400 410 420


361 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 12:36:55 2016 done: Sun Nov 6 12:36:56 2016
 Total Scan time: 2.690 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]