FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9816, 300 aa
1>>>pF1KB9816 300 - 300 aa - 300 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2625+/-0.000757; mu= 15.9901+/- 0.045
mean_var=91.4052+/-24.599, 0's: 0 Z-trim(109.1): 162 B-trim: 1305 in 2/49
Lambda= 0.134149
statistics sampled from 10421 (10662) to 10421 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.328), width: 16
Scan time: 2.720
The best scores are:                                 opt  bits E(32554)
CCDS12458.1 FFAR1 gene_id:2864|Hs108|chr19   ( 300) 2046 406.0 1.8e-113
CCDS12459.1 FFAR3 gene_id:2865|Hs108|chr19   ( 346)  447  96.6 2.9e-20
CCDS12461.1 FFAR2 gene_id:2867|Hs108|chr19   ( 330)  424  92.1 6.1e-19
CCDS4032.1 F2R gene_id:2149|Hs108|chr5       ( 425)  313  70.7 2.1e-12
CCDS14115.1 P2RY8 gene_id:286530|Hs108|chrY  ( 359)  300  68.1 1.1e-11
CCDS14115.1 P2RY8 gene_id:286530|Hs108|chrX  ( 359)  300  68.1 1.1e-11
CCDS12350.1 F2RL3 gene_id:9002|Hs108|chr19   ( 385)  287  65.6 6.5e-11
>>CCDS12458.1 FFAR1 gene_id:2864|Hs108|chr19 (300 aa)
initn: 2046 init1: 2046 opt: 2046 Z-score: 2150.6 bits: 406.0 E(32554): 1.8e-113
Smith-Waterman score: 2046; 100.0% identity (100.0% similar) in 300 aa overlap (1-300:1-300)
10 20 30 40 50 60
pF1KB9 MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTPSLVYALNLGCSDLLLTVSLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTPSLVYALNLGCSDLLLTVSLP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 LKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSAGRYLGAAFPLGYQAFRRP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSAGRYLGAAFPLGYQAFRRP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 CYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGINTPVNGSPVCLEAWDPASAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 CYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGINTPVNGSPVCLEAWDPASAG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 PARFSLSLLLFFLPLAITAFCYVGCLRALARSGLTHRRKLRAAWVAGGALLTLLLCVGPY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PARFSLSLLLFFLPLAITAFCYVGCLRALARSGLTHRRKLRAAWVAGGALLTLLLCVGPY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 NASNVASFLYPNLGGSWRKLGLITGAWSVVLNPLVTGYLGRGPGLKTVCAARTQGGKSQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 NASNVASFLYPNLGGSWRKLGLITGAWSVVLNPLVTGYLGRGPGLKTVCAARTQGGKSQK
250 260 270 280 290 300
>>CCDS12459.1 FFAR3 gene_id:2865|Hs108|chr19 (346 aa)
initn: 383 init1: 215 opt: 447 Z-score: 477.3 bits: 96.6 E(32554): 2.9e-20
Smith-Waterman score: 461; 33.9% identity (62.5% similar) in 277 aa overlap (9-282:18-282)
10 20 30 40 50
pF1KB9 MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTPSLVYALNLGCS
:..:. .: .:.:::.::. ... . : . : ::: :
CCDS12 MDTGPDQSYFSGNHWFVFSVYLLTFLVGLPLNLLALVVFVGKLQRRPVAVDVLLLNLTAS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 DLLLTVSLPLKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSAGRYLGAAFP
:::: . ::.. ::: . :::: :::. . : .: . ::::.: :.:..: :
CCDS12 DLLLLLFLPFRMVEAANGMHWPLPFILCPLSGFIFFTTIYLTALFLAAVSIERFLSVAHP
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB9 LGYQAFRRPCYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGINTPVNGSPVC-
: :.. : . : .: : :. : ..:. .: : ..::. . ::. :
CCDS12 LWYKTRPRLGQAGLVSVACWLLASAHCSVVYVIEFSGD-ISHSQGT-------NGT--CY
130 140 150 160 170
180 190 200 210 220
pF1KB9 LEAWDPASAG--PARFSLSLLLFFLPLAITAFCYVGCLRALARSGLTHRRKLRAAWVAGG
:: : :.:. ....:: .:: ::..:: . :.:.: .:::. :.: . ..
CCDS12 LEFRKDQLAILLPVRLEMAVVLFVVPLIITSYCYSRLVWILGRGG-SHRRQRRVAGLLAA
180 190 200 210 220
230 240 250 260 270 280
pF1KB9 ALLTLLLCVGPYNASNVASFLYPNLGGSWRKLGLITGAWSVVLNPLVTGYLGRGPGLKTV
.::..:.: ::::.:.:.... . . .:: . .. . ..:.: . . :
CCDS12 TLLNFLVCFGPYNVSHVVGYICGE-SPAWRIYVTLLSTLNSCVDPFVYYFSSSGFQADFH
230 240 250 260 270 280
290 300
pF1KB9 CAARTQGGKSQK
CCDS12 ELLRRLCGLWGQWQQESSMELKEQKGGEEQRADRPAERKTSEHSQGCGTGGQVACAES
290 300 310 320 330 340
>>CCDS12461.1 FFAR2 gene_id:2867|Hs108|chr19 (330 aa)
initn: 345 init1: 213 opt: 424 Z-score: 453.5 bits: 92.1 E(32554): 6.1e-19
Smith-Waterman score: 438; 30.5% identity (60.7% similar) in 308 aa overlap (2-295:4-295)
10 20 30 40 50
pF1KB9 MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARL-RLTPSLVYALNLGCSDLLLTV
: .: . :. : :.: :.::.:. ... : . .: . :.: .:::: .
CCDS12 MLPDWKSSLILMAYIIIFLTGLPANLLALRAFVGRIRQPQPAPVHILLLSLTLADLLLLL
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 SLPLKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSAGRYLGAAFPLGYQAF
::.: .:: .. : :: .: . . . . .: . .::..: ::::.:::. :.
CCDS12 LLPFKIIEAASNFRWYLPKVVCALTSFGFYSSIYCSTWLLAGISIERYLGVAFPVQYKLS
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB9 RRPCYSWGVCAAI--WALVLCHLGLVFGLEAPGGWLDHSNTSLGINTPVNGSPV-CLEAW
::: : :: ::. :.. . : .:. .. . ::. . . :. . : : .
CCDS12 RRPLY--GVIAALVAWVMSFGHCTIVIIVQ-------YLNTTEQVRS---GNEITCYENF
130 140 150 160
180 190 200 210 220 230
pF1KB9 DPASAG---PARFSLSLLLFFLPLAITAFCYVGCLRALARSGLTH-RRKLRAAWVAGGAL
. :.:. : :.:::.:.:.: ::: . . . :. .:. ::. .: .:
CCDS12 TDNQLDVVLPVRLELCLVLFFIPMAVTIFCYWRFVWIMLSQPLVGAQRRRRAVGLAVVTL
170 180 190 200 210 220
240 250 260 270 280
pF1KB9 LTLLLCVGPYNASNVASFLYPNLGGSWRKLGLITGAWSVVLNPLV----TGYLGR--GPG
:..:.: ::::.:..... . . ::..... .. .. :.::. .. . : : :
CCDS12 LNFLVCFGPYNVSHLVGY-HQRKSPWWRSIAVVFSSLNASLDPLLFYFSSSVVRRAFGRG
230 240 250 260 270 280
290 300
pF1KB9 LKTVCAARTQGGKSQK
:... :.::
CCDS12 LQVL---RNQGSSLLGRRGKDTAEGTNEDRGVGQGEGMPSSDFTTE
290 300 310 320 330
>>CCDS4032.1 F2R gene_id:2149|Hs108|chr5 (425 aa)
initn: 287 init1: 164 opt: 313 Z-score: 336.1 bits: 70.7 E(32554): 2.1e-12
Smith-Waterman score: 314; 26.0% identity (62.4% similar) in 242 aa overlap (11-250:108-339)
10 20 30 40
pF1KB9 MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTP
.:...:....:::..:: . ... :
CCDS40 SPLQKQLPAFISEDASGYLTSSWLTLFVPSVYTGVFVVSLPLNIMAIVVFILKMKVK-KP
80 90 100 110 120 130
50 60 70 80 90 100
pF1KB9 SLVYALNLGCSDLLLTVSLPLKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAAL
..:: :.:. .:.:.. ::.: .... : . . :: ..: . .::. .....
CCDS40 AVVYMLHLATADVLFVSVLPFKISYYFSGSDWQFGSELCRFVTAAFYCNMYASILLMTVI
140 150 160 170 180 190
110 120 130 140 150 160
pF1KB9 SAGRYLGAAFPLGYQAFRRPCYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGI
: :.:....:. ..: . .: :::::.. :.: : ... :.
CCDS40 SIDRFLAVVYPMQSLSWRTLGRASFTCLAIWALAIA--GVV-----PLLLKEQTIQVPGL
200 210 220 230 240
170 180 190 200 210
pF1KB9 NTPVNGSPVCLEAWDPASAGPARFS-LSLLLFFLPLAITAFCYVGCLRALARSGLTHR-R
: . . . . : :: .: ..::.:: :.. :::. .: :. :....: .
CCDS40 NITTCHDVLNETLLEGYYA--YYFSAFSAVFFFVPLIISTVCYVSIIRCLSSSAVANRSK
250 260 270 280 290 300
220 230 240 250 260 270
pF1KB9 KLRAAWVAGGALLTLLLCVGPYNASNVASFLYPNLGGSWRKLGLITGAWSVVLNPLVTGY
: :: ....... ...: :: :. .: . .
CCDS40 KSRALFLSAAVFCIFIICFGPTNVLLIAHYSFLSHTSTTEAAYFAYLLCVCVSSISCCID
310 320 330 340 350 360
280 290 300
pF1KB9 LGRGPGLKTVCAARTQGGKSQK
CCDS40 PLIYYYASSECQRYVYSILCCKESSDPSSYNSSGQLMASKMDTCSSNLNNSIYKKLLT
370 380 390 400 410 420
>>CCDS14115.1 P2RY8 gene_id:286530|Hs108|chrY (359 aa)
initn: 256 init1: 201 opt: 300 Z-score: 323.4 bits: 68.1 E(32554): 1.1e-11
Smith-Waterman score: 305; 27.2% identity (56.9% similar) in 283 aa overlap (11-281:29-298)
10 20 30 40
pF1KB9 MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTPSL
.: . :...: :.... . : .::.
CCDS14 MQVPNSTGPDNATLQMLRNPAIAVALPVVYSLVAAVSIPGNLFSLWVLCRRMGPR-SPSV
10 20 30 40 50
50 60 70 80 90 100
pF1KB9 VYALNLGCSDLLLTVSLPLKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSA
.. .::. .::.:. ::.. : . . :: : .:: . .:.. .. .:.
CCDS14 IFMINLSVTDLMLASVLPFQIYYHCNRHHWVFGVLLCNVVTVAFYANMYSSILTMTCISV
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB9 GRYLGAAFPLGYQAFRRPCYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGINT
:.::. .::. . .:: :. ..::. : :.: :. . . : . .::: :
CCDS14 ERFLGVLYPLSSKRWRRRRYAVAACAGTWLLLLTALSPLARTD-----LTYPVHALGIIT
120 130 140 150 160 170
170 180 190 200 210
pF1KB9 PVNGSPVCLEA--WD--PASAGPARF--SLSLLLFFLPLAITAFCYVGCLRALARSGLTH
:... : :. : : : .. .:::..:..::. ::.. . : :. .:
CCDS14 -------CFDVLKWTMLPSVAMWAVFLFTIFILLFLIPFVITVACYTATILKLLRTEEAH
180 190 200 210 220
220 230 240 250 260 270
pF1KB9 RR--KLRAAWVAGGALLTLLLCVGPYN----ASNVASFLYPNLGGSWRKLGLITGAWSVV
: . ::. .:. .::... : .: : : :. ..: . :: : . .
CCDS14 GREQRRRAVGLAAVVLLAFVTCFAPNNFVLLAHIVSRLFYGKSYYHVYKLTLCLSCLNNC
230 240 250 260 270 280
280 290 300
pF1KB9 LNPLVTGYLGRGPGLKTVCAARTQGGKSQK
:.:.: . .:
CCDS14 LDPFVYYFASREFQLRLREYLGCRRVPRDTLDTRRESLFSARTTSVRSEAGAHPEGMEGA
290 300 310 320 330 340
>>CCDS14115.1 P2RY8 gene_id:286530|Hs108|chrX (359 aa)
initn: 256 init1: 201 opt: 300 Z-score: 323.4 bits: 68.1 E(32554): 1.1e-11
Smith-Waterman score: 305; 27.2% identity (56.9% similar) in 283 aa overlap (11-281:29-298)
10 20 30 40
pF1KB9 MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTPSL
.: . :...: :.... . : .::.
CCDS14 MQVPNSTGPDNATLQMLRNPAIAVALPVVYSLVAAVSIPGNLFSLWVLCRRMGPR-SPSV
10 20 30 40 50
50 60 70 80 90 100
pF1KB9 VYALNLGCSDLLLTVSLPLKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSA
.. .::. .::.:. ::.. : . . :: : .:: . .:.. .. .:.
CCDS14 IFMINLSVTDLMLASVLPFQIYYHCNRHHWVFGVLLCNVVTVAFYANMYSSILTMTCISV
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB9 GRYLGAAFPLGYQAFRRPCYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGINT
:.::. .::. . .:: :. ..::. : :.: :. . . : . .::: :
CCDS14 ERFLGVLYPLSSKRWRRRRYAVAACAGTWLLLLTALSPLARTD-----LTYPVHALGIIT
120 130 140 150 160 170
170 180 190 200 210
pF1KB9 PVNGSPVCLEA--WD--PASAGPARF--SLSLLLFFLPLAITAFCYVGCLRALARSGLTH
:... : :. : : : .. .:::..:..::. ::.. . : :. .:
CCDS14 -------CFDVLKWTMLPSVAMWAVFLFTIFILLFLIPFVITVACYTATILKLLRTEEAH
180 190 200 210 220
220 230 240 250 260 270
pF1KB9 RR--KLRAAWVAGGALLTLLLCVGPYN----ASNVASFLYPNLGGSWRKLGLITGAWSVV
: . ::. .:. .::... : .: : : :. ..: . :: : . .
CCDS14 GREQRRRAVGLAAVVLLAFVTCFAPNNFVLLAHIVSRLFYGKSYYHVYKLTLCLSCLNNC
230 240 250 260 270 280
280 290 300
pF1KB9 LNPLVTGYLGRGPGLKTVCAARTQGGKSQK
:.:.: . .:
CCDS14 LDPFVYYFASREFQLRLREYLGCRRVPRDTLDTRRESLFSARTTSVRSEAGAHPEGMEGA
290 300 310 320 330 340
>>CCDS12350.1 F2RL3 gene_id:9002|Hs108|chr19 (385 aa)
initn: 285 init1: 182 opt: 287 Z-score: 309.4 bits: 65.6 E(32554): 6.5e-11
Smith-Waterman score: 287; 28.8% identity (55.1% similar) in 316 aa overlap (3-296:75-373)
10 20 30
pF1KB9 MDLPPQLSFGLYVAAFALGFPLNVLAIRG-AT
.: .: .:: ....:.: : ::. ::
CCDS12 APRGYPGQVCANDSDTLELPDSSRALLLGWVPTRLVPALYGLVLVVGLPANGLALWVLAT
50 60 70 80 90 100
40 50 60 70 80 90
pF1KB9 AHARLRLTPSLVYALNLGCSDLLLTVSLPLKAVEALASGAWPLPASLCPVFAVAHFFPLY
:: :: . .::. .::::...:: . . : . ::. . : . ..: . .:
CCDS12 QAPRL---PSTMLLMNLAAADLLLALALPPRIAYHLRGQRWPFGEAACRLATAALYGHMY
110 120 130 140 150 160
100 110 120 130 140 150
pF1KB9 AGGGFLAALSAGRYLGAAFPLGYQAFRRPCYSWGVCAAIWALVLCHLGLVFGLEAPGGWL
.. .:::.: :::. . :: .:.: . :.: : : :. :.: . :. :
CCDS12 GSVLLLAAVSLDRYLALVHPLRARALRGRRLALGLCMAAW-LMAAALALPLTLQRQTFRL
170 180 190 200 210 220
160 170 180 190 200
pF1KB9 DHSNTSLGINT-PVNGSPVCLEAWDPASAGPARFS-LSLLLFFLPLAITAFCYVGCLRAL
.:. : .. :.... :.:: :. :.:: :::: .:: . :..:
CCDS12 ARSDRVLCHDALPLDAQA---SHWQPA------FTCLALLGCFLPLLAMLLCYGATLHTL
230 240 250 260 270
210 220 230 240 250 260
pF1KB9 ARSG--LTHRRKLRAAWVAGGALL----TLLLCVGPYNASNVASFLYPNLGGSWRKLGLI
: :: : .: :. .:... . .::: . . : : . :: :.. .:
CCDS12 AASGRRYGHALRLTAVVLASAVAFFVPSNLLLLLHYSDPSPSA---WGNLYGAYVP-SLA
280 290 300 310 320
270 280 290 300
pF1KB9 TGAWSVVLNPLV-------------TGYLGRGPGLKTVCAARTQGGKSQK
.. . ..:.. .: . :.:: .. : ..::
CCDS12 LSTLNSCVDPFIYYYVSAEFRDKVRAGLFQRSPGDTVASKASAEGGSRGMGTHSSLLQ
330 340 350 360 370 380
300 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 19:12:40 2016 done: Fri Nov 4 19:12:41 2016
Total Scan time: 2.720 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]