FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB6702, 128 aa 1>>>pF1KB6702 128 - 128 aa - 128 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.6065+/-0.000311; mu= 14.4144+/- 0.019 mean_var=60.4300+/-11.610, 0's: 0 Z-trim(116.1): 31 B-trim: 43 in 1/54 Lambda= 0.164986 statistics sampled from 26972 (27004) to 26972 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.717), E-opt: 0.2 (0.317), width: 16 Scan time: 5.080 The best scores are: opt bits E(85289) NP_002695 (OMIM: 121010) platelet basic protein pr ( 128) 821 203.2 9.7e-53 NP_001502 (OMIM: 155730) growth-regulated alpha pr ( 107) 299 78.8 2.1e-15 NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 p ( 114) 284 75.3 2.7e-14 NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 p ( 107) 275 73.1 1.1e-13 NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 p ( 107) 264 70.5 6.9e-13 NP_002610 (OMIM: 173460) platelet factor 4 precurs ( 101) 255 68.4 2.9e-12 XP_005265753 (OMIM: 173460) PREDICTED: platelet fa ( 110) 253 67.9 4.3e-12 NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 p ( 114) 251 67.4 6.2e-12 NP_002611 (OMIM: 173461) platelet factor 4 variant ( 104) 237 64.1 5.8e-11 NP_000575 (OMIM: 146930) interleukin-8 precursor [ ( 99) 221 60.3 7.9e-10 NP_002407 (OMIM: 601704) C-X-C motif chemokine 9 p ( 125) 179 50.3 9.6e-07 NP_001556 (OMIM: 147310) C-X-C motif chemokine 10 ( 98) 158 45.3 2.5e-05 XP_006714126 (OMIM: 605149) PREDICTED: C-X-C motif ( 109) 140 41.0 0.00054 NP_006410 (OMIM: 605149) C-X-C motif chemokine 13 ( 109) 140 41.0 0.00054 NP_005400 (OMIM: 604852) C-X-C motif chemokine 11 ( 94) 126 37.6 0.0048 NP_001289052 (OMIM: 604852) C-X-C motif chemokine ( 106) 123 37.0 0.0087 >>NP_002695 (OMIM: 121010) platelet basic protein prepro (128 aa) initn: 821 init1: 821 opt: 821 Z-score: 1067.8 bits: 203.2 E(85289): 9.7e-53 Smith-Waterman score: 821; 100.0% identity (100.0% similar) in 128 aa overlap (1-128:1-128) 10 20 30 40 50 60 pF1KB6 MSLRLDTTPSCNSARPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MSLRLDTTPSCNSARPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 LRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 LRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKK 70 80 90 100 110 120 pF1KB6 LAGDESAD :::::::: NP_002 LAGDESAD >>NP_001502 (OMIM: 155730) growth-regulated alpha protei (107 aa) initn: 350 init1: 296 opt: 299 Z-score: 397.4 bits: 78.8 E(85289): 2.1e-15 Smith-Waterman score: 299; 52.4% identity (81.7% similar) in 82 aa overlap (45-126:26-106) 20 30 40 50 60 70 pF1KB6 RPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHP .: :.. . ... .::::.:..: .:::: NP_001 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAA-GASVATELRCQCLQTLQGIHP 10 20 30 40 50 80 90 100 110 120 pF1KB6 KNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD :::::..: . : :: :.:::::::.::: ::.: .: .:::..: : .:.: NP_001 KNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN 60 70 80 90 100 >>NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 precu (114 aa) initn: 327 init1: 284 opt: 284 Z-score: 377.7 bits: 75.3 E(85289): 2.7e-14 Smith-Waterman score: 284; 56.2% identity (90.6% similar) in 64 aa overlap (60-123:46-109) 30 40 50 60 70 80 pF1KB6 LTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHC ::::.:..::.:.::: :..:.:.. : .: NP_002 SSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHPKMISNLQVFAIGPQC 20 30 40 50 60 70 90 100 110 120 pF1KB6 NQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD ..:::.:.::.:..:::::.:: .::..:: : : NP_002 SKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN 80 90 100 110 >>NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 precu (107 aa) initn: 306 init1: 270 opt: 275 Z-score: 366.5 bits: 73.1 E(85289): 1.1e-13 Smith-Waterman score: 275; 48.8% identity (79.3% similar) in 82 aa overlap (45-126:25-106) 20 30 40 50 60 70 pF1KB6 RPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHP :. ..... . : .::::.:..: .::: NP_002 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHL 10 20 30 40 50 80 90 100 110 120 pF1KB6 KNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD :::::..: . : :: :.:::::::.:.: ::.: .: .:::..: : . .: NP_002 KNIQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN 60 70 80 90 100 >>NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 precu (107 aa) initn: 315 init1: 261 opt: 264 Z-score: 352.3 bits: 70.5 E(85289): 6.9e-13 Smith-Waterman score: 264; 48.1% identity (81.8% similar) in 77 aa overlap (45-121:25-101) 20 30 40 50 60 70 pF1KB6 RPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHP :. ..... ... .::::.:..: .::: NP_002 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHL 10 20 30 40 50 80 90 100 110 120 pF1KB6 KNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD :::::..: . : :: :.:::::::.:.: ::.: .: ..::..: : NP_002 KNIQSVNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN 60 70 80 90 100 >>NP_002610 (OMIM: 173460) platelet factor 4 precursor [ (101 aa) initn: 245 init1: 212 opt: 255 Z-score: 341.1 bits: 68.4 E(85289): 2.9e-12 Smith-Waterman score: 255; 52.9% identity (80.0% similar) in 70 aa overlap (52-121:30-99) 30 40 50 60 70 80 pF1KB6 VLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHPKNIQSLE : ... ..:.:.:.:::: ..:..: ::: NP_002 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITSLE 10 20 30 40 50 90 100 110 120 pF1KB6 VIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD :: : :: ...:::::.::::::: .:: :::..: : NP_002 VIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 >>XP_005265753 (OMIM: 173460) PREDICTED: platelet factor (110 aa) initn: 225 init1: 212 opt: 253 Z-score: 338.0 bits: 67.9 E(85289): 4.3e-12 Smith-Waterman score: 253; 58.1% identity (82.3% similar) in 62 aa overlap (60-121:47-108) 30 40 50 60 70 80 pF1KB6 LTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHC .:.:.:.:::: ..:..: ::::: : :: XP_005 VPGAAPAPPTWLEQLLSGGGVIYAEAEEDGDLQCLCVKTTSQVRPRHITSLEVIKAGPHC 20 30 40 50 60 70 90 100 110 120 pF1KB6 NQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD ...:::::.::::::: .:: :::..: : XP_005 PTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 80 90 100 110 >>NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 precu (114 aa) initn: 297 init1: 248 opt: 251 Z-score: 335.2 bits: 67.4 E(85289): 6.2e-12 Smith-Waterman score: 256; 40.5% identity (65.3% similar) in 121 aa overlap (7-121:2-107) 10 20 30 40 50 pF1KB6 MSLRLDTTPSCNSAR---PLHALQVLLLLSLLLTA---LASSTKGQTKRNLAKGKEESLD . :: .:: : .: .:: : :::: :::. : . NP_002 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASA--GPV------------- 10 20 30 40 60 70 80 90 100 110 pF1KB6 SDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIK : . .:::: :...: ..::.: .:.:. : .:..:::.:.::.:...::::.:: .: NP_002 SAVLTELRCTCLRVTLRVNPKTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLK 50 60 70 80 90 100 120 pF1KB6 KIVQKKLAGDESAD :..:: : NP_002 KVIQKILDSGNKKN 110 >>NP_002611 (OMIM: 173461) platelet factor 4 variant pre (104 aa) initn: 209 init1: 196 opt: 237 Z-score: 317.8 bits: 64.1 E(85289): 5.8e-11 Smith-Waterman score: 241; 43.6% identity (70.3% similar) in 101 aa overlap (21-121:15-102) 10 20 30 40 50 60 pF1KB6 MSLRLDTTPSCNSARPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAE : .:.:.::: .. . .:... : :.:: NP_002 MSSAARSRLTRATRQEMLFLALLLLPVVVA--------FARAEAEE-DGDL--- 10 20 30 40 70 80 90 100 110 120 pF1KB6 LRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKK .:.:.:::: ..:..: ::::: : :: ...:::::.::::::: .: :::.... NP_002 -QCLCVKTTSQVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEH 50 60 70 80 90 100 pF1KB6 LAGDESAD : NP_002 LES >>NP_000575 (OMIM: 146930) interleukin-8 precursor [Homo (99 aa) initn: 199 init1: 150 opt: 221 Z-score: 297.5 bits: 60.3 E(85289): 7.9e-10 Smith-Waterman score: 221; 47.8% identity (75.4% similar) in 69 aa overlap (60-127:31-99) 30 40 50 60 70 80 pF1KB6 LTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSG-IHPKNIQSLEVIGKGTH ::::.:::: : .::: :. :.:: .: : NP_000 MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFIKELRVIESGPH 10 20 30 40 50 60 90 100 110 120 pF1KB6 CNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD : ..:.:. :.:::..:::: ....:.: : :.. NP_000 CANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS 70 80 90 128 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 23:26:13 2016 done: Fri Nov 4 23:26:14 2016 Total Scan time: 5.080 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]