FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0844, 301 aa
  1>>>pF1KE0844 301 - 301 aa - 301 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2690+/-0.000972; mu= 15.4539+/- 0.058
 mean_var=65.5535+/-13.502, 0's: 0 Z-trim(104.6): 49 B-trim: 466 in 1/47
 Lambda= 0.158408
 statistics sampled from 7947 (7984) to 7947 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.611), E-opt: 0.2 (0.245), width: 16
 Scan time: 2.380

The best scores are:                                  opt  bits  E(32554)
CCDS33099.1 VN1R4 gene_id:317703|Hs108|chr19 ( 301)  2020 470.6 6.6e-133
CCDS12862.1 VN1R2 gene_id:317701|Hs108|chr19 ( 395)  1130 267.2  1.4e-71
CCDS12951.1 VN1R1 gene_id:57191|Hs108|chr19  ( 353)   815 195.2  5.9e-50

>>CCDS33099.1 VN1R4 gene_id:317703|Hs108|chr19 (301 aa)
 initn: 2020 init1: 2020 opt: 2020 Z-score: 2499.6 bits: 470.6 E(32554): 6.6e-133
Smith-Waterman score: 2020; 100.0% identity (100.0% similar) in 301 aa overlap (1-301:1-301)

10 20 30 40 50 60
pF1KE0 MASRYVAVGMILSQTVVGVLGSFSVLLHYLSFYCTGCRLRSTDLIVKHLIVANFLALRCK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MASRYVAVGMILSQTVVGVLGSFSVLLHYLSFYCTGCRLRSTDLIVKHLIVANFLALRCK
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE0 GVPQTMAAFGVRYFLNALGCKLVFYLHRVGRGVSIGTTCLLSVFQVITVSSRKSRWAKLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GVPQTMAAFGVRYFLNALGCKLVFYLHRVGRGVSIGTTCLLSVFQVITVSSRKSRWAKLK
70 80 90 100 110 120

130 140 150 160 170 180
pF1KE0 EKAPKHVGFSVLLCWIVCMLVNIIFPMYVTGKWNYTNITVNEDLGYCSGGGNNKIAQTLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 EKAPKHVGFSVLLCWIVCMLVNIIFPMYVTGKWNYTNITVNEDLGYCSGGGNNKIAQTLR
130 140 150 160 170 180

190 200 210 220 230 240
pF1KE0 AMLLSFPDVLCLGLMLWVSSSMVCILHRHKQRVQHIDRSNLSPRASPENRATQSILILVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 AMLLSFPDVLCLGLMLWVSSSMVCILHRHKQRVQHIDRSNLSPRASPENRATQSILILVS
190 200 210 220 230 240

250 260 270 280 290 300
pF1KE0 TFVSSYTLSCLFQVCMALLDNPNSLLVNTSALMSVCFPTLSPFVLMSCDPSVYRFCFAWK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 TFVSSYTLSCLFQVCMALLDNPNSLLVNTSALMSVCFPTLSPFVLMSCDPSVYRFCFAWK
250 260 270 280 290 300

pF1KE0 R
:
CCDS33 R

>>CCDS12862.1 VN1R2 gene_id:317701|Hs108|chr19 (395 aa)
 initn: 1111 init1: 1111 opt: 1130 Z-score: 1398.6 bits: 267.2 E(32554): 1.4e-71
Smith-Waterman score: 1130; 57.1% identity (80.6% similar) in 294 aa overlap (3-296:85-378)

10 20 30
pF1KE0 MASRYVAVGMILSQTVVGVLGSFSVLLHYLSF
.. :.. : :.:::.::.::.: .:. .
CCDS12 VPQTQLSFLSSLCLVSLFLHSLVSAHGEKPTKPVGLDPTLFQVVVGILGNFSLLYYYMFL
60 70 80 90 100 110

40 50 60 70 80 90
pF1KE0 YCTGCRLRSTDLIVKHLIVANFLALRCKGVPQTMAAFGVRYFLNALGCKLVFYLHRVGRG
: : . ::::::..:: ::. :.. : .:.:::.::...: : .:::...: ::::::
CCDS12 YFRGYKPRSTDLILRHLTVADSLVILSKRIPETMATFGLKHFDNYFGCKFLLYAHRVGRG
120 130 140 150 160 170

100 110 120 130 140 150
pF1KE0 VSIGTTCLLSVFQVITVSSRKSRWAKLKEKAPKHVGFSVLLCWIVCMLVNIIFPMYVTGK
::::.:::::::::::.. :.::::..: ::: ..:.: .::: :::: :::.:.:::
CCDS12 VSIGSTCLLSVFQVITINPRNSRWAEMKVKAPTYIGLSNILCWAFHMLVNAIFPIYTTGK
180 190 200 210 220 230

160 170 180 190 200 210
pF1KE0 WNYTNITVNEDLGYCSGGGNNKIAQTLRAMLLSFPDVLCLGLMLWVSSSMVCILHRHKQR
:. .::: . ::::::. ........ : : :: ::::::::::.:::.: .:.::::.
CCDS12 WSNNNITKKGDLGYCSAPLSDEVTKSVYAALTSFHDVLCLGLMLWASSSIVLVLYRHKQQ
240 250 260 270 280 290

220 230 240 250 260 270
pF1KE0 VQHIDRSNLSPRASPENRATQSILILVSTFVSSYTLSCLFQVCMALLDNPNSLLVNTSAL
:::: :.:: : .:: ::: :::: :::::. :.:: . : .::.:: . ::::.::
CCDS12 VQHICRNNLYPNSSPGNRAIQSILALVSTFALCYALSFITYVYLALFDNSSWWLVNTAAL
300 310 320 330 340 350

280 290 300
pF1KE0 MSVCFPTLSPFVLMSCDPSVYRFCFAWKR
. .::::.:::::: ::: :.:
CCDS12 IIACFPTISPFVLMCRDPSRSRLCSICCRRNRRFFHDFRKM
360 370 380 390

>>CCDS12951.1 VN1R1 gene_id:57191|Hs108|chr19 (353 aa)
 initn: 800 init1: 468 opt: 815 Z-score: 1010.3 bits: 195.2 E(32554): 5.9e-50
Smith-Waterman score: 815; 43.1% identity (70.6% similar) in 299 aa overlap (1-298:41-338)

10 20
pF1KE0 MASRYVAVGM-ILSQTVVGVLGSFSVLLHY
:: : :. .: :: ::.::. .: :
CCDS12 PLMTRYFFLLFYSTDSSDLNENQHPLDFDEMAFGKVKSGISFLIQTGVGILGNSFLLCFY
20 30 40 50 60 70

30 40 50 60 70 80
pF1KE0 LSFYCTGCRLRSTDLIVKHLIVANFLALRCKGVPQTMAAFGVRYFLNALGCKLVFYLHRV
. :: .:: ::::...: .:: ..: ::.::::::::..:.:: :::.::: :::
CCDS12 NLILFTGHKLRPTDLILSQLALANSMVLFFKGIPQTMAAFGLKYLLNDTGCKFVFYYHRV
80 90 100 110 120 130

90 100 110 120 130 140
pF1KE0 GRGVSIGTTCLLSVFQVITVSSRKSRWAKLKEKAPKHVGFSVLLCWIVCMLVNIIFPMYV
: ::..: :::. ::.: .. :: ..: ..:. . : :::: .:.: . :
CCDS12 GTRVSLSTICLLNGFQAIKLNPSICRWMEIKIRSPRFIDFCCLLCWAPHVLMNASVLLLV
140 150 160 170 180 190

150 160 170 180 190 200
pF1KE0 TGKWNYTNITVNEDLGYCSGGGNNKIAQTLRAMLLSFPDVLCLGLMLWVSSSMVCILHRH
.: : : ..... :::: ...... .:.:.: :: . ::.:.:.:.::: .:.::
CCDS12 NGPLNSKNSSAKNNYGYCSYKASKRFS-SLHAVLYFSPDFMSLGFMVWASGSMVFFLYRH
200 210 220 230 240

210 220 230 240 250 260
pF1KE0 KQRVQHIDRSNLSPRASPENRATQSILILVSTFVSSYTLSCLFQVCMALLDNPNSLLVNT
::.::: . :: : : : :::..:..:::.: :.. .. . ... ::.. .:..
CCDS12 KQQVQHNHSNRLSCRPSQEARATHTIMVLVSSFFVFYSVHSFLTIWTTVVANPGQWIVTN
250 260 270 280 290 300

270 280 290 300
pF1KE0 SALMSVCFPTLSPFVLMSCDPSVYRFCFAWKR
:.:.. :::. :::::. : . .::::
CCDS12 SVLVASCFPARSPFVLIMSDTHISQFCFACRTRKTLFPNLVVMP
310 320 330 340 350

301 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 03:33:43 2016 done: Sat Nov 5 03:33:44 2016
 Total Scan time: 2.380 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]
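
The three summary rows of the report above can be read back programmatically. The Python sketch below is illustrative only and is not part of the FASTA output: the names `parse_summary` and `log10_evalue_from_bits` are hypothetical helpers, and the relation E ≈ N · m · n · 2^(-bits) (N library sequences, m query length, n subject length) is an assumption borrowed from the usual bit-score convention rather than something the report states. It parses the "best scores" rows and checks how closely that assumed relation reproduces the reported E(32554) values.

```python
import math
import re

# Summary rows copied from the "The best scores are:" table in the report above.
SUMMARY = """\
CCDS33099.1 VN1R4 gene_id:317703|Hs108|chr19 ( 301) 2020 470.6 6.6e-133
CCDS12862.1 VN1R2 gene_id:317701|Hs108|chr19 ( 395) 1130 267.2 1.4e-71
CCDS12951.1 VN1R1 gene_id:57191|Hs108|chr19 ( 353) 815 195.2 5.9e-50
"""

QUERY_LEN = 301        # from "Query: pF1KE0844, 301 aa"
LIBRARY_SEQS = 32554   # from "18511270 residues in 32554 sequences"

# accession, gene symbol, free-text description, (length), opt, bits, E-value
ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<gene>\S+)\s+.*?\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_summary(text):
    """Return one dict per summary row; rows that do not match are skipped."""
    hits = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m is None:
            continue
        hits.append({
            "acc": m.group("acc"),
            "gene": m.group("gene"),
            "length": int(m.group("length")),
            "opt": int(m.group("opt")),
            "bits": float(m.group("bits")),
            "evalue": float(m.group("evalue")),
        })
    return hits


def log10_evalue_from_bits(bits, query_len, subject_len, n_seqs):
    """log10 of E under the ASSUMED relation E = n_seqs * m * n * 2**(-bits)."""
    return (math.log10(n_seqs) + math.log10(query_len)
            + math.log10(subject_len) - bits * math.log10(2))


hits = parse_summary(SUMMARY)
for hit in hits:
    estimated = log10_evalue_from_bits(
        hit["bits"], QUERY_LEN, hit["length"], LIBRARY_SEQS)
    reported = math.log10(hit["evalue"])
    print(f'{hit["gene"]}: reported log10(E)={reported:.2f}, '
          f'estimated log10(E)={estimated:.2f}')
```

Working in log10 keeps the comparison readable for E-values as small as 6.6e-133. For these three hits the estimate stays within a few hundredths of a log10 unit of the reported value, which is consistent with the one-decimal rounding of the bit scores; that agreement is an observation about this report, not a guarantee about how FASTA derives its statistics.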