FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2659, 490 aa
  1>>>pF1KE2659 490 - 490 aa - 490 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1629+/-0.000852; mu= 19.5528+/- 0.051
 mean_var=75.2448+/-14.888, 0's: 0 Z-trim(107.5): 16 B-trim: 0 in 0/51
 Lambda= 0.147855
 statistics sampled from 9620 (9630) to 9620 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.658), E-opt: 0.2 (0.296), width: 16
 Scan time: 2.830

The best scores are:                                   opt  bits E(32554)
CCDS1600.1 SLC35F3 gene_id:148641|Hs108|chr1   ( 490) 3269 706.8 1.4e-203
CCDS73050.1 SLC35F3 gene_id:148641|Hs108|chr1  ( 421) 2634 571.3 7.2e-163
CCDS76684.1 SLC35F4 gene_id:341880|Hs108|chr14 ( 485) 1509 331.4 1.4e-90

>>CCDS1600.1 SLC35F3 gene_id:148641|Hs108|chr1 (490 aa)
 initn: 3269 init1: 3269 opt: 3269 Z-score: 3768.7 bits: 706.8 E(32554): 1.4e-203
Smith-Waterman score: 3269; 100.0% identity (100.0% similar) in 490 aa overlap (1-490:1-490)

10 20 30 40 50 60
pF1KE2 MGIREFPSGAPRGKSIAVGMRRSPDVSPRRLSDISPQLRQLKYLVVDEAIKEDLKWSRSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 MGIREFPSGAPRGKSIAVGMRRSPDVSPRRLSDISPQLRQLKYLVVDEAIKEDLKWSRSV
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE2 EDLTSGPVGLTSIEERILRITGYYGYQPWAASCKREERPRDSPGPAEAQAPAGVEAGGRA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 EDLTSGPVGLTSIEERILRITGYYGYQPWAASCKREERPRDSPGPAEAQAPAGVEAGGRA
70 80 90 100 110 120

130 140 150 160 170 180
pF1KE2 SRRCWTCSRAQLKKIFWGVAVVLCVCSSWAGSTQLAKLTFRKFDAPFTLTWFATNWNFLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 SRRCWTCSRAQLKKIFWGVAVVLCVCSSWAGSTQLAKLTFRKFDAPFTLTWFATNWNFLF
130 140 150 160 170 180

190 200 210 220 230 240
pF1KE2 FPLYYVGHVCKSTEKQSVKQRYRECCRFFGDNGLTLKVFFTKAAPFGVLWTLTNYLYLHA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 FPLYYVGHVCKSTEKQSVKQRYRECCRFFGDNGLTLKVFFTKAAPFGVLWTLTNYLYLHA
190 200 210 220 230 240

250 260 270 280 290 300
pF1KE2 IKKINTTDVSVLFCCNKAFVFLLSWIVLRDRFMGVRIVAAILAIAGIVMMTYADGFHSHS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 IKKINTTDVSVLFCCNKAFVFLLSWIVLRDRFMGVRIVAAILAIAGIVMMTYADGFHSHS
250 260 270 280 290 300

310 320 330 340 350 360
pF1KE2 VIGIALVVASASMSALYKVLFKLLLGSAKFGEAALFLSILGVFNILFITCIPIILYFTKV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 VIGIALVVASASMSALYKVLFKLLLGSAKFGEAALFLSILGVFNILFITCIPIILYFTKV
310 320 330 340 350 360

370 380 390 400 410 420
pF1KE2 EYWSSFDDIPWGNLCGFSVLLLTFNIVLNFGIAVTYPTLMSLGIVLSIPVNAVIDHYTSQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 EYWSSFDDIPWGNLCGFSVLLLTFNIVLNFGIAVTYPTLMSLGIVLSIPVNAVIDHYTSQ
370 380 390 400 410 420

430 440 450 460 470 480
pF1KE2 IVFNGVRVIAIIIIGLGFLLLLLPEEWDVWLIKLLTRLKVRKKEEPAEGAADLSSGPQSK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 IVFNGVRVIAIIIIGLGFLLLLLPEEWDVWLIKLLTRLKVRKKEEPAEGAADLSSGPQSK
430 440 450 460 470 480

490
pF1KE2 NRRARPSFAR
::::::::::
CCDS16 NRRARPSFAR
490

>>CCDS73050.1 SLC35F3 gene_id:148641|Hs108|chr1 (421 aa)
 initn: 2634 init1: 2634 opt: 2634 Z-score: 3037.5 bits: 571.3 E(32554): 7.2e-163
Smith-Waterman score: 2634; 100.0% identity (100.0% similar) in 395 aa overlap (96-490:27-421)

70 80 90 100 110 120
pF1KE2 GPVGLTSIEERILRITGYYGYQPWAASCKREERPRDSPGPAEAQAPAGVEAGGRASRRCW
::::::::::::::::::::::::::::::
CCDS73 MKKHSARVAPLSACNSPVLTLTKVEGEERPRDSPGPAEAQAPAGVEAGGRASRRCW
10 20 30 40 50

130 140 150 160 170 180
pF1KE2 TCSRAQLKKIFWGVAVVLCVCSSWAGSTQLAKLTFRKFDAPFTLTWFATNWNFLFFPLYY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 TCSRAQLKKIFWGVAVVLCVCSSWAGSTQLAKLTFRKFDAPFTLTWFATNWNFLFFPLYY
60 70 80 90 100 110

190 200 210 220 230 240
pF1KE2 VGHVCKSTEKQSVKQRYRECCRFFGDNGLTLKVFFTKAAPFGVLWTLTNYLYLHAIKKIN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 VGHVCKSTEKQSVKQRYRECCRFFGDNGLTLKVFFTKAAPFGVLWTLTNYLYLHAIKKIN
120 130 140 150 160 170

250 260 270 280 290 300
pF1KE2 TTDVSVLFCCNKAFVFLLSWIVLRDRFMGVRIVAAILAIAGIVMMTYADGFHSHSVIGIA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 TTDVSVLFCCNKAFVFLLSWIVLRDRFMGVRIVAAILAIAGIVMMTYADGFHSHSVIGIA
180 190 200 210 220 230

310 320 330 340 350 360
pF1KE2 LVVASASMSALYKVLFKLLLGSAKFGEAALFLSILGVFNILFITCIPIILYFTKVEYWSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 LVVASASMSALYKVLFKLLLGSAKFGEAALFLSILGVFNILFITCIPIILYFTKVEYWSS
240 250 260 270 280 290

370 380 390 400 410 420
pF1KE2 FDDIPWGNLCGFSVLLLTFNIVLNFGIAVTYPTLMSLGIVLSIPVNAVIDHYTSQIVFNG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 FDDIPWGNLCGFSVLLLTFNIVLNFGIAVTYPTLMSLGIVLSIPVNAVIDHYTSQIVFNG
300 310 320 330 340 350

430 440 450 460 470 480
pF1KE2 VRVIAIIIIGLGFLLLLLPEEWDVWLIKLLTRLKVRKKEEPAEGAADLSSGPQSKNRRAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 VRVIAIIIIGLGFLLLLLPEEWDVWLIKLLTRLKVRKKEEPAEGAADLSSGPQSKNRRAR
360 370 380 390 400 410

490
pF1KE2 PSFAR
:::::
CCDS73 PSFAR
420

>>CCDS76684.1 SLC35F4 gene_id:341880|Hs108|chr14 (485 aa)
 initn: 1713 init1: 1469 opt: 1509 Z-score: 1739.8 bits: 331.4 E(32554): 1.4e-90
Smith-Waterman score: 1549; 51.1% identity (73.7% similar) in 476 aa overlap (62-482:2-475)

40 50 60 70 80 90
pF1KE2 SDISPQLRQLKYLVVDEAIKEDLKWSRSVEDLTSGPVGLTSIEERILRITGYYGYQPWAA
:. ..: :...::.::::::::::: : .
CCDS76 MDVKAAPNGVATIEDRILRITGYYGYYPGYS
10 20 30

100 110 120
pF1KE2 S-----------CK-------------REERP----RDSPGPA---EAQAPAGVEAGGRA
: :: :. : .:: .: . :. .:: : :.
CCDS76 SQKSTSRSSVTRCKPGANCPSSHSGISRQLSPLSVTEDSSAPILELQNQGSSGV-CGHRV
40 50 60 70 80 90

130 140 150
pF1KE2 SR------------------------RCWTCSRAQLKKIFWGVAVVLCVCSSWAGSTQLA
: :: .:. :: : ::. ..: : :::.:.::..
CCDS76 ERQNRSADDGTQTHSENSSQENRIKARCLSCTSMVLKGI-WGLLIILSVSSSWVGTTQIV
100 110 120 130 140

160 170 180 190 200 210
pF1KE2 KLTFRKFDAPFTLTWFATNWNFLFFPLYYVGHVCKSTEKQSVKQRYRECCRFFGDNGLTL
:.:...: :: .:::.::::..:::.:: ::. . :::: ...::: :.::..::::
CCDS76 KITYKNFYCPFFMTWFSTNWNIMFFPVYYSGHLATAQEKQSPMKKFRECSRIFGEDGLTL
150 160 170 180 190 200

220 230 240 250 260 270
pF1KE2 KVFFTKAAPFGVLWTLTNYLYLHAIKKINTTDVSVLFCCNKAFVFLLSWIVLRDRFMGVR
:.:. ..:::..:::::::::: :.::...::::.:::::::::::::::::.:::::::
CCDS76 KLFLKRTAPFSILWTLTNYLYLLALKKLTATDVSALFCCNKAFVFLLSWIVLKDRFMGVR
210 220 230 240 250 260

280 290 300 310 320 330
pF1KE2 IVAAILAIAGIVMMTYADGFHSHSVIGIALVVASASMSALYKVLFKLLLGSAKFGEAALF
:::::.::.:::::.:::.::. :.::.:..:.::: :::::::::..::::.::::: :
CCDS76 IVAAIMAITGIVMMAYADNFHADSIIGVAFAVGSASTSALYKVLFKMFLGSANFGEAAHF
270 280 290 300 310 320

340 350 360 370 380 390
pF1KE2 LSILGVFNILFITCIPIILYFTKVEYWSSFDDIPWGNLCGFSVLLLTFNIVLNFGIAVTY
.: :: ::..::. :.::::::::.:::: .::: :::.. : :.:::..: :...::
CCDS76 VSTLGFFNLIFISFTPVILYFTKVEHWSSFAALPWGCLCGMAGLWLAFNILVNVGVVLTY
330 340 350 360 370 380

400 410 420 430 440 450
pF1KE2 PTLMSLGIVLSIPVNAVIDHYTSQIVFNGVRVIAIIIIGLGFLLLLLPEEWDVWLIKLLT
: :.:.: :::.: ::..: ....:: ::. : ::: .::::.::::::: .....
CCDS76 PILISIGTVLSVPGNAAVDLLKQEVIFNVVRLAATIIICIGFLLMLLPEEWDEITLRFIN
390 400 410 420 430 440

460 470 480 490
pF1KE2 RLKVRKKEEPAEGAADLSSGPQSKNRRARPSFAR
:: .:.:: .. ..: : ....:
CCDS76 SLKEKKSEEHVDDVTDPSIHLRGRGRANGTVSIPLA
450 460 470 480

490 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Jan 20 09:29:01 2017 done: Fri Jan 20 09:29:01 2017
 Total Scan time: 2.830 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]