FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA0064, 470 aa
  1>>>pF1KSDA0064 470 - 470 aa - 470 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7566+/-0.000809; mu= 15.8736+/- 0.049
 mean_var=74.3173+/-14.660, 0's: 0 Z-trim(107.7): 33 B-trim: 0 in 0/52
 Lambda= 0.148775
 statistics sampled from 9728 (9749) to 9728 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.68), E-opt: 0.2 (0.299), width: 16
 Scan time: 2.900

The best scores are:                                opt  bits E(32554)
CCDS1750.1 SNX17 gene_id:9784|Hs108|chr2    ( 470) 3046 663.1 1.8e-190
CCDS58704.1 SNX17 gene_id:9784|Hs108|chr2   ( 445) 2734 596.1 2.4e-170
CCDS6288.1 SNX31 gene_id:169166|Hs108|chr8  ( 440) 1061 237.0    3e-62
CCDS1001.1 SNX27 gene_id:81609|Hs108|chr1   ( 528)  406  96.5  7.3e-20
CCDS81377.1 SNX27 gene_id:81609|Hs108|chr1  ( 541)  406  96.5  7.5e-20

>>CCDS1750.1 SNX17 gene_id:9784|Hs108|chr2 (470 aa)
 initn: 3046 init1: 3046 opt: 3046 Z-score: 3533.2 bits: 663.1 E(32554): 1.8e-190
Smith-Waterman score: 3046; 100.0% identity (100.0% similar) in 470 aa overlap (1-470:1-470)

               10        20        30        40        50        60
pF1KSD MHFSIPETESRSGDSGGSAYVAYNIHVNGVLHCRVRYSQLLGLHEQLRKEYGANVLPAFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 MHFSIPETESRSGDSGGSAYVAYNIHVNGVLHCRVRYSQLLGLHEQLRKEYGANVLPAFP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD PKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLGSSETFNSFLRRAQQETQQVPTEEVSLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 PKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLGSSETFNSFLRRAQQETQQVPTEEVSLE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD VLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLPDDLIGYFSLFLVREKEDGAFSFVRKLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 VLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLPDDLIGYFSLFLVREKEDGAFSFVRKLQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD EFELPYVSVTSLRSQEYKIVLRKSYWDSAYDDDVMENRVGLNLLYAQTVSDIERGWILVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 EFELPYVSVTSLRSQEYKIVLRKSYWDSAYDDDVMENRVGLNLLYAQTVSDIERGWILVT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD KEQHRQLKSLQEKVSKKEFLRLAQTLRHYGYLRFDACVADFPEKDCPVVVSAGNSELSLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 KEQHRQLKSLQEKVSKKEFLRLAQTLRHYGYLRFDACVADFPEKDCPVVVSAGNSELSLQ
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KSD LRLPGQQLREGSFRVTRMRCWRVTSSVPLPSGSTSSPGRGRGEVRLELAFEYLMSKDRLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 LRLPGQQLREGSFRVTRMRCWRVTSSVPLPSGSTSSPGRGRGEVRLELAFEYLMSKDRLQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KSD WVTITSPQAIMMSICLQSMVDELMVKKSGGSIRKMLRRRVGGTLRRSDSQQAVKSPPLLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 WVTITSPQAIMMSICLQSMVDELMVKKSGGSIRKMLRRRVGGTLRRSDSQQAVKSPPLLE
              370       380       390       400       410       420

              430       440       450       460       470
pF1KSD SPDATRESMVKLSSKLSAVSLRGIGSPSTDASASDVHGNFAFEGIGDEDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 SPDATRESMVKLSSKLSAVSLRGIGSPSTDASASDVHGNFAFEGIGDEDL
              430       440       450       460       470

>>CCDS58704.1 SNX17 gene_id:9784|Hs108|chr2 (445 aa)
 initn: 2859 init1: 2734 opt: 2734 Z-score: 3171.7 bits: 596.1 E(32554): 2.4e-170
Smith-Waterman score: 2813; 94.7% identity (94.7% similar) in 470 aa overlap (1-470:1-445)

               10        20        30        40        50        60
pF1KSD MHFSIPETESRSGDSGGSAYVAYNIHVNGVLHCRVRYSQLLGLHEQLRKEYGANVLPAFP
       :::::::::::::::::::::                         ::::::::::::::
CCDS58 MHFSIPETESRSGDSGGSAYV-------------------------LRKEYGANVLPAFP
               10        20                                 30

               70        80        90       100       110       120
pF1KSD PKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLGSSETFNSFLRRAQQETQQVPTEEVSLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLGSSETFNSFLRRAQQETQQVPTEEVSLE
          40        50        60        70        80        90

              130       140       150       160       170       180
pF1KSD VLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLPDDLIGYFSLFLVREKEDGAFSFVRKLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 VLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLPDDLIGYFSLFLVREKEDGAFSFVRKLQ
         100       110       120       130       140       150

              190       200       210       220       230       240
pF1KSD EFELPYVSVTSLRSQEYKIVLRKSYWDSAYDDDVMENRVGLNLLYAQTVSDIERGWILVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 EFELPYVSVTSLRSQEYKIVLRKSYWDSAYDDDVMENRVGLNLLYAQTVSDIERGWILVT
         160       170       180       190       200       210

              250       260       270       280       290       300
pF1KSD KEQHRQLKSLQEKVSKKEFLRLAQTLRHYGYLRFDACVADFPEKDCPVVVSAGNSELSLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 KEQHRQLKSLQEKVSKKEFLRLAQTLRHYGYLRFDACVADFPEKDCPVVVSAGNSELSLQ
         220       230       240       250       260       270

              310       320       330       340       350       360
pF1KSD LRLPGQQLREGSFRVTRMRCWRVTSSVPLPSGSTSSPGRGRGEVRLELAFEYLMSKDRLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LRLPGQQLREGSFRVTRMRCWRVTSSVPLPSGSTSSPGRGRGEVRLELAFEYLMSKDRLQ
         280       290       300       310       320       330

              370       380       390       400       410       420
pF1KSD WVTITSPQAIMMSICLQSMVDELMVKKSGGSIRKMLRRRVGGTLRRSDSQQAVKSPPLLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 WVTITSPQAIMMSICLQSMVDELMVKKSGGSIRKMLRRRVGGTLRRSDSQQAVKSPPLLE
         340       350       360       370       380       390

              430       440       450       460       470
pF1KSD SPDATRESMVKLSSKLSAVSLRGIGSPSTDASASDVHGNFAFEGIGDEDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 SPDATRESMVKLSSKLSAVSLRGIGSPSTDASASDVHGNFAFEGIGDEDL
         400       410       420       430       440

>>CCDS6288.1 SNX31 gene_id:169166|Hs108|chr8 (440 aa)
 initn: 913 init1: 782 opt: 1061 Z-score: 1231.1 bits: 237.0 E(32554): 3e-62
Smith-Waterman score: 1061; 43.4% identity (74.4% similar) in 387 aa overlap (1-386:3-382)

                 10        20        30        40        50
pF1KSD   MHFSIPETESRSGDSGGSAYVAYNIHVNGVLHCRVRYSQLLGLHEQLRKEYGANVLPA
         ::: :: ...:: :. :. :: :..:..: : :::::::: : .::::. .: : ::
CCDS62 MKMHFCIPVSQQRS-DALGGRYVLYSVHLDGFLFCRVRYSQLHGWNEQLRRVFG-NCLPP
               10         20        30        40        50

       60        70        80        90       100       110
pF1KSD FPPKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLGSSETFNSFLRRAQQETQQVPTEEVS
       ::::  ...: : ...::.:::.:.: : .:: .  :..:  ::. :: .: .. :...
CCDS62 FPPKYYLAMTTAMADERRDQLEQYLQNVTMDPNVLRSDVFVEFLKLAQLNTFDIATKKAY 60 70 80 90 100 110 120 130 140 150 160 170 pF1KSD LEVLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLPDDLIGYFSLFLVREKEDGAFSFVRK :...: : :.. ....::: .: :::.:. :. : .:.:::.:::.: ..: .: :.: CCDS62 LDIFLPNEQSIRIEIITSDTAERVLEVVSHKIGLCRELLGYFGLFLIRFGKEGKLSVVKK 120 130 140 150 160 170 180 190 200 210 220 230 pF1KSD LQEFELPYVSVTSLRSQEYKIVLRKSYWDSAYDDDVMENRVGLNLLYAQTVSDIERGWIL : .:::::::. : . .. :. ::: : . :. .:. ::...::: :...:::.:: CCDS62 LADFELPYVSLGSSEVENCKVGLRKWYMAPSLDSVLMDCRVAVDLLYMQAIQDIEKGWAK 180 190 200 210 220 230 240 250 260 270 280 290 pF1KSD VTKEQHRQLKSLQEKVSKKEFLRLAQTLRHYGYLRFDACVADFPEKDCPVVVSAGNSELS :. :...:...:.. :. .::.::. .::::::..: :. :.::. .:.:.::.:.: CCDS62 PTQAQRQKLEAFQKEDSQTKFLELAREVRHYGYLQLDPCTCDYPESGSGAVLSVGNNEIS 240 250 260 270 280 290 300 310 320 330 340 350 pF1KSD LQLRLPGQQLREGSFRVTRMRCWRVTSSVPLPSGSTSSPGRGRGEVRLELAFEYLMSKDR . :: .: .. :...:..::.:: : . :..: : .. ::: :.: :.: CCDS62 CCITLPDSQTQDIVFQMSRVKCWQVTFLGTLLD--TDGPQRTLNQ-NLELRFQY--SEDS 300 310 320 330 340 350 360 370 380 390 400 410 pF1KSD L-QWVTITSPQAIMMSICLQSMVDELMVKKSGGSIRKMLRRRVGGTLRRSDSQQAVKSPP :: .: . ::...: ::..:..: ::: CCDS62 CWQWFVIYTKQAFLLSSCLKKMISEKMVKLAAENTEMQIEVPEQSKSKKYHIQQSQQKDY 360 370 380 390 400 410 >>CCDS1001.1 SNX27 gene_id:81609|Hs108|chr1 (528 aa) initn: 269 init1: 174 opt: 406 Z-score: 470.1 bits: 96.5 E(32554): 7.3e-20 Smith-Waterman score: 436; 25.7% identity (58.1% similar) in 389 aa overlap (4-387:166-525) 10 20 30 pF1KSD MHFSIPETESRSGDSGGSAYVAYNIHVNGVLHC :.:. . ...: .:.::... : : CCDS10 PPHEADNLDPSDDSLGQSFYDYTEKQAVPISVPR--YKHVEQNGEKFVVYNVYMAGRQLC 140 150 160 170 180 190 40 50 60 70 80 90 pF1KSD RVRYSQLLGLHEQLRKEYGANVLPAFPPKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLG :: .. ::..:..:.. ..: .: : :::. ... ::. ::.:.. : . ..: CCDS10 SKRYREFAILHQNLKREFANFTFPRLPGKWPFSLSEQQLDARRRGLEEYLEKVCSIRVIG 200 210 220 230 240 250 100 110 120 130 140 150 pF1KSD SSETFNSFLRRAQQETQQVPTEEVSLEVLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLP :. .. :: ..... . 
: .: :.: : .: : : : .. :..: .:.:::. . CCDS10 ESDIMQEFLSESDENYNGV--SDVELRVALPDGTTVTVRVKKNSTTDQVYQAIAAKVGMD 260 270 280 290 300 310 160 170 180 190 200 210 pF1KSD DDLIGYFSLFLVREKEDGAFSFVRKLQEFELPY-VSVTSLRSQEYKIVLRKSYWDSAYDD . ..::.:: : . :::::: :.:. . . . : : : . .. CCDS10 STTVNYFALFEVI-----SHSFVRKLAPNEFPHKLYIQNYTSAVPGTCLTIRKWLFTTEE 320 330 340 350 360 220 230 240 250 260 270 pF1KSD DVM--ENRVGLNLLYAQTVSDIERGWILVTKEQHRQLKSLQEKVSKKEFLRLAQTLRHYG ... .: .... .. :.:.:...:.: . .:. ::..: :. . .: . .: . :. CCDS10 EILLNDNDLAVTYFFHQAVDDVKKGYIKA-EEKSYQLQKLYEQRKMVMYLNMLRTCEGYN 370 380 390 400 410 420 280 290 300 310 320 pF1KSD YLRFDACVADFPEKDCPVVVSAGNSELSLQLRLPGQQLREG--SFRVTRMRCWRVTSSVP . : :. : .: :... . ....:. ::.. .:. .:. : CCDS10 EIIFPHCACDSRRKG-HVITAISITHFKLHACTEEGQLENQVIAFEWDEMQRW------- 430 440 450 460 470 330 340 350 360 370 380 pF1KSD LPSGSTSSPGRGRGEVRLELAFEYLMSKDRLQWVTITSPQAIMMSICLQSMVDELMVKKS .:. : . . ::: .. . .:: : .: .: :.. . :: .: CCDS10 ----DTDEEG-------MAFCFEYARGEKKPRWVKIFTPYFNYMHECFERVFCELKWRKE 480 490 500 510 520 390 400 410 420 430 440 pF1KSD GGSIRKMLRRRVGGTLRRSDSQQAVKSPPLLESPDATRESMVKLSSKLSAVSLRGIGSPS CCDS10 EY >>CCDS81377.1 SNX27 gene_id:81609|Hs108|chr1 (541 aa) initn: 269 init1: 174 opt: 406 Z-score: 469.9 bits: 96.5 E(32554): 7.5e-20 Smith-Waterman score: 438; 25.8% identity (57.9% similar) in 399 aa overlap (4-397:166-533) 10 20 30 pF1KSD MHFSIPETESRSGDSGGSAYVAYNIHVNGVLHC :.:. . ...: .:.::... : : CCDS81 PPHEADNLDPSDDSLGQSFYDYTEKQAVPISVPR--YKHVEQNGEKFVVYNVYMAGRQLC 140 150 160 170 180 190 40 50 60 70 80 90 pF1KSD RVRYSQLLGLHEQLRKEYGANVLPAFPPKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLG :: .. ::..:..:.. ..: .: : :::. ... ::. ::.:.. : . ..: CCDS81 SKRYREFAILHQNLKREFANFTFPRLPGKWPFSLSEQQLDARRRGLEEYLEKVCSIRVIG 200 210 220 230 240 250 100 110 120 130 140 150 pF1KSD SSETFNSFLRRAQQETQQVPTEEVSLEVLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLP :. .. :: ..... . : .: :.: : .: : : : .. :..: .:.:::. . 
CCDS81 ESDIMQEFLSESDENYNGVS--DVELRVALPDGTTVTVRVKKNSTTDQVYQAIAAKVGMD 260 270 280 290 300 310 160 170 180 190 200 210 pF1KSD DDLIGYFSLFLVREKEDGAFSFVRKLQEFELPY-VSVTSLRSQEYKIVLRKSYWDSAYDD . ..::.:: : . :::::: :.:. . . . : : : . .. CCDS81 STTVNYFALF-----EVISHSFVRKLAPNEFPHKLYIQNYTSAVPGTCLTIRKWLFTTEE 320 330 340 350 360 220 230 240 250 260 270 pF1KSD DVM--ENRVGLNLLYAQTVSDIERGWILVTKEQHRQLKSLQEKVSKKEFLRLAQTLRHYG ... .: .... .. :.:.:...:.: . .:. ::..: :. . .: . .: . :. CCDS81 EILLNDNDLAVTYFFHQAVDDVKKGYIKA-EEKSYQLQKLYEQRKMVMYLNMLRTCEGYN 370 380 390 400 410 420 280 290 300 310 320 pF1KSD YLRFDACVADFPEKDCPVVVSAGNSELSLQLRLPGQQLREG--SFRVTRMRCWRVTSSVP . : :. : .: :... . ....:. ::.. .:. .:. : CCDS81 EIIFPHCACDSRRKG-HVITAISITHFKLHACTEEGQLENQVIAFEWDEMQRW------- 430 440 450 460 470 330 340 350 360 370 380 pF1KSD LPSGSTSSPGRGRGEVRLELAFEYLMSKDRLQWVTITSPQAIMMSICLQSMVDELMVKKS .:. : . . ::: .. . .:: : .: .: :.. . :: .: CCDS81 ----DTDEEG-------MAFCFEYARGEKKPRWVKIFTPYFNYMHECFERVFCELKWRKE 480 490 500 510 520 390 400 410 420 430 440 pF1KSD GGSIRKMLRRRVGGTLRRSDSQQAVKSPPLLESPDATRESMVKLSSKLSAVSLRGIGSPS .: .: : CCDS81 --NIFQMARSQQRDVAT 530 540 470 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Nov 2 23:42:13 2016 done: Wed Nov 2 23:42:14 2016 Total Scan time: 2.900 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]