FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9937, 522 aa 1>>>pF1KB9937 522 - 522 aa - 522 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.3681+/-0.000849; mu= 9.8853+/- 0.051 mean_var=136.5455+/-27.319, 0's: 0 Z-trim(111.6): 27 B-trim: 11 in 1/51 Lambda= 0.109758 statistics sampled from 12490 (12513) to 12490 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.723), E-opt: 0.2 (0.384), width: 16 Scan time: 2.920 The best scores are: opt bits E(32554) CCDS32266.1 SNX1 gene_id:6642|Hs108|chr15 ( 522) 3365 544.2 1.3e-154 CCDS58371.1 SNX1 gene_id:6642|Hs108|chr15 ( 557) 3260 527.6 1.4e-149 CCDS32268.1 SNX1 gene_id:6642|Hs108|chr15 ( 457) 2350 383.5 2.8e-106 CCDS34217.1 SNX2 gene_id:6643|Hs108|chr5 ( 519) 1894 311.3 1.7e-84 CCDS64234.1 SNX2 gene_id:6643|Hs108|chr5 ( 402) 1869 307.3 2.2e-83 >>CCDS32266.1 SNX1 gene_id:6642|Hs108|chr15 (522 aa) initn: 3365 init1: 3365 opt: 3365 Z-score: 2889.2 bits: 544.2 E(32554): 1.3e-154 Smith-Waterman score: 3365; 100.0% identity (100.0% similar) in 522 aa overlap (1-522:1-522) 10 20 30 40 50 60 pF1KB9 MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 ITTSLLPINNGSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVLAKTLISLPPQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ITTSLLPINNGSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVLAKTLISLPPQE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 ATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 FAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEFLEKRRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 FAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEFLEKRRA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 ALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDAVSKMTI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDAVSKMTI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 KMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAMLGSSEDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 KMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAMLGSSEDN 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB9 TALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRMKTWQRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 TALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRMKTWQRW 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB9 QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVVRKEVIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVVRKEVIR 430 440 450 460 470 480 490 500 510 520 pF1KB9 FEKEKSKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAKAIS :::::::::::::::::::::::::::::::::::::::::: CCDS32 FEKEKSKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAKAIS 490 500 510 520 >>CCDS58371.1 SNX1 gene_id:6642|Hs108|chr15 (557 aa) initn: 3260 init1: 3260 opt: 3260 Z-score: 2799.0 bits: 527.6 E(32554): 1.4e-149 Smith-Waterman score: 3260; 100.0% identity (100.0% similar) in 506 aa overlap (1-506:1-506) 10 20 30 40 50 60 pF1KB9 MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 ITTSLLPINNGSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVLAKTLISLPPQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 ITTSLLPINNGSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVLAKTLISLPPQE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 ATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 ATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 FAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEFLEKRRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 FAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEFLEKRRA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 ALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDAVSKMTI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 ALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDAVSKMTI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 KMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAMLGSSEDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 KMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAMLGSSEDN 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB9 TALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRMKTWQRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 TALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRMKTWQRW 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB9 QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVVRKEVIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVVRKEVIR 430 440 450 460 470 480 490 500 510 520 pF1KB9 FEKEKSKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAKAIS :::::::::::::::::::::::::: CCDS58 FEKEKSKDFKNHVIKYLETLLYSQQQAGEQLGIRSGILLTKKLPRYSKFFSTVHKFCAAA 490 500 510 520 530 540 >>CCDS32268.1 SNX1 gene_id:6642|Hs108|chr15 (457 aa) initn: 2944 init1: 2349 opt: 2350 Z-score: 2021.5 bits: 383.5 E(32554): 2.8e-106 Smith-Waterman score: 2818; 87.5% identity (87.5% similar) in 522 aa overlap (1-522:1-457) 10 20 30 40 50 60 pF1KB9 MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 ITTSLLPINNGSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVLAKTLISLPPQE :::::::::::::::::::::::::::::: CCDS32 ITTSLLPINNGSKENGIHEEQDQEPQDLFA------------------------------ 70 80 90 130 140 150 160 170 180 pF1KB9 ATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQ ::::::::::::::::::::::::: CCDS32 -----------------------------------GDGMNAYVAYKVTTQTSLPLFRSKQ 100 110 190 200 210 220 230 240 pF1KB9 FAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEFLEKRRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 FAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEFLEKRRA 120 130 140 150 160 170 250 260 270 280 290 300 pF1KB9 ALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDAVSKMTI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDAVSKMTI 180 190 200 210 220 230 310 320 330 340 350 360 pF1KB9 KMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAMLGSSEDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 KMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAMLGSSEDN 240 250 260 270 280 290 370 380 390 400 410 420 pF1KB9 TALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRMKTWQRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 TALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRMKTWQRW 300 310 320 330 340 350 430 440 450 460 470 480 pF1KB9 QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVVRKEVIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVVRKEVIR 360 370 380 390 400 410 490 500 510 520 pF1KB9 FEKEKSKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAKAIS :::::::::::::::::::::::::::::::::::::::::: CCDS32 FEKEKSKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAKAIS 420 430 440 450 >>CCDS34217.1 SNX2 gene_id:6643|Hs108|chr5 (519 aa) initn: 2009 init1: 1868 opt: 1894 Z-score: 1630.4 bits: 311.3 E(32554): 1.7e-84 Smith-Waterman score: 1971; 60.2% identity (80.1% similar) in 532 aa overlap (10-522:2-519) 10 20 30 40 50 pF1KB9 MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTG--AAVVSKHQS :.:: :::. : ..: .. .:::.::. ... :. .: CCDS34 MAAEREPPPL-----------GDGKPTDFEDLEDGEDLFTSTVSTLESSPSS 10 20 30 40 60 70 80 90 100 pF1KB9 PKITTSLLPINNGSKE-NGIHEEQ---DQEPQDLFADATVELSLDSTQNNQ--------- :. .. :: .. : . :: . . :.. .::::.:: :.:::: . . CCDS34 PEPAS--LPAEDISANSNGPKPTEVVLDDDREDLFAEATEEVSLDSPEREPILSSEPSPA 50 60 70 80 90 110 120 130 140 150 160 pF1KB9 -KKVLAKTLISLPPQEATNSSKP---QPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNA : :::. : :. . : : . . ::.::: . : ::. .:..::::.:::::: CCDS34 VTPVTPTTLIA-PRIESKSMSAPVIFDRSREEIEEEANGDIFDIEIGVSDPEKVGDGMNA 100 110 120 130 140 150 170 180 190 200 210 220 pF1KB9 YVAYKVTTQTSLPLFRSKQFAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTK :.::.:::.::: .: ...:.:::::::::::. ::. :. . :.:::: ::::..:::: CCDS34 YMAYRVTTKTSLSMFSKSEFSVKRRFSDFLGLHSKLASKYLHVGYIVPPAPEKSIVGMTK 160 170 180 190 200 210 230 240 250 260 270 280 pF1KB9 VKVGKEDSSSAEFLEKRRAALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSG ::::::::::.::.::::::::::::: :.:::.:::::.:.:::. ::::::.::.::: CCDS34 VKVGKEDSSSTEFVEKRRAALERYLQRTVKHPTLLQDPDLRQFLESSELPRAVNTQALSG 220 230 240 250 260 270 290 300 310 320 330 340 pF1KB9 AGLLKMFNKATDAVSKMTIKMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELA ::.:.: :::.:::.:::::::::: ::::: :. : .:.:::::. ::.:: :::::. CCDS34 AGILRMVNKAADAVNKMTIKMNESDAWFEEKQQQFENLDQQLRKLHVSVEALVCHRKELS 280 290 300 310 320 330 350 360 370 380 390 400 pF1KB9 LNTAQFAKSLAMLGSSEDNTALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIR ::: :::: ::::.:::.:::::::::::::::::.::::::: ::....:::::::: CCDS34 ANTAAFAKSAAMLGNSEDHTALSRALSQLAEVEEKIDQLHQEQAFADFYMFSELLSDYIR 340 350 360 370 380 390 410 420 430 440 450 460 pF1KB9 LLAIVRAAFDQRMKTWQRWQDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVT :.: :...::.::: ::.:.::: :: :::::::... ::::::.::::.:: :::..: CCDS34 LIAAVKGVFDHRMKCWQKWEDAQITLLKKREAEAKMMVANKPDKIQQAKNEIREWEAKVQ 400 410 420 430 440 450 470 480 490 500 510 520 pF1KB9 QYERDFERISTVVRKEVIRFEKEKSKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAKAI : :::::.:: ..:::: :::::. ::::. .:::::.:. .:::: ::::::::::::: CCDS34 QGERDFEQISKTIRKEVGRFEKERVKDFKTVIIKYLESLVQTQQQLIKYWEAFLPEAKAI 460 470 480 490 500 510 pF1KB9 S . CCDS34 A >>CCDS64234.1 SNX2 gene_id:6643|Hs108|chr5 (402 aa) initn: 1893 init1: 1868 opt: 1869 Z-score: 1610.6 bits: 307.3 E(32554): 2.2e-83 Smith-Waterman score: 1869; 71.0% identity (89.6% similar) in 393 aa overlap (130-522:10-402) 100 110 120 130 140 150 pF1KB9 STQNNQKKVLAKTLISLPPQEATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGM . ::.::: . : ::. .:..::::.:::: CCDS64 MSAPVIFDRSREEIEEEANGDIFDIEIGVSDPEKVGDGM 10 20 30 160 170 180 190 200 210 pF1KB9 NAYVAYKVTTQTSLPLFRSKQFAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGM :::.::.:::.::: .: ...:.:::::::::::. ::. :. . :.:::: ::::..:: CCDS64 NAYMAYRVTTKTSLSMFSKSEFSVKRRFSDFLGLHSKLASKYLHVGYIVPPAPEKSIVGM 40 50 60 70 80 90 220 230 240 250 260 270 pF1KB9 TKVKVGKEDSSSAEFLEKRRAALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTL ::::::::::::.::.::::::::::::: :.:::.:::::.:.:::. ::::::.::.: CCDS64 TKVKVGKEDSSSTEFVEKRRAALERYLQRTVKHPTLLQDPDLRQFLESSELPRAVNTQAL 100 110 120 130 140 150 280 290 300 310 320 330 pF1KB9 SGAGLLKMFNKATDAVSKMTIKMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKE ::::.:.: :::.:::.:::::::::: ::::: :. : .:.:::::. ::.:: :::: CCDS64 SGAGILRMVNKAADAVNKMTIKMNESDAWFEEKQQQFENLDQQLRKLHVSVEALVCHRKE 160 170 180 190 200 210 340 350 360 370 380 390 pF1KB9 LALNTAQFAKSLAMLGSSEDNTALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDY :. ::: :::: ::::.:::.:::::::::::::::::.::::::: ::....:::::: CCDS64 LSANTAAFAKSAAMLGNSEDHTALSRALSQLAEVEEKIDQLHQEQAFADFYMFSELLSDY 220 230 240 250 260 270 400 410 420 430 440 450 pF1KB9 IRLLAIVRAAFDQRMKTWQRWQDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESR :::.: :...::.::: ::.:.::: :: :::::::... ::::::.::::.:: :::.. CCDS64 IRLIAAVKGVFDHRMKCWQKWEDAQITLLKKREAEAKMMVANKPDKIQQAKNEIREWEAK 280 290 300 310 320 330 460 470 480 490 500 510 pF1KB9 VTQYERDFERISTVVRKEVIRFEKEKSKDFKNHVIKYLETLLYSQQQLAKYWEAFLPEAK : : :::::.:: ..:::: :::::. ::::. .:::::.:. .:::: ::::::::::: CCDS64 VQQGERDFEQISKTIRKEVGRFEKERVKDFKTVIIKYLESLVQTQQQLIKYWEAFLPEAK 340 350 360 370 380 390 520 pF1KB9 AIS ::. CCDS64 AIA 400 522 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 09:30:39 2016 done: Fri Nov 4 09:30:40 2016 Total Scan time: 2.920 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]