FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2610, 282 aa
  1>>>pF1KE2610 282 - 282 aa - 282 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2782+/-0.000917; mu= 15.0694+/- 0.055
 mean_var=59.6139+/-12.208, 0's: 0 Z-trim(105.4): 27 B-trim: 0 in 0/49
 Lambda= 0.166112
 statistics sampled from 8362 (8385) to 8362 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.619), E-opt: 0.2 (0.258), width: 16
 Scan time: 1.490

The best scores are:                             opt  bits E(32554)
CCDS31798.1 AQP6 gene_id:363|Hs108|chr12 ( 282) 1818 444.0 5.7e-125
CCDS8792.1 AQP2 gene_id:359|Hs108|chr12  ( 271)  993 246.3 1.8e-65
CCDS8793.1 AQP5 gene_id:362|Hs108|chr12  ( 265)  954 236.9 1.2e-62
CCDS8919.1 MIP gene_id:4284|Hs108|chr12  ( 263)  858 213.9 9.6e-56
CCDS58617.1 AQP4 gene_id:361|Hs108|chr18 ( 301)  664 167.5 1.1e-41
CCDS11889.1 AQP4 gene_id:361|Hs108|chr18 ( 323)  664 167.5 1.1e-41
CCDS5431.1 AQP1 gene_id:358|Hs108|chr7   ( 269)  634 160.2 1.4e-39
CCDS55096.1 AQP1 gene_id:358|Hs108|chr7  ( 154)  432 111.7 3.3e-25
CCDS55098.1 AQP1 gene_id:358|Hs108|chr7  ( 186)  432 111.8 3.8e-25
CCDS55097.1 AQP1 gene_id:358|Hs108|chr7  ( 218)  432 111.8 4.4e-25
CCDS82244.1 AQP4 gene_id:361|Hs108|chr18 ( 296)  433 112.1 4.9e-25
CCDS10626.1 AQP8 gene_id:343|Hs108|chr16 ( 261)  373  97.7 9.3e-21

>>CCDS31798.1 AQP6 gene_id:363|Hs108|chr12 (282 aa)
 initn: 1818 init1: 1818 opt: 1818 Z-score: 2357.1 bits: 444.0 E(32554): 5.7e-125
Smith-Waterman score: 1818; 100.0% identity (100.0% similar) in 282 aa overlap (1-282:1-282)

               10        20        30        40        50        60
pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2
TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTSGSPATMIGISV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTSGSPATMIGISV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 ALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 ALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTK 190 200 210 220 230 240 250 260 270 280 pF1KE2 TLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV :::::::::::::::::::::::::::::::::::::::::: CCDS31 TLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV 250 260 270 280 >>CCDS8792.1 AQP2 gene_id:359|Hs108|chr12 (271 aa) initn: 977 init1: 574 opt: 993 Z-score: 1288.9 bits: 246.3 E(32554): 1.8e-65 Smith-Waterman score: 993; 62.3% identity (85.4% similar) in 239 aa overlap (19-251:1-239) 10 20 30 40 50 pF1KE2 MDAVEPGGRGWASMLACRLWK----AISRALFAEFLATGLYVFFGVGSVMRWPTALPSVL .:. :.:::.::::::: :.::::.::.. :: :::::: CCDS87 MWELRSIAFSRAVFAEFLATLLFVFFGLGSALNWPQALPSVL 10 20 30 40 60 70 80 90 100 110 pF1KE2 QIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAA :::..:.: . ::. . :::: :::::.: ::: :.:. ::. ::::::.::..::: CCDS87 QIAMAFGLGIGTLVQALGHISGAHINPAVTVACLVGCHVSVLRAAFYVAAQLLGAVAGAA 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE2 LLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSR--QTSGSPAT ::. . :.::: :..:.. ::...::::.:::.::::::::.::::: : .. :.:: CCDS87 LLHEITPADIRGDLAVNALSNSTTAGQAVTVELFLTLQLVLCIFASTDERRGENPGTPAL 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE2 MIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFV ::.:::::::.:::.::::::::::..::.. 
::: :::::.:::.::.:.::.::.: CCDS87 SIGFSVALGHLLGIHYTGCSMNPARSLAPAVVTGKFDDHWVFWIGPLVGAILGSLLYNYV 170 180 190 200 210 220 240 250 260 270 280 pF1KE2 LFPDTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV ::: .:.:..:::.: : CCDS87 LFPPAKSLSERLAVLKGLEPDTDWEEREVRRRQSVELHSPQSLPRGTKA 230 240 250 260 270 >>CCDS8793.1 AQP5 gene_id:362|Hs108|chr12 (265 aa) initn: 955 init1: 564 opt: 954 Z-score: 1238.5 bits: 236.9 E(32554): 1.2e-62 Smith-Waterman score: 954; 58.5% identity (88.1% similar) in 236 aa overlap (22-254:9-244) 10 20 30 40 50 60 pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI :. .:.::::::: ..::::.::...::.:::..::::. CCDS87 MKKEVCSVAFLKAVFAEFLATLIFVFFGLGSALKWPSALPTILQIAL 10 20 30 40 70 80 90 100 110 120 pF1KE2 TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG .:.:. . .:. .::.: :::.:::.:::..::: :: ::::::::: .::..::: CCDS87 AFGLAIGTLAQALGPVSGGHINPAITLALLVGNQISLLRAFFYVAAQLVGAIAGAGILYG 50 60 70 80 90 100 130 140 150 160 170 pF1KE2 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTS--GSPATMIGI : : . : .:..:.. :... :::..:::.::.::.::.:::::::.:: :::: ::. CCDS87 VAPLNARGNLAVNALNNNTTQGQAMVVELILTFQLALCIFASTDSRRTSPVGSPALSIGL 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE2 SVALGHLIGIHFTGCSMNPARSFGPAIIIGKFT-VHWVFWVGPLMGALLASLIYNFVLFP ::.::::.::.:::::::::::::::.....:. .::::::::..::.::...: ..::: CCDS87 SVTLGHLVGIYFTGCSMNPARSFGPAVVMNRFSPAHWVFWVGPIVGAVLAAILYFYLLFP 170 180 190 200 210 220 240 250 260 270 280 pF1KE2 DTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV .. .:..:.::. :: : CCDS87 NSLSLSERVAIIKGTYEPDEDWEEQREERKKTMELTTR 230 240 250 260 >>CCDS8919.1 MIP gene_id:4284|Hs108|chr12 (263 aa) initn: 851 init1: 507 opt: 858 Z-score: 1114.2 bits: 213.9 E(32554): 9.6e-56 Smith-Waterman score: 858; 52.6% identity (83.3% similar) in 251 aa overlap (25-271:11-261) 10 20 30 40 50 60 pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI ::.::::.:: .:::::.:: .:: . :::.:. 
CCDS89 MWELRSASFWRAIFAEFFATLFYVFFGLGSSLRWAPGPLHVLQVAM 10 20 30 40 70 80 90 100 110 120 pF1KE2 TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG .:.:. : :: . . ::::.:::::.::::::..:: :: :.::::.::..:::.::. CCDS89 AFGLALATLVQSVGHISGAHVNPAVTFAFLVGSQMSLLRAFCYMAAQLLGAVAGAAVLYS 50 60 70 80 90 100 130 140 150 160 170 pF1KE2 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTS--GSPATMIGI : : .: .:..:... .::.:::..::..::::.:::.::. : :... :: : .:. CCDS89 VTPPAVRGNLALNTLHPAVSVGQATTVEIFLTLQFVLCIFATYDERRNGQLGSVALAVGF 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE2 SVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPD :.:::::.:...:: .:::::::.:::. :.:: :::.::::..:. :.::.:.:.::: CCDS89 SLALGHLFGMYYTGAGMNPARSFAPAILTGNFTNHWVYWVGPIIGGGLGSLLYDFLLFPR 170 180 190 200 210 220 240 250 260 270 280 pF1KE2 TKTLAQRLAILTGTV-EVGTGAG-AGAEPLKKESQPGSGAVEMESV :....::..: :. .:..: . .::.. ..: CCDS89 LKSISERLSVLKGAKPDVSNGQPEVTGEPVELNTQAL 230 240 250 260 >>CCDS58617.1 AQP4 gene_id:361|Hs108|chr18 (301 aa) initn: 602 init1: 313 opt: 664 Z-score: 862.0 bits: 167.5 E(32554): 1.1e-41 Smith-Waterman score: 664; 44.3% identity (77.9% similar) in 235 aa overlap (14-240:1-235) 10 20 30 40 50 pF1KE2 MDAVEPGGRGWASMLACR-LW-KAISRALFAEFLATGLYVFFGVGSVMRW---PTALP-S :.: . .: .:. .:. ::::: ..:....::.. : :: . CCDS58 MVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWGGTEKPLPVD 10 20 30 40 60 70 80 90 100 110 pF1KE2 VLQIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVG .. :.. :.: : :: . ::.: :::::.:.. .::. ..: :.::: .:: .: CCDS58 MVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIAAQCLGAIIG 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE2 AALLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQT--SGSP :..:: : : .. ::...:......:... :::..:.:::. .::: ::..: .:: CCDS58 AGILYLVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFASCDSKRTDVTGSI 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE2 ATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYN : ::.:::.:::..:..:: :::::::::::.:.:.. ::..::::..::.::. .:. 
CCDS58 ALAIGFSVAIGHLFAINYTGASMNPARSFGPAVIMGNWENHWIYWVGPIIGAVLAGGLYE 170 180 190 200 210 220 240 250 260 270 280 pF1KE2 FVLFPDTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV .:. ::.. CCDS58 YVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEE 230 240 250 260 270 280 >>CCDS11889.1 AQP4 gene_id:361|Hs108|chr18 (323 aa) initn: 602 init1: 313 opt: 664 Z-score: 861.6 bits: 167.5 E(32554): 1.1e-41 Smith-Waterman score: 664; 44.3% identity (77.9% similar) in 235 aa overlap (14-240:23-257) 10 20 30 40 pF1KE2 MDAVEPGGRGWASMLACR-LW-KAISRALFAEFLATGLYVFFGVGSVMRW- :.: . .: .:. .:. ::::: ..:....::.. : CCDS11 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE2 --PTALP-SVLQIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVA :: ... :.. :.: : :: . ::.: :::::.:.. .::. ..: :.: CCDS11 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE2 AQLVGATVGAALLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDS :: .:: .::..:: : : .. ::...:......:... :::..:.:::. .::: :: CCDS11 AQCLGAIIGAGILYLVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFASCDS 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE2 RQT--SGSPATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMG ..: .:: : ::.:::.:::..:..:: :::::::::::.:.:.. ::..::::..: CCDS11 KRTDVTGSIALAIGFSVAIGHLFAINYTGASMNPARSFGPAVIMGNWENHWIYWVGPIIG 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE2 ALLASLIYNFVLFPDTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV :.::. .:..:. ::.. CCDS11 AVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDDLILKPGVVH 250 260 270 280 290 300 >>CCDS5431.1 AQP1 gene_id:358|Hs108|chr7 (269 aa) initn: 697 init1: 326 opt: 634 Z-score: 823.9 bits: 160.2 E(32554): 1.4e-39 Smith-Waterman score: 686; 46.2% identity (76.2% similar) in 240 aa overlap (25-254:12-251) 10 20 30 40 50 pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVM--RWP-----TALP ::. :::::: :.::...::.. ..: ::. 
CCDS54 MASEFKKKLFWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQTAVQ 10 20 30 40 60 70 80 90 100 110 pF1KE2 SVLQIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATV . ......:.: : .: . . :::: ::::::..:.. .::. ::. :. :: ::: : CCDS54 DNVKVSLAFGLSIATLAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYIIAQCVGAIV 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE2 GAALLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTS--GS ..:.: :. . ..:: : . ..:..::....:.. ::::::::.:.:: :. . :: CCDS54 ATAILSGITSSLTGNSLGRNDLADGVNSGQGLGIEIIGTLQLVLCVLATTDRRRRDLGGS 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE2 PATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIY ::.:::::::..: .:::..::::::: :.: .:. ::.:::::..:. :: ::: CCDS54 APLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNHWIFWVGPFIGGALAVLIY 170 180 190 200 210 220 240 250 260 270 280 pF1KE2 NFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGAEPLKKESQPGSGAVEMESV .:.: : .. :..:. . : : :: CCDS54 DFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK 230 240 250 260 >>CCDS55096.1 AQP1 gene_id:358|Hs108|chr7 (154 aa) initn: 431 init1: 326 opt: 432 Z-score: 566.1 bits: 111.7 E(32554): 3.3e-25 Smith-Waterman score: 432; 50.0% identity (78.0% similar) in 132 aa overlap (130-254:5-136) 100 110 120 130 140 150 pF1KE2 AVAYVAAQLVGATVGAALLYGVMPGDIRETLGINVVR----NSVSTGQAVAVELLLTLQL .: ::. ..:..::....:.. :::: CCDS55 MQSGMGWNVLDFWLADGVNSGQGLGIEIIGTLQL 10 20 30 160 170 180 190 200 210 pF1KE2 VLCVFASTDSRQTS--GSPATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVH ::::.:.:: :. . :: ::.:::::::..: .:::..::::::: :.: .:. : CCDS55 VLCVLATTDRRRRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNH 40 50 60 70 80 90 220 230 240 250 260 270 pF1KE2 WVFWVGPLMGALLASLIYNFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGAEPLKKESQP :.:::::..:. :: :::.:.: : .. :..:. . 
: : :: CCDS55 WIFWVGPFIGGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK 100 110 120 130 140 150 280 pF1KE2 GSGAVEMESV >>CCDS55098.1 AQP1 gene_id:358|Hs108|chr7 (186 aa) initn: 416 init1: 326 opt: 432 Z-score: 564.8 bits: 111.8 E(32554): 3.8e-25 Smith-Waterman score: 432; 48.9% identity (74.5% similar) in 141 aa overlap (117-254:35-168) 90 100 110 120 130 140 pF1KE2 LAFLVGSHISLPRAVAYVAAQLVGATVGAALLYGVMPGDIRETLGINVVRNSVSTGQAVA :: : ::. . . :.: .::... CCDS55 RPLPLVLVPQNTLAWMQLDAKAPAHPRPLQLLGRVGPGSRQLADGVN-------SGQGLG 10 20 30 40 50 150 160 170 180 190 200 pF1KE2 VELLLTLQLVLCVFASTDSRQTS--GSPATMIGISVALGHLIGIHFTGCSMNPARSFGPA .:.. ::::::::.:.:: :. . :: ::.:::::::..: .:::..::::::: : CCDS55 IEIIGTLQLVLCVLATTDRRRRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSA 60 70 80 90 100 110 210 220 230 240 250 260 pF1KE2 IIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGA .: .:. ::.:::::..:. :: :::.:.: : .. :..:. . : : :: CCDS55 VITHNFSNHWIFWVGPFIGGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDI 120 130 140 150 160 170 270 280 pF1KE2 EPLKKESQPGSGAVEMESV CCDS55 NSRVEMKPK 180 >>CCDS55097.1 AQP1 gene_id:358|Hs108|chr7 (218 aa) initn: 416 init1: 326 opt: 432 Z-score: 563.8 bits: 111.8 E(32554): 4.4e-25 Smith-Waterman score: 432; 48.9% identity (74.5% similar) in 141 aa overlap (117-254:67-200) 90 100 110 120 130 140 pF1KE2 LAFLVGSHISLPRAVAYVAAQLVGATVGAALLYGVMPGDIRETLGINVVRNSVSTGQAVA :: : ::. . . :.: .::... CCDS55 RPLPLVLVPQNTLAWMQLDAKAPAHPRPLQLLGRVGPGSRQLADGVN-------SGQGLG 40 50 60 70 80 150 160 170 180 190 200 pF1KE2 VELLLTLQLVLCVFASTDSRQTS--GSPATMIGISVALGHLIGIHFTGCSMNPARSFGPA .:.. ::::::::.:.:: :. . :: ::.:::::::..: .:::..::::::: : CCDS55 IEIIGTLQLVLCVLATTDRRRRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSA 90 100 110 120 130 140 210 220 230 240 250 260 pF1KE2 IIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGA .: .:. ::.:::::..:. :: :::.:.: : .. :..:. . 
: : ::
CCDS55 VITHNFSNHWIFWVGPFIGGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDI
     150       160       170       180       190       200

           270       280
pF1KE2 EPLKKESQPGSGAVEMESV

CCDS55 NSRVEMKPK
     210

282 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 17:03:33 2016 done: Tue Nov 8 17:03:33 2016
 Total Scan time: 1.490 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]