FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2610, 282 aa
1>>>pF1KE2610 282 - 282 aa - 282 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2782+/-0.000917; mu= 15.0694+/- 0.055
mean_var=59.6139+/-12.208, 0's: 0 Z-trim(105.4): 27 B-trim: 0 in 0/49
Lambda= 0.166112
statistics sampled from 8362 (8385) to 8362 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.619), E-opt: 0.2 (0.258), width: 16
Scan time: 1.490
The best scores are: opt bits E(32554)
CCDS31798.1 AQP6 gene_id:363|Hs108|chr12 ( 282) 1818 444.0 5.7e-125
CCDS8792.1 AQP2 gene_id:359|Hs108|chr12 ( 271) 993 246.3 1.8e-65
CCDS8793.1 AQP5 gene_id:362|Hs108|chr12 ( 265) 954 236.9 1.2e-62
CCDS8919.1 MIP gene_id:4284|Hs108|chr12 ( 263) 858 213.9 9.6e-56
CCDS58617.1 AQP4 gene_id:361|Hs108|chr18 ( 301) 664 167.5 1.1e-41
CCDS11889.1 AQP4 gene_id:361|Hs108|chr18 ( 323) 664 167.5 1.1e-41
CCDS5431.1 AQP1 gene_id:358|Hs108|chr7 ( 269) 634 160.2 1.4e-39
CCDS55096.1 AQP1 gene_id:358|Hs108|chr7 ( 154) 432 111.7 3.3e-25
CCDS55098.1 AQP1 gene_id:358|Hs108|chr7 ( 186) 432 111.8 3.8e-25
CCDS55097.1 AQP1 gene_id:358|Hs108|chr7 ( 218) 432 111.8 4.4e-25
CCDS82244.1 AQP4 gene_id:361|Hs108|chr18 ( 296) 433 112.1 4.9e-25
CCDS10626.1 AQP8 gene_id:343|Hs108|chr16 ( 261) 373 97.7 9.3e-21
>>CCDS31798.1 AQP6 gene_id:363|Hs108|chr12 (282 aa)
initn: 1818 init1: 1818 opt: 1818 Z-score: 2357.1 bits: 444.0 E(32554): 5.7e-125
Smith-Waterman score: 1818; 100.0% identity (100.0% similar) in 282 aa overlap (1-282:1-282)
10 20 30 40 50 60
pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTSGSPATMIGISV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTSGSPATMIGISV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 ALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 ALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTK
190 200 210 220 230 240
250 260 270 280
pF1KE2 TLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV
::::::::::::::::::::::::::::::::::::::::::
CCDS31 TLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV
250 260 270 280
>>CCDS8792.1 AQP2 gene_id:359|Hs108|chr12 (271 aa)
initn: 977 init1: 574 opt: 993 Z-score: 1288.9 bits: 246.3 E(32554): 1.8e-65
Smith-Waterman score: 993; 62.3% identity (85.4% similar) in 239 aa overlap (19-251:1-239)
10 20 30 40 50
pF1KE2 MDAVEPGGRGWASMLACRLWK----AISRALFAEFLATGLYVFFGVGSVMRWPTALPSVL
.:. :.:::.::::::: :.::::.::.. :: ::::::
CCDS87 MWELRSIAFSRAVFAEFLATLLFVFFGLGSALNWPQALPSVL
10 20 30 40
60 70 80 90 100 110
pF1KE2 QIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAA
:::..:.: . ::. . :::: :::::.: ::: :.:. ::. ::::::.::..:::
CCDS87 QIAMAFGLGIGTLVQALGHISGAHINPAVTVACLVGCHVSVLRAAFYVAAQLLGAVAGAA
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE2 LLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSR--QTSGSPAT
::. . :.::: :..:.. ::...::::.:::.::::::::.::::: : .. :.::
CCDS87 LLHEITPADIRGDLAVNALSNSTTAGQAVTVELFLTLQLVLCIFASTDERRGENPGTPAL
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE2 MIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFV
::.:::::::.:::.::::::::::..::.. ::: :::::.:::.::.:.::.::.:
CCDS87 SIGFSVALGHLLGIHYTGCSMNPARSLAPAVVTGKFDDHWVFWIGPLVGAILGSLLYNYV
170 180 190 200 210 220
240 250 260 270 280
pF1KE2 LFPDTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV
::: .:.:..:::.: :
CCDS87 LFPPAKSLSERLAVLKGLEPDTDWEEREVRRRQSVELHSPQSLPRGTKA
230 240 250 260 270
>>CCDS8793.1 AQP5 gene_id:362|Hs108|chr12 (265 aa)
initn: 955 init1: 564 opt: 954 Z-score: 1238.5 bits: 236.9 E(32554): 1.2e-62
Smith-Waterman score: 954; 58.5% identity (88.1% similar) in 236 aa overlap (22-254:9-244)
10 20 30 40 50 60
pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI
:. .:.::::::: ..::::.::...::.:::..::::.
CCDS87 MKKEVCSVAFLKAVFAEFLATLIFVFFGLGSALKWPSALPTILQIAL
10 20 30 40
70 80 90 100 110 120
pF1KE2 TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG
.:.:. . .:. .::.: :::.:::.:::..::: :: ::::::::: .::..:::
CCDS87 AFGLAIGTLAQALGPVSGGHINPAITLALLVGNQISLLRAFFYVAAQLVGAIAGAGILYG
50 60 70 80 90 100
130 140 150 160 170
pF1KE2 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTS--GSPATMIGI
: : . : .:..:.. :... :::..:::.::.::.::.:::::::.:: :::: ::.
CCDS87 VAPLNARGNLAVNALNNNTTQGQAMVVELILTFQLALCIFASTDSRRTSPVGSPALSIGL
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE2 SVALGHLIGIHFTGCSMNPARSFGPAIIIGKFT-VHWVFWVGPLMGALLASLIYNFVLFP
::.::::.::.:::::::::::::::.....:. .::::::::..::.::...: ..:::
CCDS87 SVTLGHLVGIYFTGCSMNPARSFGPAVVMNRFSPAHWVFWVGPIVGAVLAAILYFYLLFP
170 180 190 200 210 220
240 250 260 270 280
pF1KE2 DTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV
.. .:..:.::. :: :
CCDS87 NSLSLSERVAIIKGTYEPDEDWEEQREERKKTMELTTR
230 240 250 260
>>CCDS8919.1 MIP gene_id:4284|Hs108|chr12 (263 aa)
initn: 851 init1: 507 opt: 858 Z-score: 1114.2 bits: 213.9 E(32554): 9.6e-56
Smith-Waterman score: 858; 52.6% identity (83.3% similar) in 251 aa overlap (25-271:11-261)
10 20 30 40 50 60
pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVMRWPTALPSVLQIAI
::.::::.:: .:::::.:: .:: . :::.:.
CCDS89 MWELRSASFWRAIFAEFFATLFYVFFGLGSSLRWAPGPLHVLQVAM
10 20 30 40
70 80 90 100 110 120
pF1KE2 TFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVGAALLYG
.:.:. : :: . . ::::.:::::.::::::..:: :: :.::::.::..:::.::.
CCDS89 AFGLALATLVQSVGHISGAHVNPAVTFAFLVGSQMSLLRAFCYMAAQLLGAVAGAAVLYS
50 60 70 80 90 100
130 140 150 160 170
pF1KE2 VMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTS--GSPATMIGI
: : .: .:..:... .::.:::..::..::::.:::.::. : :... :: : .:.
CCDS89 VTPPAVRGNLALNTLHPAVSVGQATTVEIFLTLQFVLCIFATYDERRNGQLGSVALAVGF
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE2 SVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPD
:.:::::.:...:: .:::::::.:::. :.:: :::.::::..:. :.::.:.:.:::
CCDS89 SLALGHLFGMYYTGAGMNPARSFAPAILTGNFTNHWVYWVGPIIGGGLGSLLYDFLLFPR
170 180 190 200 210 220
240 250 260 270 280
pF1KE2 TKTLAQRLAILTGTV-EVGTGAG-AGAEPLKKESQPGSGAVEMESV
:....::..: :. .:..: . .::.. ..:
CCDS89 LKSISERLSVLKGAKPDVSNGQPEVTGEPVELNTQAL
230 240 250 260
>>CCDS58617.1 AQP4 gene_id:361|Hs108|chr18 (301 aa)
initn: 602 init1: 313 opt: 664 Z-score: 862.0 bits: 167.5 E(32554): 1.1e-41
Smith-Waterman score: 664; 44.3% identity (77.9% similar) in 235 aa overlap (14-240:1-235)
10 20 30 40 50
pF1KE2 MDAVEPGGRGWASMLACR-LW-KAISRALFAEFLATGLYVFFGVGSVMRW---PTALP-S
:.: . .: .:. .:. ::::: ..:....::.. : :: .
CCDS58 MVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWGGTEKPLPVD
10 20 30 40
60 70 80 90 100 110
pF1KE2 VLQIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATVG
.. :.. :.: : :: . ::.: :::::.:.. .::. ..: :.::: .:: .:
CCDS58 MVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIAAQCLGAIIG
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE2 AALLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQT--SGSP
:..:: : : .. ::...:......:... :::..:.:::. .::: ::..: .::
CCDS58 AGILYLVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFASCDSKRTDVTGSI
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE2 ATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIYN
: ::.:::.:::..:..:: :::::::::::.:.:.. ::..::::..::.::. .:.
CCDS58 ALAIGFSVAIGHLFAINYTGASMNPARSFGPAVIMGNWENHWIYWVGPIIGAVLAGGLYE
170 180 190 200 210 220
240 250 260 270 280
pF1KE2 FVLFPDTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV
.:. ::..
CCDS58 YVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEE
230 240 250 260 270 280
>>CCDS11889.1 AQP4 gene_id:361|Hs108|chr18 (323 aa)
initn: 602 init1: 313 opt: 664 Z-score: 861.6 bits: 167.5 E(32554): 1.1e-41
Smith-Waterman score: 664; 44.3% identity (77.9% similar) in 235 aa overlap (14-240:23-257)
10 20 30 40
pF1KE2 MDAVEPGGRGWASMLACR-LW-KAISRALFAEFLATGLYVFFGVGSVMRW-
:.: . .: .:. .:. ::::: ..:....::.. :
CCDS11 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE2 --PTALP-SVLQIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVA
:: ... :.. :.: : :: . ::.: :::::.:.. .::. ..: :.:
CCDS11 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE2 AQLVGATVGAALLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDS
:: .:: .::..:: : : .. ::...:......:... :::..:.:::. .::: ::
CCDS11 AQCLGAIIGAGILYLVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFASCDS
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE2 RQT--SGSPATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMG
..: .:: : ::.:::.:::..:..:: :::::::::::.:.:.. ::..::::..:
CCDS11 KRTDVTGSIALAIGFSVAIGHLFAINYTGASMNPARSFGPAVIMGNWENHWIYWVGPIIG
190 200 210 220 230 240
230 240 250 260 270 280
pF1KE2 ALLASLIYNFVLFPDTKTLAQRLAILTGTVEVGTGAGAGAEPLKKESQPGSGAVEMESV
:.::. .:..:. ::..
CCDS11 AVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDDLILKPGVVH
250 260 270 280 290 300
>>CCDS5431.1 AQP1 gene_id:358|Hs108|chr7 (269 aa)
initn: 697 init1: 326 opt: 634 Z-score: 823.9 bits: 160.2 E(32554): 1.4e-39
Smith-Waterman score: 686; 46.2% identity (76.2% similar) in 240 aa overlap (25-254:12-251)
10 20 30 40 50
pF1KE2 MDAVEPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGSVM--RWP-----TALP
::. :::::: :.::...::.. ..: ::.
CCDS54 MASEFKKKLFWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQTAVQ
10 20 30 40
60 70 80 90 100 110
pF1KE2 SVLQIAITFNLVTAMAVQVTWKASGAHANPAVTLAFLVGSHISLPRAVAYVAAQLVGATV
. ......:.: : .: . . :::: ::::::..:.. .::. ::. :. :: ::: :
CCDS54 DNVKVSLAFGLSIATLAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYIIAQCVGAIV
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE2 GAALLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVFASTDSRQTS--GS
..:.: :. . ..:: : . ..:..::....:.. ::::::::.:.:: :. . ::
CCDS54 ATAILSGITSSLTGNSLGRNDLADGVNSGQGLGIEIIGTLQLVLCVLATTDRRRRDLGGS
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE2 PATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVHWVFWVGPLMGALLASLIY
::.:::::::..: .:::..::::::: :.: .:. ::.:::::..:. :: :::
CCDS54 APLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNHWIFWVGPFIGGALAVLIY
170 180 190 200 210 220
240 250 260 270 280
pF1KE2 NFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGAEPLKKESQPGSGAVEMESV
.:.: : .. :..:. . : : ::
CCDS54 DFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK
230 240 250 260
>>CCDS55096.1 AQP1 gene_id:358|Hs108|chr7 (154 aa)
initn: 431 init1: 326 opt: 432 Z-score: 566.1 bits: 111.7 E(32554): 3.3e-25
Smith-Waterman score: 432; 50.0% identity (78.0% similar) in 132 aa overlap (130-254:5-136)
100 110 120 130 140 150
pF1KE2 AVAYVAAQLVGATVGAALLYGVMPGDIRETLGINVVR----NSVSTGQAVAVELLLTLQL
.: ::. ..:..::....:.. ::::
CCDS55 MQSGMGWNVLDFWLADGVNSGQGLGIEIIGTLQL
10 20 30
160 170 180 190 200 210
pF1KE2 VLCVFASTDSRQTS--GSPATMIGISVALGHLIGIHFTGCSMNPARSFGPAIIIGKFTVH
::::.:.:: :. . :: ::.:::::::..: .:::..::::::: :.: .:. :
CCDS55 VLCVLATTDRRRRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNH
40 50 60 70 80 90
220 230 240 250 260 270
pF1KE2 WVFWVGPLMGALLASLIYNFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGAEPLKKESQP
:.:::::..:. :: :::.:.: : .. :..:. . : : ::
CCDS55 WIFWVGPFIGGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK
100 110 120 130 140 150
280
pF1KE2 GSGAVEMESV
>>CCDS55098.1 AQP1 gene_id:358|Hs108|chr7 (186 aa)
initn: 416 init1: 326 opt: 432 Z-score: 564.8 bits: 111.8 E(32554): 3.8e-25
Smith-Waterman score: 432; 48.9% identity (74.5% similar) in 141 aa overlap (117-254:35-168)
90 100 110 120 130 140
pF1KE2 LAFLVGSHISLPRAVAYVAAQLVGATVGAALLYGVMPGDIRETLGINVVRNSVSTGQAVA
:: : ::. . . :.: .::...
CCDS55 RPLPLVLVPQNTLAWMQLDAKAPAHPRPLQLLGRVGPGSRQLADGVN-------SGQGLG
10 20 30 40 50
150 160 170 180 190 200
pF1KE2 VELLLTLQLVLCVFASTDSRQTS--GSPATMIGISVALGHLIGIHFTGCSMNPARSFGPA
.:.. ::::::::.:.:: :. . :: ::.:::::::..: .:::..::::::: :
CCDS55 IEIIGTLQLVLCVLATTDRRRRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSA
60 70 80 90 100 110
210 220 230 240 250 260
pF1KE2 IIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGA
.: .:. ::.:::::..:. :: :::.:.: : .. :..:. . : : ::
CCDS55 VITHNFSNHWIFWVGPFIGGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDI
120 130 140 150 160 170
270 280
pF1KE2 EPLKKESQPGSGAVEMESV
CCDS55 NSRVEMKPK
180
>>CCDS55097.1 AQP1 gene_id:358|Hs108|chr7 (218 aa)
initn: 416 init1: 326 opt: 432 Z-score: 563.8 bits: 111.8 E(32554): 4.4e-25
Smith-Waterman score: 432; 48.9% identity (74.5% similar) in 141 aa overlap (117-254:67-200)
90 100 110 120 130 140
pF1KE2 LAFLVGSHISLPRAVAYVAAQLVGATVGAALLYGVMPGDIRETLGINVVRNSVSTGQAVA
:: : ::. . . :.: .::...
CCDS55 RPLPLVLVPQNTLAWMQLDAKAPAHPRPLQLLGRVGPGSRQLADGVN-------SGQGLG
40 50 60 70 80
150 160 170 180 190 200
pF1KE2 VELLLTLQLVLCVFASTDSRQTS--GSPATMIGISVALGHLIGIHFTGCSMNPARSFGPA
.:.. ::::::::.:.:: :. . :: ::.:::::::..: .:::..::::::: :
CCDS55 IEIIGTLQLVLCVLATTDRRRRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSA
90 100 110 120 130 140
210 220 230 240 250 260
pF1KE2 IIIGKFTVHWVFWVGPLMGALLASLIYNFVLFPDTKTLAQRLAILT-GTVEVGTGAGAGA
.: .:. ::.:::::..:. :: :::.:.: : .. :..:. . : : ::
CCDS55 VITHNFSNHWIFWVGPFIGGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDI
150 160 170 180 190 200
270 280
pF1KE2 EPLKKESQPGSGAVEMESV
CCDS55 NSRVEMKPK
210
282 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 17:03:33 2016 done: Tue Nov 8 17:03:33 2016
Total Scan time: 1.490 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]