FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB3853, 212 aa
  1>>>pF1KB3853 212 - 212 aa - 212 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1838+/-0.000772; mu= 13.9024+/- 0.046
 mean_var=59.3033+/-12.279, 0's: 0 Z-trim(106.8): 28 B-trim: 0 in 0/49
 Lambda= 0.166546
 statistics sampled from 9203 (9223) to 9203 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.671), E-opt: 0.2 (0.283), width: 16
 Scan time: 2.030

The best scores are:                                      opt  bits E(32554)
CCDS82244.1 AQP4 gene_id:361|Hs108|chr18          ( 296)  878 219.1 2.2e-57
CCDS11889.1 AQP4 gene_id:361|Hs108|chr18          ( 323)  878 219.1 2.4e-57
CCDS58617.1 AQP4 gene_id:361|Hs108|chr18          ( 301)  709 178.5 3.8e-45
CCDS5431.1 AQP1 gene_id:358|Hs108|chr7            ( 269)  322  85.5 3.4e-17
CCDS8792.1 AQP2 gene_id:359|Hs108|chr12           ( 271)  321  85.3   4e-17
CCDS8793.1 AQP5 gene_id:362|Hs108|chr12           ( 265)  314  83.6 1.3e-16
CCDS8919.1 MIP gene_id:4284|Hs108|chr12           ( 263)  310  82.6 2.4e-16

>>CCDS82244.1 AQP4 gene_id:361|Hs108|chr18 (296 aa)
 initn: 876 init1: 876 opt: 878 Z-score: 1143.6 bits: 219.1 E(32554): 2.2e-57
Smith-Waterman score: 911; 65.6% identity (65.6% similar) in 244 aa overlap (1-160:1-244)

               10        20        30        40        50        60
pF1KB3 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB3 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA
               70        80        90       100       110       120

pF1KB3 AQCLG-------------------------------------------------------
       :::::
CCDS82 AQCLGAIIGAGILYLVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFASCDS
              130       140       150       160       170       180

                                      130
140 150 pF1KB3 -----------------------------PIIGAVLAGGLYEYVFCPDVEFKRRFKEAFS ::::::::::::::::::::::::::::::: CCDS82 KRTDVTGSIALAIGFSVAIGHLFAIYWVGPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFS 190 200 210 220 230 240 160 170 180 190 200 210 pF1KB3 KAAQQTKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEEKKGKDQSGEVLSSV :::: CCDS82 KAAQQTKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEEKKGKDQSGEVLSSV 250 260 270 280 290 >-- initn: 331 init1: 331 opt: 331 Z-score: 433.3 bits: 87.7 E(32554): 8.2e-18 Smith-Waterman score: 331; 100.0% identity (100.0% similar) in 52 aa overlap (161-212:245-296) 140 150 160 170 180 190 pF1KB3 VLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDDLILKPGVVHV :::::::::::::::::::::::::::::: CCDS82 VLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDDLILKPGVVHV 220 230 240 250 260 270 200 210 pF1KB3 IDVDRGEEKKGKDQSGEVLSSV :::::::::::::::::::::: CCDS82 IDVDRGEEKKGKDQSGEVLSSV 280 290 >>CCDS11889.1 AQP4 gene_id:361|Hs108|chr18 (323 aa) initn: 876 init1: 876 opt: 878 Z-score: 1143.0 bits: 219.1 E(32554): 2.4e-57 Smith-Waterman score: 878; 91.7% identity (93.1% similar) in 145 aa overlap (1-145:1-141) 10 20 30 40 50 60 pF1KB3 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 AQCLGPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDD ::::: :::: :. 
: :.: CCDS11 AQCLGAIIGA----GILYLVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFA 130 140 150 160 170 >-- initn: 594 init1: 575 opt: 588 Z-score: 766.4 bits: 149.5 E(32554): 2.3e-36 Smith-Waterman score: 588; 69.5% identity (83.0% similar) in 141 aa overlap (73-212:189-323) 50 60 70 80 90 100 pF1KB3 LAMLIFVLLSLGSTINWGGTEKPLPVDMVLISLCFGLSIATMVQCFG-HISGGHINPAVT :.: .:.:.: . . :. . .:. .::: . CCDS11 GLLVELIITFQLVFTIFASCDSKRTDVTGSIALAIGFSVA-IGHLFAINYTGASMNPARS 160 170 180 190 200 210 110 120 130 140 150 160 pF1KB3 VAMVCTRKISIAKSVFYIAAQCLGPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQ . . . .... ::::::::::::::::::::::::::::::::::::: CCDS11 FGPAVIMGNWENHWIYWV-----GPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQ 220 230 240 250 260 270 170 180 190 200 210 pF1KB3 TKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEEKKGKDQSGEVLSSV ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEEKKGKDQSGEVLSSV 280 290 300 310 320 >>CCDS58617.1 AQP4 gene_id:361|Hs108|chr18 (301 aa) initn: 707 init1: 707 opt: 709 Z-score: 924.0 bits: 178.5 E(32554): 3.8e-45 Smith-Waterman score: 709; 91.9% identity (92.7% similar) in 123 aa overlap (23-145:1-119) 10 20 30 40 50 60 pF1KB3 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG :::::::::::::::::::::::::::::::::::::: CCDS58 MVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG 10 20 30 70 80 90 100 110 120 pF1KB3 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB3 AQCLGPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDD ::::: :::: : :: : :.: CCDS58 AQCLGAIIGA---GILY-LVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFA 100 110 120 130 140 150 >-- initn: 594 init1: 575 opt: 588 Z-score: 766.9 bits: 149.4 E(32554): 2.1e-36 Smith-Waterman score: 588; 69.5% identity (83.0% similar) in 141 aa overlap (73-212:167-301) 50 60 70 80 90 100 pF1KB3 
LAMLIFVLLSLGSTINWGGTEKPLPVDMVLISLCFGLSIATMVQCFG-HISGGHINPAVT :.: .:.:.: . . :. . .:. .::: . CCDS58 GLLVELIITFQLVFTIFASCDSKRTDVTGSIALAIGFSVA-IGHLFAINYTGASMNPARS 140 150 160 170 180 190 110 120 130 140 150 160 pF1KB3 VAMVCTRKISIAKSVFYIAAQCLGPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQ . . . .... ::::::::::::::::::::::::::::::::::::: CCDS58 FGPAVIMGNWENHWIYWV-----GPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQ 200 210 220 230 240 250 170 180 190 200 210 pF1KB3 TKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEEKKGKDQSGEVLSSV ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 TKGSYMEVEDNRSQVETDDLILKPGVVHVIDVDRGEEKKGKDQSGEVLSSV 260 270 280 290 300 >>CCDS5431.1 AQP1 gene_id:358|Hs108|chr7 (269 aa) initn: 324 init1: 234 opt: 322 Z-score: 422.3 bits: 85.5 E(32554): 3.4e-17 Smith-Waterman score: 322; 48.6% identity (80.0% similar) in 105 aa overlap (34-134:10-114) 10 20 30 40 50 60 pF1KB3 RPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG--- ::.::.::::: .::..:.::.... CCDS54 MASEFKKKLFWRAVVAEFLATTLFVFISIGSALGFKYPV 10 20 30 70 80 90 100 110 120 pF1KB3 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA :... : : .:: :::::::..: :::::.:.:::::.... . .::: ....:: CCDS54 GNNQTAVQDNVKVSLAFGLSIATLAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYII 40 50 60 70 80 90 130 140 150 160 170 pF1KB3 AQCLGPIIG-AVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETD :::.: :.. :.:.: CCDS54 AQCVGAIVATAILSGITSSLTGNSLGRNDLADGVNSGQGLGIEIIGTLQLVLCVLATTDR 100 110 120 130 140 150 >>CCDS8792.1 AQP2 gene_id:359|Hs108|chr12 (271 aa) initn: 320 init1: 222 opt: 321 Z-score: 420.9 bits: 85.3 E(32554): 4e-17 Smith-Waterman score: 321; 52.0% identity (80.0% similar) in 100 aa overlap (33-132:8-103) 10 20 30 40 50 60 pF1KB3 DRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWGGT :: .:: ::::: :.::...:::..:: CCDS87 MWELRSIAFSRAVFAEFLATLLFVFFGLGSALNW--- 10 20 30 70 80 90 100 110 120 pF1KB3 EKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIAAQ . :: ... :.. :::.:.:.:: .:::::.::::::::: . ..:. 
...::.::: CCDS87 PQALP-SVLQIAMAFGLGIGTLVQALGHISGAHINPAVTVACLVGCHVSVLRAAFYVAAQ 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB3 CLGPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDDLI :: . ::.: CCDS87 LLGAVAGAALLHEITPADIRGDLAVNALSNSTTAGQAVTVELFLTLQLVLCIFASTDERR 100 110 120 130 140 150 >>CCDS8793.1 AQP5 gene_id:362|Hs108|chr12 (265 aa) initn: 396 init1: 213 opt: 314 Z-score: 412.0 bits: 83.6 E(32554): 1.3e-16 Smith-Waterman score: 314; 49.5% identity (77.1% similar) in 109 aa overlap (27-134:3-107) 10 20 30 40 50 60 pF1KB3 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG : : . :: ::: ::::: ::::...:::...: CCDS87 MKKEVCSVAFLKAVFAEFLATLIFVFFGLGSALKWP 10 20 30 70 80 90 100 110 120 pF1KB3 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA .. ::. .. :.: :::.:.:..: .: .::::::::.:.:.. .::. .. ::.: CCDS87 SA---LPT-ILQIALAFGLAIGTLAQALGPVSGGHINPAITLALLVGNQISLLRAFFYVA 40 50 60 70 80 90 130 140 150 160 170 pF1KB3 AQCLGPIIGA-VLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETD :: .: : :: .: : CCDS87 AQLVGAIAGAGILYGVAPLNARGNLAVNALNNNTTQGQAMVVELILTFQLALCIFASTDS 100 110 120 130 140 150 >>CCDS8919.1 MIP gene_id:4284|Hs108|chr12 (263 aa) initn: 289 init1: 194 opt: 310 Z-score: 406.8 bits: 82.6 E(32554): 2.4e-16 Smith-Waterman score: 310; 42.2% identity (76.5% similar) in 102 aa overlap (31-132:6-103) 10 20 30 40 50 60 pF1KB3 MSDRPTARRWGKCGPLCTRENIMVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWG . .::.:. :::.: :..:...:::.. :. CCDS89 MWELRSASFWRAIFAEFFATLFYVFFGLGSSLRWA 10 20 30 70 80 90 100 110 120 pF1KB3 GTEKPLPVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIA : :. .. ... :::..::.:: :::::.:.::::: :.. ..:. .. :.: CCDS89 ----PGPLHVLQVAMAFGLALATLVQSVGHISGAHVNPAVTFAFLVGSQMSLLRAFCYMA 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB3 AQCLGPIIGAVLAGGLYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQVETDD :: :: . ::.. 
CCDS89 AQLLGAVAGAAVLYSVTPPAVRGNLALNTLHPAVSVGQATTVEIFLTLQFVLCIFATYDE
             100       110       120       130       140       150


212 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 09:05:47 2016 done: Sat Nov 5 09:05:48 2016
 Total Scan time: 2.030 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]