FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0447, 175 aa
  1>>>pF1KE0447 175 - 175 aa - 175 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.0697+/-0.000621; mu= 14.4142+/- 0.038
 mean_var=65.0184+/-13.199, 0's: 0 Z-trim(111.2): 14 B-trim: 0 in 0/51
 Lambda= 0.159058
 statistics sampled from 12191 (12205) to 12191 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.762), E-opt: 0.2 (0.375), width: 16
 Scan time: 1.970

The best scores are:                                       opt  bits E(32554)
CCDS8351.1 CRYAB gene_id:1410|Hs108|chr11          ( 175) 1196 282.3 1.1e-76
CCDS81626.1 CRYAB gene_id:1410|Hs108|chr11         ( 108)  708 170.2 3.7e-43
CCDS82651.1 LOC102724652 gene_id:102724652|Hs108|c ( 173)  657 158.6 1.8e-39
CCDS13695.1 CRYAA gene_id:1409|Hs108|chr21         ( 173)  657 158.6 1.8e-39
CCDS12475.1 HSPB6 gene_id:126393|Hs108|chr19       ( 160)  480 118.0 2.8e-27
CCDS82652.1 LOC102724652 gene_id:102724652|Hs108|c ( 136)  434 107.4 3.7e-24
CCDS5583.1 HSPB1 gene_id:3315|Hs108|chr7           ( 205)  384  96.0 1.5e-20
CCDS8352.1 HSPB2 gene_id:3316|Hs108|chr11          ( 182)  381  95.3 2.2e-20
CCDS9189.1 HSPB8 gene_id:26353|Hs108|chr12         ( 196)  245  64.1 5.7e-11

>>CCDS8351.1 CRYAB gene_id:1410|Hs108|chr11 (175 aa)
 initn: 1196 init1: 1196 opt: 1196 Z-score: 1490.5 bits: 282.3 E(32554): 1.1e-76
Smith-Waterman score: 1196; 100.0% identity (100.0% similar) in 175 aa overlap (1-175:1-175)

               10        20        30        40        50        60
pF1KE0 MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFPTSTSLSPFYLRPPSFLRAPSW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFPTSTSLSPFYLRPPSFLRAPSW
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFHR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 FDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFHR
               70        80        90       100       110       120

              130       140       150       160       170
pF1KE0 KYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIPITREEKPAVTAAPKK
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 KYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIPITREEKPAVTAAPKK
              130       140       150       160       170

>>CCDS81626.1 CRYAB gene_id:1410|Hs108|chr11 (108 aa)
 initn: 708 init1: 708 opt: 708 Z-score: 888.4 bits: 170.2 E(32554): 3.7e-43
Smith-Waterman score: 708; 100.0% identity (100.0% similar) in 108 aa overlap (68-175:1-108)

        40        50        60        70        80        90
pF1KE0 FPTSTSLSPFYLRPPSFLRAPSWFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDV
                                     ::::::::::::::::::::::::::::::
CCDS81                               MRLEKDRFSVNLDVKHFSPEELKVKVLGDV
                                             10        20        30

       100       110       120       130       140       150
pF1KE0 IEVHGKHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 IEVHGKHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPER
               40        50        60        70        80        90

       160       170
pF1KE0 TIPITREEKPAVTAAPKK
       ::::::::::::::::::
CCDS81 TIPITREEKPAVTAAPKK
              100

>>CCDS82651.1 LOC102724652 gene_id:102724652|Hs108|chr21 (173 aa)
 initn: 504 init1: 406 opt: 657 Z-score: 822.2 bits: 158.6 E(32554): 1.8e-39
Smith-Waterman score: 657; 54.5% identity (80.9% similar) in 178 aa overlap (1-173:1-171)

               10        20        30         40        50
pF1KE0 MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFP-TSTSLSPFYLRPPSFLRAPS
       ::..:.:::..: . ::. ::::::::::: :.: ::.:  :...::.: .  :..:  .
CCDS82 MDVTIQHPWFKRTLGPFY-PSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQ--SLFR--T
               10         20        30        40        50

      60        70        80        90       100       110
pF1KE0 WFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFH
        .:.:.::.: ..:.: . :::::::::.: :::  : .:.::::.::::.::.::::::
CCDS82 VLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFH
          60        70        80        90       100       110

     120       130       140       150           160       170
pF1KE0 RKYRIPADVDPLTITSSLSSDGVLTVNGPRKQV----SGPERTIPITREEKPAVTAAPKK
       :.::.:..::  ... :::.::.::  ::. :.    .  ::.::..:::::  :.::
CCDS82 RRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKP--TSAPSS
         120       130       140       150       160         170

>>CCDS13695.1 CRYAA gene_id:1409|Hs108|chr21 (173 aa)
 initn: 504 init1: 406 opt: 657 Z-score: 822.2 bits: 158.6 E(32554): 1.8e-39
Smith-Waterman score: 657; 54.5% identity (80.9% similar) in 178 aa overlap (1-173:1-171)

               10        20        30         40        50
pF1KE0 MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFP-TSTSLSPFYLRPPSFLRAPS
       ::..:.:::..: . ::. ::::::::::: :.: ::.:  :...::.: .  :..:  .
CCDS13 MDVTIQHPWFKRTLGPFY-PSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQ--SLFR--T
               10         20        30        40        50

      60        70        80        90       100       110
pF1KE0 WFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFH
        .:.:.::.: ..:.: . :::::::::.: :::  : .:.::::.::::.::.::::::
CCDS13 VLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFH
          60        70        80        90       100       110

     120       130       140       150           160       170
pF1KE0 RKYRIPADVDPLTITSSLSSDGVLTVNGPRKQV----SGPERTIPITREEKPAVTAAPKK
       :.::.:..::  ... :::.::.::  ::. :.    .  ::.::..:::::  :.::
CCDS13 RRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKP--TSAPSS
         120       130       140       150       160         170

>>CCDS12475.1 HSPB6 gene_id:126393|Hs108|chr19 (160 aa)
 initn: 478 init1: 345 opt: 480 Z-score: 603.1 bits: 118.0 E(32554): 2.8e-27
Smith-Waterman score: 480; 46.5% identity (73.6% similar) in 159 aa overlap (1-155:3-154)

                 10           20        30        40         50
pF1KE0   MDIAIHHPWIRR---PFFPFHSPSRLFDQFFGEHLLESDLFPTS-TSLSPFYLRPPSF
         . . ..  :.::   :.  . .:.::::: ::: :::..:     :.:.:.::: ::
CCDS12 MEIPVPVQPSWLRRASAPLPGLSAPGRLFDQRFGEGLLEAELAALCPTTLAPYYLRAPS-
               10        20        30        40        50

           60        70        80        90       100       110
pF1KE0 LRAPSWFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFI
       .  :      ....  .  .::: ::::::::::. :::.:. .:::..:::: :::::.
CCDS12 VALP------VAQVPTDPGHFSVLLDVKHFSPEEIAVKVVGEHVEVHARHEERPDEHGFV
      60              70        80        90       100       110

          120       130       140       150       160       170
pF1KE0 SREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIPITREEKPAVTAAPK
       .:::::.::.:  ::: ..::.:: .:::....   ....:
CCDS12 AREFHRRYRLPPGVDPAAVTSALSPEGVLSIQAAPASAQAPPPAAAK
           120       130       140       150       160

pF1KE0 K

>>CCDS82652.1 LOC102724652 gene_id:102724652|Hs108|chr21 (136 aa)
 initn: 417 init1: 347 opt: 434 Z-score: 547.1 bits: 107.4 E(32554): 3.7e-24
Smith-Waterman score: 434; 52.3% identity (75.0% similar) in 128 aa overlap (50-173:10-134)

      20        30        40        50        60        70
pF1KE0 PSRLFDQFFGEHLLESDLFPTSTSLSPFYLRPPSFLRAPSWFDTGLSEMRLEKDRFSVNL
                                     :::. :        :  ..: ..:.: . :
CCDS82                      MPVCPGDSHRPPKALPHLVCGRRG-RQVRSDRDKFVIFL
                                    10        20         30

      80        90       100       110       120       130
pF1KE0 DVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSS
       ::::::::.: :::  : .:.::::.::::.::.:::::::.::.:..::  ... :::.
CCDS82 DVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRLPSNVDQSALSCSLSA
       40        50        60        70        80        90

     140       150           160       170
pF1KE0 DGVLTVNGPRKQV----SGPERTIPITREEKPAVTAAPKK
       ::.::  ::. :.    .  ::.::..:::::  :.::
CCDS82 DGMLTFCGPKIQTGLDATHAERAIPVSREEKP--TSAPSS
      100       110       120       130

>>CCDS5583.1 HSPB1 gene_id:3315|Hs108|chr7 (205 aa)
 initn: 444 init1: 349 opt: 384 Z-score: 482.5 bits: 96.0 E(32554): 1.5e-20
Smith-Waterman score: 417; 44.4% identity (64.3% similar) in 171 aa overlap (13-166:18-188)

                    10        20        30         40        50
pF1KE0      MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFP-TSTSLSPFYLRP---
                        ::  ..  :::::: ::   :  .     . :  : :.::
CCDS55 MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQWLGGSSWPGYVRPLPP
               10        20        30        40        50        60

                         60        70        80        90       100
pF1KE0 ----------PSFLRAPS-WFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEV
                 :.. :: :  ...:.::.:   ::. :.:::.::.:.:: ::.   :.:.
CCDS55 AAIESPAVAAPAYSRALSRQLSSGVSEIRHTADRWRVSLDVNHFAPDELTVKTKDGVVEI
               70        80        90       100       110       120

              110       120       130       140       150
pF1KE0 HGKHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVS--GPERT
        :::::::::::.::: : ::: .:  :::  ..:::: .:.:::..:  ...  . : :
CCDS55 TGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPEGTLTVEAPMPKLATQSNEIT
              130       140       150       160       170       180

      160       170
pF1KE0 IPITREEKPAVTAAPKK
       ::.: : .
CCDS55 IPVTFESRAQLGGPEAAKSDETAAK
              190       200

>>CCDS8352.1 HSPB2 gene_id:3316|Hs108|chr11 (182 aa)
 initn: 358 init1: 310 opt: 381 Z-score: 479.6 bits: 95.3 E(32554): 2.2e-20
Smith-Waterman score: 381; 43.1% identity (72.3% similar) in 137 aa overlap (15-149:16-148)

                10        20        30        40        50
pF1KE0  MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFPTSTSLSPFYLRPPSFLRAPS
                      . : .:::: .: ::: ::  ... : :    .:.:: .   ::.
CCDS83 MSGRSVPHAHPATAEYEFANPSRLGEQRFGEGLLPEEIL-TPTLYHGYYVRPRA---APA
               10        20        30         40        50

      60          70        80        90       100       110
pF1KE0 WFDT--GLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISRE
          .  : ::.:: . .:.. :::.::.:.:. :... ...:: ..: .: :.:::.:::
CCDS83 GEGSRAGASELRLSEGKFQAFLDVSHFTPDEVTVRTVDNLLEVSARHPQRLDRHGFVSRE
         60        70        80        90       100       110

       120       130       140       150       160       170
pF1KE0 FHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIPITREEKPAVTAAPKK
       : : : .::::::  . ..:: ::.:....::
CCDS83 FCRTYVLPADVDPWRVRAALSHDGILNLEAPRGGRHLDTEVNEVYISLLPAPPDPEEEEE
        120       130       140       150       160       170

CCDS83 AAIVEP
        180

>>CCDS9189.1 HSPB8 gene_id:26353|Hs108|chr12 (196 aa)
 initn: 273 init1: 224 opt: 245 Z-score: 310.4 bits: 64.1 E(32554): 5.7e-11
Smith-Waterman score: 284; 35.2% identity (61.2% similar) in 165 aa overlap (1-149:6-170)

                     10         20        30                40
pF1KE0      MDIAIHHP-WIRR-PFFPFHSPSRLFDQFFGEHLLESDL--------FPTSTSLS
            : .. :.:  .:: ::      :::.:. ::   . .::        .:  .:
CCDS91 MADGQMPFSCHYPSRLRRDPFRDSPLSSRLLDDGFGMDPFPDDLTASWPDWALPRLSSAW
               10        20        30        40        50        60

          50        60              70        80        90
pF1KE0 PFYLRPPSFLRAPSW---FDT---GLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIE
       :  ::     :.:.    : .   . .   .  . ..: ..:. :.:::: ::.    .:
CCDS91 PGTLRSGMVPRGPTATARFGVPAEGRTPPPFPGEPWKVCVNVHSFKPEELMVKTKDGYVE
               70        80        90       100       110       120

     100       110       120       130       140       150
pF1KE0 VHGKHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTI
       : :::::.:.: :..:..: .: ..::.:::.:. .::: .:.: ...:.
CCDS91 VSGKHEEKQQEGGIVSKNFTKKIQLPAEVDPVTVFASLSPEGLLIIEAPQVPPYSTFGES
              130       140       150       160       170       180

     160       170
pF1KE0 PITREEKPAVTAAPKK

CCDS91 SFNNELPQDSQEVTCT
              190

175 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 08:20:10 2016 done: Thu Nov 3 08:20:10 2016
 Total Scan time: 1.970 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]