FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2370, 173 aa
  1>>>pF1KE2370 173 - 173 aa - 173 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5686+/-0.00069; mu= 10.7540+/- 0.041
 mean_var=56.5042+/-11.458, 0's: 0 Z-trim(108.8): 14 B-trim: 0 in 0/50
 Lambda= 0.170622
 statistics sampled from 10468 (10480) to 10468 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.723), E-opt: 0.2 (0.322), width: 16
 Scan time: 1.900

The best scores are:                                       opt bits E(32554)
CCDS13695.1 CRYAA gene_id:1409|Hs108|chr21         ( 173) 1178 297.6 2.5e-81
CCDS82651.1 LOC102724652 gene_id:102724652|Hs108|c ( 173) 1178 297.6 2.5e-81
CCDS82652.1 LOC102724652 gene_id:102724652|Hs108|c ( 136)  745 191.0 2.4e-49
CCDS8351.1 CRYAB gene_id:1410|Hs108|chr11          ( 175)  657 169.4   1e-42
CCDS12475.1 HSPB6 gene_id:126393|Hs108|chr19       ( 160)  437 115.2 1.9e-26
CCDS81626.1 CRYAB gene_id:1410|Hs108|chr11         ( 108)  431 113.7 3.6e-26
CCDS5583.1 HSPB1 gene_id:3315|Hs108|chr7           ( 205)  360  96.3 1.2e-20
CCDS8352.1 HSPB2 gene_id:3316|Hs108|chr11          ( 182)  324  87.4 5.1e-18

>>CCDS13695.1 CRYAA gene_id:1409|Hs108|chr21 (173 aa)
 initn: 1178 init1: 1178 opt: 1178 Z-score: 1573.5 bits: 297.6 E(32554): 2.5e-81
Smith-Waterman score: 1178; 100.0% identity (100.0% similar) in 173 aa overlap (1-173:1-173)

               10        20        30        40        50        60
pF1KE2 MDVTIQHPWFKRTLGPFYPSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQSLFRTVLDSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MDVTIQHPWFKRTLGPFYPSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQSLFRTVLDSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 ISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 ISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRL
               70        80        90       100       110       120

              130       140       150       160       170
pF1KE2 PSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKPTSAPSS
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 PSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKPTSAPSS
              130       140       150       160       170

>>CCDS82651.1 LOC102724652 gene_id:102724652|Hs108|chr21 (173 aa)
 initn: 1178 init1: 1178 opt: 1178 Z-score: 1573.5 bits: 297.6 E(32554): 2.5e-81
Smith-Waterman score: 1178; 100.0% identity (100.0% similar) in 173 aa overlap (1-173:1-173)

               10        20        30        40        50        60
pF1KE2 MDVTIQHPWFKRTLGPFYPSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQSLFRTVLDSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 MDVTIQHPWFKRTLGPFYPSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQSLFRTVLDSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 ISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 ISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRL
               70        80        90       100       110       120

              130       140       150       160       170
pF1KE2 PSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKPTSAPSS
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 PSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKPTSAPSS
              130       140       150       160       170

>>CCDS82652.1 LOC102724652 gene_id:102724652|Hs108|chr21 (136 aa)
 initn: 745 init1: 745 opt: 745 Z-score: 999.2 bits: 191.0 E(32554): 2.4e-49
Smith-Waterman score: 745; 99.1% identity (100.0% similar) in 111 aa overlap (63-173:26-136)

             40        50        60        70        80        90
pF1KE2 EYDLLPFLSSTISPYYRQSLFRTVLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDD
                                     .:::::::::::::::::::::::::::::
CCDS82      MPVCPGDSHRPPKALPHLVCGRRGRQVRSDRDKFVIFLDVKHFSPEDLTVKVQDD
                    10        20        30        40        50

            100       110       120       130       140       150
pF1KE2 FVEIHGKHNERQDDHGYISREFHRRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 FVEIHGKHNERQDDHGYISREFHRRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDA
          60        70        80        90       100       110

            160       170
pF1KE2 THAERAIPVSREEKPTSAPSS
       :::::::::::::::::::::
CCDS82 THAERAIPVSREEKPTSAPSS
         120       130

>>CCDS8351.1 CRYAB gene_id:1410|Hs108|chr11 (175 aa)
 initn: 504 init1: 406 opt: 657 Z-score: 880.3 bits: 169.4 E(32554): 1e-42
Smith-Waterman score: 657; 54.5% identity (80.9% similar) in 178 aa overlap (1-171:1-173)

               10         20        30        40        50
pF1KE2 MDVTIQHPWFKRTLGPFY-PSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQ--SLFR--T
       ::..:.:::..: . ::. ::::::::::: :.: ::.:  :...::.: .  :..:  .
CCDS83 MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFP-TSTSLSPFYLRPPSFLRAPS
               10        20        30         40        50

          60        70        80        90       100       110
pF1KE2 VLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFH
        .:.:.::.: ..:.: . :::::::::.: :::  : .:.::::.::::.::.::::::
CCDS83 WFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFH
      60        70        80        90       100       110

         120       130       140       150       160         170
pF1KE2 RRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKP--TSAPSS
       :.::.:..::  ... :::.::.::  ::. :.    .  ::.::..:::::  :.::
CCDS83 RKYRIPADVDPLTITSSLSSDGVLTVNGPRKQV----SGPERTIPITREEKPAVTAAPKK
     120       130       140       150           160       170

>>CCDS12475.1 HSPB6 gene_id:126393|Hs108|chr19 (160 aa)
 initn: 395 init1: 289 opt: 437 Z-score: 588.3 bits: 115.2 E(32554): 1.9e-26
Smith-Waterman score: 437; 44.4% identity (73.6% similar) in 144 aa overlap (1-140:3-143)

                 10            20        30        40        50
pF1KE2   MDVTIQHPWFKRTLGPF----YPSRLFDQFFGEGLFEYDLLPFLSSTISPYYRQSLFR
         . : .:  :..:. .:.     :.::::: :::::.: .:  .  .:..:::   :
CCDS12 MEIPVPVQPSWLRRASAPLPGLSAPGRLFDQRFGEGLLEAELAALCPTTLAPYY---LRA
               10        20        30        40        50

           60        70        80        90       100       110
pF1KE2 TVLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREF
         .   ...: .:  .: ..:::::::::...:::  . ::.:..:.:: :.::...:::
CCDS12 PSVALPVAQVPTDPGHFSVLLDVKHFSPEEIAVKVVGEHVEVHARHEERPDEHGFVAREF
        60        70        80        90       100       110

          120       130       140       150       160       170
pF1KE2 HRRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIPVSREEKPTSAPSS
       ::::::: .:: .:.. .:: .:.:.
CCDS12 HRRYRLPPGVDPAAVTSALSPEGVLSIQAAPASAQAPPPAAAK
       120       130       140       150       160

>>CCDS81626.1 CRYAB gene_id:1410|Hs108|chr11 (108 aa)
 initn: 402 init1: 347 opt: 431 Z-score: 583.2 bits: 113.7 E(32554): 3.6e-26
Smith-Waterman score: 431; 56.4% identity (80.9% similar) in 110 aa overlap (64-171:1-106)

            40        50        60        70        80        90
pF1KE2 YDLLPFLSSTISPYYRQSLFRTVLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDF
                                     .: ..:.: . :::::::::.: :::  :
CCDS81                               MRLEKDRFSVNLDVKHFSPEELKVKVLGDV
                                             10        20        30

           100       110       120       130       140       150
pF1KE2 VEIHGKHNERQDDHGYISREFHRRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDAT
       .:.::::.::::.::.:::::::.::.:..::  ... :::.::.::  ::. :.    .
CCDS81 IEVHGKHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQV----S
               40        50        60        70        80

           160         170
pF1KE2 HAERAIPVSREEKP--TSAPSS
         ::.::..:::::  :.::
CCDS81 GPERTIPITREEKPAVTAAPKK
         90       100

>>CCDS5583.1 HSPB1 gene_id:3315|Hs108|chr7 (205 aa)
 initn: 382 init1: 327 opt: 360 Z-score: 484.0 bits: 96.3 E(32554): 1.2e-20
Smith-Waterman score: 364; 41.4% identity (60.9% similar) in 174 aa overlap (16-166:18-188)

                 10            20        30        40
pF1KE2   MDVTIQHPWFKRTLGPF---YP-SRLFDQFFGEGLFEYDLLPFLSST-----------
                        ::   :: :::::: ::   .  .   .:...
CCDS55 MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQWLGGSSWPGYVRPLPP
               10        20        30        40        50        60

                    50        60        70        80        90
pF1KE2 --------ISPYYRQSLFRTVLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVE
                .: : ..: :  :.::.::.:   :.. . :::.::.:..::::..:  ::
CCDS55 AAIESPAVAAPAYSRALSRQ-LSSGVSEIRHTADRWRVSLDVNHFAPDELTVKTKDGVVE
               70        80         90       100       110

         100       110       120       130       140       150
pF1KE2 IHGKHNERQDDHGYISREFHRRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHA
       : :::.::::.:::::: : :.: :: .:: . .: ::: .: ::  .:  .  : .
CCDS55 ITGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPEGTLTVEAPMPK--LATQSN
     120       130       140       150       160       170

         160       170
pF1KE2 ERAIPVSREEKPTSAPSS
       : .:::. : .
CCDS55 EITIPVTFESRAQLGGPEAAKSDETAAK
       180       190       200

>>CCDS8352.1 HSPB2 gene_id:3316|Hs108|chr11 (182 aa)
 initn: 336 init1: 279 opt: 324 Z-score: 437.0 bits: 87.4 E(32554): 5.1e-18
Smith-Waterman score: 324; 38.0% identity (62.0% similar) in 158 aa overlap (19-170:21-178)

                 10        20        30         40        50
pF1KE2   MDVTIQHPWFKRTLGPFYPSRLFDQFFGEGLFEYDLL-PFLSSTISPYYRQSLFRTVL
                           :::: .: :::::.  ..: : :        : .
CCDS83 MSGRSVPHAHPATAEYEFANPSRLGEQRFGEGLLPEEILTPTLYHGYYVRPRAAPAGEGS
               10        20        30        40        50        60

        60        70        80        90       100       110
pF1KE2 DSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNERQDDHGYISREFHRR
        .: ::.: .. ::  ::::.::.:...::.. :...:. ..: .: : ::..:::: :
CCDS83 RAGASELRLSEGKFQAFLDVSHFTPDEVTVRTVDNLLEVSARHPQRLDRHGFVSREFCRT
               70        80        90       100       110       120

       120       130       140       150            160       170
pF1KE2 YRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAI-----PVSREEKPTSAPS
       : ::..::   .  .:: ::.:.. .:.    ::.   :  :     : . ::.  .:
CCDS83 YVLPADVDPWRVRAALSHDGILNLEAPRGGRHLDTEVNEVYISLLPAPPDPEEEEEAAIV
              130       140       150       160       170       180

pF1KE2 S

CCDS83 EP

173 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 12:58:05 2016 done: Sun Nov 6 12:58:05 2016
 Total Scan time: 1.900 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]