FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA0573, 406 aa
  1>>>pF1KSDA0573 406 - 406 aa - 406 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5411+/-0.000831; mu= 16.1736+/- 0.050
 mean_var=68.2532+/-13.510, 0's: 0 Z-trim(106.9): 42 B-trim: 0 in 0/52
 Lambda= 0.155243
 statistics sampled from 9191 (9231) to 9191 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.657), E-opt: 0.2 (0.284), width: 16
 Scan time: 2.290

The best scores are:                                      opt  bits E(32554)
CCDS35082.1 ERP44 gene_id:23071|Hs108|chr9         ( 406) 2753 625.5 2.7e-179
CCDS5893.1 PDIA4 gene_id:9601|Hs108|chr7           ( 645)  373  92.5 1.2e-18
CCDS11787.1 P4HB gene_id:5034|Hs108|chr17          ( 508)  329  82.6 8.9e-16

>>CCDS35082.1 ERP44 gene_id:23071|Hs108|chr9 (406 aa)
 initn: 2753 init1: 2753 opt: 2753 Z-score: 3332.3 bits: 625.5 E(32554): 2.7e-179
Smith-Waterman score: 2753; 100.0% identity (100.0% similar) in 406 aa overlap (1-406:1-406)

               10        20        30        40        50        60
pF1KSD MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLDTENIDEILNNADVALVNFYADWCRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLDTENIDEILNNADVALVNFYADWCRF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD SQMLHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLKLFRNGMMMK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 SQMLHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLKLFRNGMMMK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD REYRGQRSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNYRVFER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 REYRGQRSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNYRVFER
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD VANILHDDCAFLSAFGDVSKPERYSGDNIIYKPPGHSAPDMVYLGAMTNFDVTYNWIQDK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 VANILHDDCAFLSAFGDVSKPERYSGDNIIYKPPGHSAPDMVYLGAMTNFDVTYNWIQDK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD CVPLVREITFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 CVPLVREITFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHAD
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KSD CDKFRHPLLHIQKTPADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 CDKFRHPLLHIQKTPADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREF
              310       320       330       340       350       360

              370       380       390       400
pF1KSD HHGPDPTDTAPGEQAQDVASSPPESSFQKLAPSEYRYTLLRDRDEL
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 HHGPDPTDTAPGEQAQDVASSPPESSFQKLAPSEYRYTLLRDRDEL
              370       380       390       400

>>CCDS5893.1 PDIA4 gene_id:9601|Hs108|chr7 (645 aa)
 initn: 262 init1: 122 opt: 373 Z-score: 448.3 bits: 92.5 E(32554): 1.2e-18
Smith-Waterman score: 373; 24.5% identity (58.4% similar) in 363 aa overlap (22-368:172-522)

10 20 30 40 50 pF1KSD MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLDTENIDEILNNADVALV :. : .: . : ::.::..:.::. :: CCDS58 ILKKGQAVDYEGSRTQEEIVAKVREVSQPDWTPPPEVTLV--LTKENFDEVVNDADIILV 150 160 170 180 190 60 70 80 90 100 110 pF1KSD NFYADWCRFSQMLHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLK .::: :: . : : .:.:. .... : . .:.:: ..:.:.:. .: ::::: CCDS58 EFYAPWCGHCKKLAPEYEKAAKELSKRSP---PIPLAKVDATAETDLAKRFDVSGYPTLK 200 210 220 230 240 250 120 130 140 150 160 pF1KSD LFRNGMMMKREYRGQRSVKALADYIRQQKSDPIQEIRDLAEITTL--DRSKRNIIGYFEQ .::.: .: : : ...::. .:.. : .:: : .. . : . ::: :. CCDS58 IFRKGR--PYDYNGPREKYGIVDYMIEQSGPPSKEILTLKQVQEFLKDGDDVIIIGVFKG 260 270 280 290 300 310 170 180 190 200 210 220 pF1KSD KDSDNYRVFERVANILHDDCAFLSAFG-DVSKPERYS-GDNIIYKPPGHSA---P--DMV ... :. .. .:: :..: : .:. ...: . : :. ....: .. : :. CCDS58 ESDPAYQQYQDAANNLREDYKFHHTFSTEIAKFLKVSQGQLVVMQPEKFQSKYEPRSHMM 320 330 340 350 360 370 230 240 250 260 270 pF1KSD YLGAMTNFDVTYNWIQDKCVPLV--REITFENGEELTEEGLPFLILFHMKE---DTESLE . . :. .. ... .::: :... ..... :.. :......
. : .. CCDS58 DVQGSTQDSAIKDFVLKYALPLVGHRKVS-NDAKRYTRR--PLVVVYYSVDFSFDYRAAT 380 390 400 410 420 430 280 290 300 310 320 330 pF1KSD IFQNEVARQLISEKGTINFLHADCDKFRHPL--LHIQKTPADCPVIAIDSFRHMYVFGDF : . .. .. .: :: . . . : .... : . .: . ... CCDS58 QFWRSKVLEVAKDFPEYTFAIADEEDYAGEVKDLGLSESGEDVNAAILDESGKKFAME-- 440 450 460 470 480 340 350 360 370 380 390 pF1KSD KDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPGEQAQDVASSPPESSFQKLAPSEY . . :..:: ...:::. .. : : . CCDS58 PEEFDSDTLREFVTAFKKGKLKPVIKSQPVPKNNKGPVKVVVGKTFDSIVMDPKKDVLIE 490 500 510 520 530 540 400 pF1KSD RYTLLRDRDEL CCDS58 FYAPWCGHCKQLEPVYNSLAKKYKGQKGLVIAKMDATANDVPSDRYKVEGFPTIYFAPSG 550 560 570 580 590 600 >>CCDS11787.1 P4HB gene_id:5034|Hs108|chr17 (508 aa) initn: 190 init1: 122 opt: 329 Z-score: 396.7 bits: 82.6 E(32554): 8.9e-16 Smith-Waterman score: 340; 23.6% identity (54.9% similar) in 377 aa overlap (31-392:26-395) 10 20 30 40 50 60 pF1KSD MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLDTENIDEILNNADVALVNFYADWCRF . : :. : : ::.::: :: CCDS11 MLRRALLCLAVAALVRADAPEEEDHVLVLRKSNFAEALAAHKYLLVEFYAPWCGH 10 20 30 40 50 70 80 90 100 110 pF1KSD SQMLHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLKLFRNG-MMM . : : . .:. .: : ... .:.:: ..::.::.: . :::.:.:::: CCDS11 CKALAPEYAKAAGKLKAE---GSEIRLAKVDATEESDLAQQYGVRGYPTIKFFRNGDTAS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KSD KREYRGQRSVKALADYIRQQKSDPIQEIRDLAEITTL-DRSKRNIIGYFEQKDSDNYRVF .:: . : . ........ . . : : .: . :. .::.:.. .::. . : CCDS11 PKEYTAGREADDIVNWLKKRTGPAATTLPDGAAAESLVESSEVAVIGFFKDVESDSAKQF 120 130 140 150 160 170 180 190 200 210 220 230 pF1KSD ERVANILHDDCAF-LSAFGDVSKPERYSGDNII-YKPPGHSAPDMVYLGAMTNFDVTYNW ..:. . :: : ... .:: . . . :... .: .. . . : .:. .. .. CCDS11 LQAAEAI-DDIPFGITSNSDVFSKYQLDKDGVVLFKKFDEGRNN--FEGEVTKENLL-DF 180 190 200 210 220 240 250 260 270 280 290 pF1KSD IQDKCVPLVREITFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINF :. . .::: :.: ... .. . ::. . ... . . .. 
: :: : : CCDS11 IKHNQLPLVIEFTEQTAPKIFGGEIKTHILLFLPKSVSDYDGKLSNFKTAAESFKGKILF 230 240 250 260 270 280 300 310 320 330 340 350 pF1KSD LHADCDKFRHP--LLHIQKTPADCPVIAIDSFRH-MYVFGDFKDVLIPGKLKQFVFDLHS . : :. . : . .::.. . .... : . .. : .. .: . CCDS11 IFIDSDHTDNQRILEFFGLKKEECPAVRLITLEEEMTKYKPESEELTAERITEFCHRFLE 290 300 310 320 330 340 360 370 380 390 400 pF1KSD GKL--HREFHHGPDPTDTAP-----GEQAQDVASSPPESSFQKL-APSEYRYTLLRDRDE ::. : .. :. : : :.. .::: . .. : .. :: CCDS11 GKIKPHLMSQELPEDWDKQPVKVLVGKNFEDVAFDEKKNVFVEFYAPWCGHCKQLAPIWD 350 360 370 380 390 400 pF1KSD L CCDS11 KLGETYKDHENIVIAKMDSTANEVEAVKVHSFPTLKFFPASADRTVIDYNGERTLDGFKK 410 420 430 440 450 460

406 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 02:07:46 2016 done: Thu Nov 3 02:07:46 2016
 Total Scan time: 2.290 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]