FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9822, 312 aa
  1>>>pF1KB9822 312 - 312 aa - 312 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2390+/-0.000734; mu= 13.4843+/- 0.044
 mean_var=122.0526+/-24.920, 0's: 0 Z-trim(113.2): 13 B-trim: 0 in 0/51
 Lambda= 0.116092
 statistics sampled from 13894 (13904) to 13894 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.752), E-opt: 0.2 (0.427), width: 16
 Scan time: 2.660

The best scores are:                                       opt  bits E(32554)
CCDS5499.1 PURB gene_id:5814|Hs108|chr7            ( 312) 2117 364.9 4.4e-101
CCDS4220.1 PURA gene_id:5813|Hs108|chr5            ( 322)  753 136.5 2.7e-32
CCDS6081.1 PURG gene_id:29942|Hs108|chr8           ( 347)  469  89.0 5.8e-18
CCDS34878.1 PURG gene_id:29942|Hs108|chr8          ( 322)  349  68.8 6.2e-12

>>CCDS5499.1 PURB gene_id:5814|Hs108|chr7 (312 aa)
 initn: 2117 init1: 2117 opt: 2117 Z-score: 1928.2 bits: 364.9 E(32554): 4.4e-101
Smith-Waterman score: 2117; 100.0% identity (100.0% similar) in 312 aa overlap (1-312:1-312)

               10        20        30        40        50        60
pF1KB9 MADGDSGSERGGGGGPCGFQPASRGGGEQETQELASKRLDIQNKRFYLDVKQNAKGRFLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MADGDSGSERGGGGGPCGFQPASRGGGEQETQELASKRLDIQNKRFYLDVKQNAKGRFLK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 IAEVGAGGSKSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLAAGAEEGGGPRRALK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IAEVGAGGSKSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLAAGAEEGGGPRRALK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 SEFLVRENRKYYLDLKENQRGRFLRIRQTVNRGGGGFGAGPGPGGLQSGQTIALPAQGLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SEFLVRENRKYYLDLKENQRGRFLRIRQTVNRGGGGFGAGPGPGGLQSGQTIALPAQGLI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 EFRDALAKLIDDYGGEDDELAGGPGGGAGGPGGGLYGELPEGTSITVDSKRFFFDVGCNK
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 EFRDALAKLIDDYGGEDDELAGGPGGGAGGPGGGLYGELPEGTSITVDSKRFFFDVGCNK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 YGVFLRVSEVKPSYRNAITVPFKAWGKFGGAFCRYADEMKEIQERQRDKLYERRGGGSGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YGVFLRVSEVKPSYRNAITVPFKAWGKFGGAFCRYADEMKEIQERQRDKLYERRGGGSGG 250 260 270 280 290 300 310 pF1KB9 GEESEGEEVDED :::::::::::: CCDS54 GEESEGEEVDED 310 >>CCDS4220.1 PURA gene_id:5813|Hs108|chr5 (322 aa) initn: 1048 init1: 498 opt: 753 Z-score: 693.4 bits: 136.5 E(32554): 2.7e-32 Smith-Waterman score: 1270; 71.7% identity (83.6% similar) in 286 aa overlap (4-289:31-288) 10 20 30 pF1KB9 MADGDSGSERGGGGGPCGFQPASRGGGEQETQE : .:. ::::: : .. :: ..:::: CCDS42 MADRDSGSEQGGAALGSGGSLGHPGSGSGSGGGGGGGGGGGGSGGGGGGAPGGLQHETQE 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB9 LASKRLDIQNKRFYLDVKQNAKGRFLKIAEVGAGGSKSRLTLSMAVAAEFRDSLGDFIEH :::::.:::::::::::::::::::::::::::::.::::::::.::.:::: ::::::: CCDS42 LASKRVDIQNKRFYLDVKQNAKGRFLKIAEVGAGGNKSRLTLSMSVAVEFRDYLGDFIEH 70 80 90 100 110 120 100 110 120 130 140 150 pF1KB9 YAQLGPSSPEQLAAGAEEGGGPRRALKSEFLVRENRKYYLDLKENQRGRFLRIRQTVNRG :::::::.: .:: . .: ::::::::::::::::::.:::::::::::::::::::: CCDS42 YAQLGPSQPPDLAQAQDE---PRRALKSEFLVRENRKYYMDLKENQRGRFLRIRQTVNRG 130 140 150 160 170 160 170 180 190 200 210 pF1KB9 GGGFGAGPGPGGLQSGQTIALPAQGLIEFRDALAKLIDDYGGEDDELAGGPGGGAGGPGG :: :. : :::::::::::::::::::::::::: :.. :. 
CCDS42 -------PGLGSTQ-GQTIALPAQGLIEFRDALAKLIDDYGVEEE-----PA-------- 180 190 200 210 220 230 240 250 260 270 pF1KB9 GLYGELPEGTSITVDSKRFFFDVGCNKYGVFLRVSEVKPSYRNAITVPFKAWGKFGGAFC :::::::.:::.:::::::: ::::::.:::::::.:::.::::.:.:.::: .:: CCDS42 ----ELPEGTSLTVDNKRFFFDVGSNKYGVFMRVSEVKPTYRNSITVPYKVWAKFGHTFC 220 230 240 250 260 270 280 290 300 310 pF1KB9 RYADEMKEIQERQRDKLYERRGGGSGGGEESEGEEVDED .:..:::.:::.::.: CCDS42 KYSEEMKKIQEKQREKRAACEQLHQQQQQQQEETAAATLLLQGEEEGEED 280 290 300 310 320 >>CCDS6081.1 PURG gene_id:29942|Hs108|chr8 (347 aa) initn: 1024 init1: 338 opt: 469 Z-score: 435.9 bits: 89.0 E(32554): 5.8e-18 Smith-Waterman score: 944; 53.7% identity (71.8% similar) in 309 aa overlap (23-305:49-344) 10 20 30 40 50 pF1KB9 MADGDSGSERGGGGGPCGFQPASRGGGEQETQELASKRLDIQNKRFYLDVKQ ...:: : :::::::.:::.::::::::: CCDS60 NVGGSGLSKSRLYPQAQHSHYPHYAASATPNQAGGAAEIQELASKRVDIQKKRFYLDVKQ 20 30 40 50 60 70 60 70 80 90 100 pF1KB9 NAKGRFLKIAEVGAGGS------KSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLA ...::::::::: : . ::.::::..::::..: :::::::::.:: .. .: CCDS60 SSRGRFLKIAEVWIGRGRQDNIRKSKLTLSLSVAAELKDCLGDFIEHYAHLGLKGHRQEH 80 90 100 110 120 130 110 120 130 140 pF1KB9 AGAEEGGGPRR--------------------ALKSEFLVRENRKYYLDLKENQRGRFLRI . ..: :. :: .::.... :.::::::::::::::::::: CCDS60 GHSKEQGSRRRQKHSAPSPPVSVGSEEHPHSVLKTDYIERDNRKYYLDLKENQRGRFLRI 140 150 160 170 180 190 150 160 170 180 190 200 pF1KB9 RQTVNRGGGGFGAGPGPGGLQSGQTIALPAQGLIEFRDALAKLIDDYGGEDDELAGGPGG :::. :: : .: : . :::.:::::.:::::::..::.::: : : : CCDS60 RQTMMRGTGMIGYFGHSLGQE--QTIVLPAQGMIEFRDALVQLIEDYGEGDIEERRG--- 200 210 220 230 240 250 210 220 230 240 250 260 pF1KB9 GAGGPGGGLYGELPEGTSITVDSKRFFFDVGCNKYGVFLRVSEVKPSYRNAITVPFKAWG : : :::::::. ::.:::.:::: ::::.::.::::.: :::.:::::::: CCDS60 GDDDP-----LELPEGTSFRVDNKRFYFDVGSNKYGIFLKVSEVRPPYRNTITVPFKAWT 260 270 280 290 300 270 280 290 300 310 pF1KB9 KFGGAFCRYADEMKEIQERQRDKLYERRGGGSGGGEESEGEEVDED .:: : .: .::..: . 
...: : : ...:::.: CCDS60 RFGENFIKYEEEMRKICNSHKEK---RMDGRKASGEEQECLD 310 320 330 340 >>CCDS34878.1 PURG gene_id:29942|Hs108|chr8 (322 aa) initn: 855 init1: 205 opt: 349 Z-score: 327.7 bits: 68.8 E(32554): 6.2e-12 Smith-Waterman score: 761; 53.1% identity (70.4% similar) in 260 aa overlap (23-256:49-298) 10 20 30 40 50 pF1KB9 MADGDSGSERGGGGGPCGFQPASRGGGEQETQELASKRLDIQNKRFYLDVKQ ...:: : :::::::.:::.::::::::: CCDS34 NVGGSGLSKSRLYPQAQHSHYPHYAASATPNQAGGAAEIQELASKRVDIQKKRFYLDVKQ 20 30 40 50 60 70 60 70 80 90 100 pF1KB9 NAKGRFLKIAEVGAGGS------KSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLA ...::::::::: : . ::.::::..::::..: :::::::::.:: .. .: CCDS34 SSRGRFLKIAEVWIGRGRQDNIRKSKLTLSLSVAAELKDCLGDFIEHYAHLGLKGHRQEH 80 90 100 110 120 130 110 120 130 140 pF1KB9 AGAEEGGGPRR--------------------ALKSEFLVRENRKYYLDLKENQRGRFLRI . ..: :. :: .::.... :.::::::::::::::::::: CCDS34 GHSKEQGSRRRQKHSAPSPPVSVGSEEHPHSVLKTDYIERDNRKYYLDLKENQRGRFLRI 140 150 160 170 180 190 150 160 170 180 190 200 pF1KB9 RQTVNRGGGGFGAGPGPGGLQSGQTIALPAQGLIEFRDALAKLIDDYGGEDDELAGGPGG :::. :: : .: : . :::.:::::.:::::::..::.::: : : : CCDS34 RQTMMRGTGMIGYFGHSLGQE--QTIVLPAQGMIEFRDALVQLIEDYGEGDIEER---RG 200 210 220 230 240 250 210 220 230 240 250 260 pF1KB9 GAGGPGGGLYGELPEGTSITVDSKRFFFDVGCNKYGVFLRVSEVKPSYRNAITVPFKAWG : : :::::::. ::.:::.:::: ::::.::.... : .: CCDS34 GDDDPL-----ELPEGTSFRVDNKRFYFDVGSNKYGIFLKLTNYPKSRENINLFHCCQIK 260 270 280 290 300 270 280 290 300 310 pF1KB9 KFGGAFCRYADEMKEIQERQRDKLYERRGGGSGGGEESEGEEVDED CCDS34 HKEQPHDTTKTVEE 310 320 312 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 19:28:04 2016 done: Fri Nov 4 19:28:04 2016 Total Scan time: 2.660 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]