FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9477, 333 aa
  1>>>pF1KB9477 333 - 333 aa - 333 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.6197+/-0.000792; mu= 8.0192+/- 0.048
 mean_var=169.5051+/-33.746, 0's: 0 Z-trim(114.8): 9 B-trim: 0 in 0/50
 Lambda= 0.098511
 statistics sampled from 15321 (15327) to 15321 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.471), width: 16
 Scan time: 3.150

The best scores are:                                       opt bits E(32554)
CCDS12310.1 GIPC1 gene_id:10755|Hs108|chr19        ( 333) 2231 328.3 5.3e-90
CCDS12311.1 GIPC1 gene_id:10755|Hs108|chr19        ( 236) 1553 231.8 4.1e-61
CCDS32871.1 GIPC3 gene_id:126326|Hs108|chr19       ( 312) 1288 194.3 1.1e-49
CCDS685.1 GIPC2 gene_id:54810|Hs108|chr1           ( 315) 1263 190.7 1.3e-48

>>CCDS12310.1 GIPC1 gene_id:10755|Hs108|chr19 (333 aa)
 initn: 2231 init1: 2231 opt: 2231 Z-score: 1729.3 bits: 328.3 E(32554): 5.3e-90
Smith-Waterman score: 2231; 100.0% identity (100.0% similar) in 333 aa overlap (1-333:1-333)

               10        20        30        40        50        60
pF1KB9 MPLGLGRRKKAPPLVENEEAEPGRGGLGVGEPGPLGGGGSGGPQMGLPPPPPALRPRLVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MPLGLGRRKKAPPLVENEEAEPGRGGLGVGEPGPLGGGGSGGPQMGLPPPPPALRPRLVF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 HTQLAHGSPTGRIEGFTNVKELYGKIAEAFRLPTAEVMFCTLNTHKVDMDKLLGGQIGLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 HTQLAHGSPTGRIEGFTNVKELYGKIAEAFRLPTAEVMFCTLNTHKVDMDKLLGGQIGLE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 DFIFAHVKGQRKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHLISVGDMIEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 DFIFAHVKGQRKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHLISVGDMIEA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 INGQSLLGCRHYEVARLLKELPRGRTFTLKLTEPRKAFDMISQRSAGGRPGSGPQLGTGR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 INGQSLLGCRHYEVARLLKELPRGRTFTLKLTEPRKAFDMISQRSAGGRPGSGPQLGTGR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 GTLRLRSRGPATVEDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 GTLRLRSRGPATVEDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDEL
              250       260       270       280       290       300

              310       320       330
pF1KB9 AEALDERLGDFAFPDEFVFDVWGAIGDAKVGRY
       :::::::::::::::::::::::::::::::::
CCDS12 AEALDERLGDFAFPDEFVFDVWGAIGDAKVGRY
              310       320       330

>>CCDS12311.1 GIPC1 gene_id:10755|Hs108|chr19 (236 aa)
 initn: 1553 init1: 1553 opt: 1553 Z-score: 1210.6 bits: 231.8 E(32554): 4.1e-61
Smith-Waterman score: 1553; 100.0% identity (100.0% similar) in 236 aa overlap (98-333:1-236)

        70        80        90       100       110       120
pF1KB9 SPTGRIEGFTNVKELYGKIAEAFRLPTAEVMFCTLNTHKVDMDKLLGGQIGLEDFIFAHV
                                     ::::::::::::::::::::::::::::::
CCDS12                               MFCTLNTHKVDMDKLLGGQIGLEDFIFAHV
                                             10        20        30

       130       140       150       160       170       180
pF1KB9 KGQRKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHLISVGDMIEAINGQSLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KGQRKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHLISVGDMIEAINGQSLL
               40        50        60        70        80        90

       190       200       210       220       230       240
pF1KB9 GCRHYEVARLLKELPRGRTFTLKLTEPRKAFDMISQRSAGGRPGSGPQLGTGRGTLRLRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 GCRHYEVARLLKELPRGRTFTLKLTEPRKAFDMISQRSAGGRPGSGPQLGTGRGTLRLRS
              100       110       120       130       140       150

       250       260       270       280       290       300
pF1KB9 RGPATVEDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDELAEALDER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 RGPATVEDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDELAEALDER
              160       170       180       190       200       210

       310       320       330
pF1KB9 LGDFAFPDEFVFDVWGAIGDAKVGRY
       ::::::::::::::::::::::::::
CCDS12 LGDFAFPDEFVFDVWGAIGDAKVGRY
              220       230

>>CCDS32871.1 GIPC3 gene_id:126326|Hs108|chr19 (312 aa)
 initn: 1306 init1: 1280 opt: 1288 Z-score: 1005.4 bits: 194.3 E(32554): 1.1e-49
Smith-Waterman score: 1288; 66.1% identity (84.9% similar) in 298 aa overlap (39-329:11-308)

       10        20        30        40        50               60
pF1KB9 KKAPPLVENEEAEPGRGGLGVGEPGPLGGGGSGGPQMGLPPP----PPAL---RPRLVFH
                                     :.  :. . :::    :::    ::::::.
CCDS32                     MEGAAAREARGTETPRASAPPPAPSEPPAAPRARPRLVFR
                                   10        20        30        40

              70        80        90       100       110       120
pF1KB9 TQLAHGSPTGRIEGFTNVKELYGKIAEAFRLPTAEVMFCTLNTHKVDMDKLLGGQIGLED
       ::::::::::.:::::::.:::.:::::: .  .:..:::::.:::::.:::::::::::
CCDS32 TQLAHGSPTGKIEGFTNVRELYAKIAEAFGIAPTEILFCTLNSHKVDMQKLLGGQIGLED
               50        60        70        80        90       100

             130       140       150       160       170       180
pF1KB9 FIFAHVKGQRKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHLISVGDMIEAI
       ::::::.:. ::::: :.:::::::::::::::::::::::::.:..:. . ::: ::::
CCDS32 FIFAHVRGETKEVEVTKTEDALGLTITDNGAGYAFIKRIKEGSIINRIEAVCVGDSIEAI
              110       120       130       140       150       160

             190       200       210       220       230       240
pF1KB9 NGQSLLGCRHYEVARLLKELPRGRTFTLKLTEPRKAFDMISQRSAGGRPGSGPQLGTGRG
       : .:..::::::::..:.:::... :::.:..:..:::::.::: ...     .. .::
CCDS32 NDHSIVGCRHYEVAKMLRELPKSQPFTLRLVQPKRAFDMIGQRSRSSKCPVEAKVTSGRE
              170       180       190       200       210       220

             250       260       270       280       290       300
pF1KB9 TLRLRSRGPATVEDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDELA
       :::::: : ::::. :: :::.: .:::::::::::::: :::.:::: .:   . .:.:
CCDS32 TLRLRSGGAATVEEAPSEFEEEASRKVDDLLESYMGIRDPELASTMVETSKKTASAQEFA
              230       240       250       260       270       280

             310       320       330
pF1KB9 EALDERLGDFAFPDEFVFDVWGAIGDAKVGRY
       . ::  ::.:::::::: .::.:::.:.
CCDS32 RCLDSVLGEFAFPDEFVVEVWAAIGEAREACG
              290       300       310

>>CCDS685.1 GIPC2 gene_id:54810|Hs108|chr1 (315 aa)
 initn: 856 init1: 764 opt: 1263 Z-score: 986.1 bits: 190.7 E(32554): 1.3e-48
Smith-Waterman score: 1275; 61.7% identity (81.5% similar) in 329 aa overlap (1-329:1-311)

               10        20        30        40        50        60
pF1KB9 MPLGLGRRKKAPPLVENEEAEPGRGGLGVGEPGPLGGGGSGGPQMGLPPPPPALRPRLVF
       ::: :  .:::    ...:.    .::  :::   :::. .. .       ::   ::::
CCDS68 MPLKLRGKKKA----KSKET----AGLVEGEPTGAGGGSLSASRA------PAR--RLVF
               10                20        30              40

               70        80        90       100       110       120
pF1KB9 HTQLAHGSPTGRIEGFTNVKELYGKIAEAFRLPTAEVMFCTLNTHKVDMDKLLGGQIGLE
       :.:::::: :::.:::....:::..:: ::..  .:...::::: :.::..:::::.:::
CCDS68 HAQLAHGSATGRVEGFSSIQELYAQIAGAFEISPSEILYCTLNTPKIDMERLLGGQLGLE
           50        60        70        80        90       100

              130       140       150       160       170       180
pF1KB9 DFIFAHVKGQRKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHLISVGDMIEA
       ::::::::: .:::.:.::::.:::::::::.:::::::::.:.::: .. : ::: ::.
CCDS68 DFIFAHVKGIEKEVNVYKSEDSLGLTITDNGVGYAFIKRIKDGGVIDSVKTICVGDHIES
          110       120       130       140       150       160

              190       200       210       220       230       240
pF1KB9 INGQSLLGCRHYEVARLLKELPRGRTFTLKLTEPRKAFDMISQRSAGGRPGSGPQLGTGR
       :::....: :::.::. :::: . . ::.:: ::.:::. :  :: .:. .:: ..: ::
CCDS68 INGENIVGWRHYDVAKKLKELKKEELFTMKLIEPKKAFE-IELRSKAGK-SSGEKIGCGR
          170       180       190       200        210        220

              250       260       270       280       290       300
pF1KB9 GTLRLRSRGPATVEDLPSAFEEKAIEKVDDLLESYMGIRDTELAATMVELGKDKRNPDEL
       .::::::.::::::..::  . :::::.::.:: :::::: .::.:: : :::: ::::.
CCDS68 ATLRLRSKGPATVEEMPSETKAKAIEKIDDVLELYMGIRDIDLATTMFEAGKDKVNPDEF
            230       240       250       260       270       280

              310       320       330
pF1KB9 AEALDERLGDFAFPDEFVFDVWGAIGDAKVGRY
       : :::: ::::::::::::::::.:::::
CCDS68 AVALDETLGDFAFPDEFVFDVWGVIGDAKRRGL
            290       300       310



333 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 23:46:09 2016 done: Thu Nov 3 23:46:10 2016
 Total Scan time: 3.150 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]