FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3044, 624 aa 1>>>pF1KB3044 624 - 624 aa - 624 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.4288+/-0.00099; mu= 13.7027+/- 0.059 mean_var=87.9432+/-17.327, 0's: 0 Z-trim(105.7): 63 B-trim: 136 in 1/51 Lambda= 0.136764 statistics sampled from 8490 (8549) to 8490 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.628), E-opt: 0.2 (0.263), width: 16 Scan time: 3.120 The best scores are: opt bits E(32554) CCDS4028.1 COL4A3BP gene_id:10087|Hs108|chr5 ( 624) 4214 841.9 0 CCDS47235.1 COL4A3BP gene_id:10087|Hs108|chr5 ( 752) 4214 842.0 0 CCDS4029.1 COL4A3BP gene_id:10087|Hs108|chr5 ( 598) 2509 505.5 8.2e-143 >>CCDS4028.1 COL4A3BP gene_id:10087|Hs108|chr5 (624 aa) initn: 4214 init1: 4214 opt: 4214 Z-score: 4495.3 bits: 841.9 E(32554): 0 Smith-Waterman score: 4214; 100.0% identity (100.0% similar) in 624 aa overlap (1-624:1-624) 10 20 30 40 50 60 pF1KB3 MSDNQSWNSSGSEEDPETESGPPVERCGVLSKWTNYIHGWQDRWVVLKNNALSYYKSEDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 MSDNQSWNSSGSEEDPETESGPPVERCGVLSKWTNYIHGWQDRWVVLKNNALSYYKSEDE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 TEYGCRGSICLSKAVITPHDFDECRFDISVNDSVWYLRAQDPDHRQQWIDAIEQHKTESG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 TEYGCRGSICLSKAVITPHDFDECRFDISVNDSVWYLRAQDPDHRQQWIDAIEQHKTESG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 YGSESSLRRHGSMVSLVSGASGYSATSTSSFKKGHSLREKLAEMETFRDILCRQVDTLQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 YGSESSLRRHGSMVSLVSGASGYSATSTSSFKKGHSLREKLAEMETFRDILCRQVDTLQK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 YFDACADAVSKDELQRDKVVEDDEDDFPTTRSDGDFLHSTNGNKEKLFPHVTPKGINGID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 YFDACADAVSKDELQRDKVVEDDEDDFPTTRSDGDFLHSTNGNKEKLFPHVTPKGINGID 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 FKGEAITFKATTAGILATLSHCIELMVKREDSWQKRLDKETEKKRRTEEAYKNAMTELKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 FKGEAITFKATTAGILATLSHCIELMVKREDSWQKRLDKETEKKRRTEEAYKNAMTELKK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 KSHFGGPDYEEGPNSLINEEEFFDAVEAALDRQDKIEEQSQSEKVRLHWPTSLPSGDAFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 KSHFGGPDYEEGPNSLINEEEFFDAVEAALDRQDKIEEQSQSEKVRLHWPTSLPSGDAFS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 SVGTHRFVQKPYSRSSSMSSIDLVSASDDVHRFSSQVEEMVQNHMTYSLQDVGGDANWQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 SVGTHRFVQKPYSRSSSMSSIDLVSASDDVHRFSSQVEEMVQNHMTYSLQDVGGDANWQL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB3 VVEEGEMKVYRREVEENGIVLDPLKATHAVKGVTGHEVCNYFWNVDVRNDWETTIENFHV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 VVEEGEMKVYRREVEENGIVLDPLKATHAVKGVTGHEVCNYFWNVDVRNDWETTIENFHV 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB3 VETLADNAIIIYQTHKRVWPASQRDVLYLSVIRKIPALTENDPETWIVCNFSVDHDSAPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 VETLADNAIIIYQTHKRVWPASQRDVLYLSVIRKIPALTENDPETWIVCNFSVDHDSAPL 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB3 NNRCVRAKINVAMICQTLVSPPEGNQEISRDNILCKITYVANVNPGGWAPASVLRAVAKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 NNRCVRAKINVAMICQTLVSPPEGNQEISRDNILCKITYVANVNPGGWAPASVLRAVAKR 550 560 570 580 590 600 610 620 pF1KB3 EYPKFLKRFTSYVQEKTAGKPILF :::::::::::::::::::::::: CCDS40 EYPKFLKRFTSYVQEKTAGKPILF 610 620 >>CCDS47235.1 COL4A3BP gene_id:10087|Hs108|chr5 (752 aa) initn: 4214 init1: 4214 opt: 4214 Z-score: 4494.0 bits: 842.0 E(32554): 0 Smith-Waterman score: 4214; 100.0% identity (100.0% similar) in 624 aa overlap (1-624:129-752) 10 20 30 pF1KB3 MSDNQSWNSSGSEEDPETESGPPVERCGVL :::::::::::::::::::::::::::::: CCDS47 SPDPSPRGLGASSGAAEGAGAGLLLGCRASMSDNQSWNSSGSEEDPETESGPPVERCGVL 100 110 120 130 140 150 40 50 60 70 80 90 pF1KB3 SKWTNYIHGWQDRWVVLKNNALSYYKSEDETEYGCRGSICLSKAVITPHDFDECRFDISV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 SKWTNYIHGWQDRWVVLKNNALSYYKSEDETEYGCRGSICLSKAVITPHDFDECRFDISV 160 170 180 190 200 210 100 110 120 130 140 150 pF1KB3 NDSVWYLRAQDPDHRQQWIDAIEQHKTESGYGSESSLRRHGSMVSLVSGASGYSATSTSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 NDSVWYLRAQDPDHRQQWIDAIEQHKTESGYGSESSLRRHGSMVSLVSGASGYSATSTSS 220 230 240 250 260 270 160 170 180 190 200 210 pF1KB3 FKKGHSLREKLAEMETFRDILCRQVDTLQKYFDACADAVSKDELQRDKVVEDDEDDFPTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 FKKGHSLREKLAEMETFRDILCRQVDTLQKYFDACADAVSKDELQRDKVVEDDEDDFPTT 280 290 300 310 320 330 220 230 240 250 260 270 pF1KB3 RSDGDFLHSTNGNKEKLFPHVTPKGINGIDFKGEAITFKATTAGILATLSHCIELMVKRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 RSDGDFLHSTNGNKEKLFPHVTPKGINGIDFKGEAITFKATTAGILATLSHCIELMVKRE 340 350 360 370 380 390 280 290 300 310 320 330 pF1KB3 DSWQKRLDKETEKKRRTEEAYKNAMTELKKKSHFGGPDYEEGPNSLINEEEFFDAVEAAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 DSWQKRLDKETEKKRRTEEAYKNAMTELKKKSHFGGPDYEEGPNSLINEEEFFDAVEAAL 400 410 420 430 440 450 340 350 360 370 380 390 pF1KB3 DRQDKIEEQSQSEKVRLHWPTSLPSGDAFSSVGTHRFVQKPYSRSSSMSSIDLVSASDDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 DRQDKIEEQSQSEKVRLHWPTSLPSGDAFSSVGTHRFVQKPYSRSSSMSSIDLVSASDDV 460 470 480 490 500 510 400 410 420 430 440 450 pF1KB3 HRFSSQVEEMVQNHMTYSLQDVGGDANWQLVVEEGEMKVYRREVEENGIVLDPLKATHAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 HRFSSQVEEMVQNHMTYSLQDVGGDANWQLVVEEGEMKVYRREVEENGIVLDPLKATHAV 520 530 540 550 560 570 460 470 480 490 500 510 pF1KB3 KGVTGHEVCNYFWNVDVRNDWETTIENFHVVETLADNAIIIYQTHKRVWPASQRDVLYLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 KGVTGHEVCNYFWNVDVRNDWETTIENFHVVETLADNAIIIYQTHKRVWPASQRDVLYLS 580 590 600 610 620 630 520 530 540 550 560 570 pF1KB3 VIRKIPALTENDPETWIVCNFSVDHDSAPLNNRCVRAKINVAMICQTLVSPPEGNQEISR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VIRKIPALTENDPETWIVCNFSVDHDSAPLNNRCVRAKINVAMICQTLVSPPEGNQEISR 640 650 660 670 680 690 580 590 600 610 620 pF1KB3 DNILCKITYVANVNPGGWAPASVLRAVAKREYPKFLKRFTSYVQEKTAGKPILF :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 DNILCKITYVANVNPGGWAPASVLRAVAKREYPKFLKRFTSYVQEKTAGKPILF 700 710 720 730 740 750 >>CCDS4029.1 COL4A3BP gene_id:10087|Hs108|chr5 (598 aa) initn: 2509 init1: 2509 opt: 2509 Z-score: 2677.5 bits: 505.5 E(32554): 8.2e-143 Smith-Waterman score: 3989; 95.8% identity (95.8% similar) in 624 aa overlap (1-624:1-598) 10 20 30 40 50 60 pF1KB3 MSDNQSWNSSGSEEDPETESGPPVERCGVLSKWTNYIHGWQDRWVVLKNNALSYYKSEDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 MSDNQSWNSSGSEEDPETESGPPVERCGVLSKWTNYIHGWQDRWVVLKNNALSYYKSEDE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 TEYGCRGSICLSKAVITPHDFDECRFDISVNDSVWYLRAQDPDHRQQWIDAIEQHKTESG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 TEYGCRGSICLSKAVITPHDFDECRFDISVNDSVWYLRAQDPDHRQQWIDAIEQHKTESG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 YGSESSLRRHGSMVSLVSGASGYSATSTSSFKKGHSLREKLAEMETFRDILCRQVDTLQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 YGSESSLRRHGSMVSLVSGASGYSATSTSSFKKGHSLREKLAEMETFRDILCRQVDTLQK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 YFDACADAVSKDELQRDKVVEDDEDDFPTTRSDGDFLHSTNGNKEKLFPHVTPKGINGID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 YFDACADAVSKDELQRDKVVEDDEDDFPTTRSDGDFLHSTNGNKEKLFPHVTPKGINGID 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 FKGEAITFKATTAGILATLSHCIELMVKREDSWQKRLDKETEKKRRTEEAYKNAMTELKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 FKGEAITFKATTAGILATLSHCIELMVKREDSWQKRLDKETEKKRRTEEAYKNAMTELKK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 KSHFGGPDYEEGPNSLINEEEFFDAVEAALDRQDKIEEQSQSEKVRLHWPTSLPSGDAFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 KSHFGGPDYEEGPNSLINEEEFFDAVEAALDRQDKIEEQSQSEKVRLHWPTSLPSGDAFS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 SVGTHRFVQKPYSRSSSMSSIDLVSASDDVHRFSSQVEEMVQNHMTYSLQDVGGDANWQL :::::::::: :::::::::::::::::::::::: CCDS40 SVGTHRFVQK--------------------------VEEMVQNHMTYSLQDVGGDANWQL 370 380 390 430 440 450 460 470 480 pF1KB3 VVEEGEMKVYRREVEENGIVLDPLKATHAVKGVTGHEVCNYFWNVDVRNDWETTIENFHV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 VVEEGEMKVYRREVEENGIVLDPLKATHAVKGVTGHEVCNYFWNVDVRNDWETTIENFHV 400 410 420 430 440 450 490 500 510 520 530 540 pF1KB3 VETLADNAIIIYQTHKRVWPASQRDVLYLSVIRKIPALTENDPETWIVCNFSVDHDSAPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 VETLADNAIIIYQTHKRVWPASQRDVLYLSVIRKIPALTENDPETWIVCNFSVDHDSAPL 460 470 480 490 500 510 550 560 570 580 590 600 pF1KB3 NNRCVRAKINVAMICQTLVSPPEGNQEISRDNILCKITYVANVNPGGWAPASVLRAVAKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 NNRCVRAKINVAMICQTLVSPPEGNQEISRDNILCKITYVANVNPGGWAPASVLRAVAKR 520 530 540 550 560 570 610 620 pF1KB3 EYPKFLKRFTSYVQEKTAGKPILF :::::::::::::::::::::::: CCDS40 EYPKFLKRFTSYVQEKTAGKPILF 580 590 624 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 07:03:05 2016 done: Sat Nov 5 07:03:05 2016 Total Scan time: 3.120 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]