FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7227, 220 aa 1>>>pF1KB7227 220 - 220 aa - 220 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3817+/-0.000831; mu= 13.2927+/- 0.050 mean_var=62.5773+/-12.503, 0's: 0 Z-trim(105.9): 26 B-trim: 0 in 0/54 Lambda= 0.162131 statistics sampled from 8648 (8668) to 8648 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.654), E-opt: 0.2 (0.266), width: 16 Scan time: 2.010 The best scores are: opt bits E(32554) CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18 ( 220) 1477 353.9 4.7e-98 CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15 ( 213) 483 121.4 4.4e-28 CCDS4104.1 STARD4 gene_id:134429|Hs108|chr5 ( 205) 379 97.0 9e-21 CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17 ( 427) 281 74.3 1.4e-13 CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17 ( 445) 281 74.3 1.4e-13 CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17 ( 445) 281 74.3 1.4e-13 CCDS6102.1 STAR gene_id:6770|Hs108|chr8 ( 285) 275 72.8 2.5e-13 >>CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18 (220 aa) initn: 1477 init1: 1477 opt: 1477 Z-score: 1873.9 bits: 353.9 E(32554): 4.7e-98 Smith-Waterman score: 1477; 100.0% identity (100.0% similar) in 220 aa overlap (1-220:1-220) 10 20 30 40 50 60 pF1KB7 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGIIPESPAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGIIPESPAK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 LSDFLYQTGDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDFIDLVYIKRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LSDFLYQTGDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDFIDLVYIKRY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 EGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFVQTEMRGKLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 EGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFVQTEMRGKLS 130 140 150 160 170 180 190 200 210 220 pF1KB7 PSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS :::::::::::::::::::::::::::::::::::::::: CCDS11 PSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS 190 200 210 220 >>CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15 (213 aa) initn: 475 init1: 336 opt: 483 Z-score: 617.6 bits: 121.4 E(32554): 4.4e-28 Smith-Waterman score: 483; 34.4% identity (65.1% similar) in 209 aa overlap (1-204:1-209) 10 20 30 40 50 pF1KB7 MDFKAIAQQT---AQEVLGYNRDTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGIIPES :: ::.. :...: : :::.:::. . .. ..:: . : .: ::::: :::. . CCDS10 MDPALAAQMSEAVAEKMLQYRRDTAGWKICREGNGVSVSWRPSVEFPGNLYRGEGIVYGT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 PAKLSDFLYQT--GDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDFIDLV .. : . . : :. ::... ..... : . . .: : : :. ::::::.::: CCDS10 LEEVWDCVKPAVGGLRVKWDENVTGFEIIQSITDTLCVSRTSTPSAAMKLISPRDFVDLV 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 YIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFVQTEM .:::: . .. :. : ::. ...::.::::: : :. .:. ..:: : .:.. CCDS10 LVKRYEDGTISSNATHVEHPLCPPKPGFVRGFNHPCGCFCEPLPGEPTKTNLVTFFHTDL 130 140 150 160 170 180 180 190 200 210 220 pF1KB7 RGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS : : ..... .: ... : : . ..: CCDS10 SGYLPQNVVDSFFPRSMTRFYANLQKAVKQFHE 190 200 210 >>CCDS4104.1 STARD4 gene_id:134429|Hs108|chr5 (205 aa) initn: 324 init1: 203 opt: 379 Z-score: 486.4 bits: 97.0 E(32554): 9e-21 Smith-Waterman score: 379; 29.7% identity (63.6% similar) in 195 aa overlap (2-195:6-197) 10 20 30 40 50 pF1KB7 MDFKAIAQQTAQEVLGYNR-DTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGIIP : ..: . . .. :. . . :.:.: .: .:: : :..:.: ::...:.: CCDS41 MEGLSDVASFATKLKNTLIQYHSIEEDKWRVAKKTKDVTVWRKPSEEFNGYLYKAQGVID 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ESPAKLSDFLYQTGDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDFIDLV . .. : . :. ::. . .... .. . . . : . . ::::.:.:. CCDS41 DLVYSIIDHIRPGPCRLDWDSLMTSLDILENFEENCCVMRYTTAGQLWNIISPREFVDFS 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 YIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFVQTEM : :. .. . . :.:. : : ...::::::::. : :...:: : :. ..::.. CCDS41 YTVGYKEGL-LSCGISLDWDEKRP--EFVRGYNHPCGWFCVPLKDNPNQSLLTGYIQTDL 130 140 150 160 170 180 190 200 210 220 pF1KB7 RGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS :: . : .. .: :.:.:: CCDS41 RGMIPQSAVDTAMASTLTNFYGDLRKAL 180 190 200 >>CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17 (427 aa) initn: 183 init1: 86 opt: 281 Z-score: 357.5 bits: 74.3 E(32554): 1.4e-13 Smith-Waterman score: 281; 24.2% identity (62.1% similar) in 182 aa overlap (24-203:241-420) 10 20 30 40 50 pF1KB7 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGI :: :... . :::. . .. . CCDS54 KSFSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTF 220 230 240 250 260 270 60 70 80 90 100 110 pF1KB7 IPESPAKL--SDFLYQTGDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDF .: ::.: .. . : . :.:.. . ....:....:.: . .. . : : .::::: CCDS54 LP-CPAELVYQEVILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDF 280 290 300 310 320 120 130 140 150 160 170 pF1KB7 IDLVYIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFV ... :.: . . . :. ... ::. .:.:: : : ::. :: .: .. CCDS54 VNVRRIERRRDRY-LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWIL 330 340 350 360 370 380 180 190 200 210 220 pF1KB7 QTEMRGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS .:...:.: .:.... ... .: .. .. : CCDS54 NTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA 390 400 410 420 >>CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa) initn: 183 init1: 86 opt: 281 Z-score: 357.2 bits: 74.3 E(32554): 1.4e-13 Smith-Waterman score: 281; 24.2% identity (62.1% similar) in 182 aa overlap (24-203:259-438) 10 20 30 40 50 pF1KB7 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGI :: :... . :::. . .. . CCDS11 KSFSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTF 230 240 250 260 270 280 60 70 80 90 100 110 pF1KB7 IPESPAKL--SDFLYQTGDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDF .: ::.: .. . : . :.:.. . ....:....:.: . .. . : : .::::: CCDS11 LP-CPAELVYQEVILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDF 290 300 310 320 330 340 120 130 140 150 160 170 pF1KB7 IDLVYIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFV ... :.: . . . :. ... ::. .:.:: : : ::. :: .: .. CCDS11 VNVRRIERRRDRY-LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWIL 350 360 370 380 390 400 180 190 200 210 220 pF1KB7 QTEMRGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS .:...:.: .:.... ... .: .. .. : CCDS11 NTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA 410 420 430 440 >>CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa) initn: 183 init1: 86 opt: 281 Z-score: 357.2 bits: 74.3 E(32554): 1.4e-13 Smith-Waterman score: 281; 24.2% identity (62.1% similar) in 182 aa overlap (24-203:259-438) 10 20 30 40 50 pF1KB7 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGI :: :... . :::. . .. . CCDS54 KSFSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTF 230 240 250 260 270 280 60 70 80 90 100 110 pF1KB7 IPESPAKL--SDFLYQTGDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDF .: ::.: .. . : . :.:.. . ....:....:.: . .. . : : .::::: CCDS54 LP-CPAELVYQEVILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDF 290 300 310 320 330 340 120 130 140 150 160 170 pF1KB7 IDLVYIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFV ... :.: . . . :. ... ::. .:.:: : : ::. :: .: .. CCDS54 VNVRRIERRRDRY-LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWIL 350 360 370 380 390 400 180 190 200 210 220 pF1KB7 QTEMRGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS .:...:.: .:.... ... .: .. .. : CCDS54 NTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA 410 420 430 440 >>CCDS6102.1 STAR gene_id:6770|Hs108|chr8 (285 aa) initn: 241 init1: 119 opt: 275 Z-score: 352.7 bits: 72.8 E(32554): 2.5e-13 Smith-Waterman score: 275; 23.8% identity (63.9% similar) in 202 aa overlap (8-206:80-278) 10 20 30 pF1KB7 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSS ... :..:: . ::: : :.. . .. CCDS61 NQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK--KESQQDNGDK 50 60 70 80 90 100 40 50 60 70 80 90 pF1KB7 KASRKFH--GNLYRVEGIIPESPAKLSDFLYQTGDRI-TWDKSLQVYNMVHRIDSDTFIC :. :...:.: .. . .: . : . . . :. ... .....: .:::: CCDS61 VMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFIT 110 120 130 140 150 160 100 110 120 130 140 150 pF1KB7 HTITQSFAVGSISPRDFIDLVYIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFV : .. : . ..::::... :: .:. .... ..:: ..: ... ::. . : .: CCDS61 HELAAEAAGNLVGPRDFVSVRCAKR-RGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMV 170 180 190 200 210 220 160 170 180 190 200 210 pF1KB7 CSPMEENPAYSKLVMFVQTEMRGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGF :. .:. .::. ... ...: : :::.... .. :.: . . ...: CCDS61 LHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC 230 240 250 260 270 280 220 pF1KB7 HHNSHS 220 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 09:44:01 2016 done: Sat Nov 5 09:44:01 2016 Total Scan time: 2.010 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]