FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0043, 544 aa
  1>>>pF1KE0043 544 - 544 aa - 544 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.0624+/-0.00126; mu= 14.0905+/- 0.075
 mean_var=63.1393+/-12.715, 0's: 0 Z-trim(99.8): 30 B-trim: 2 in 1/47
 Lambda= 0.161408
 statistics sampled from 5861 (5864) to 5861 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.527), E-opt: 0.2 (0.18), width: 16
 Scan time: 2.850

The best scores are:                                       opt bits E(32554)
CCDS11850.2 PIEZO2 gene_id:63895|Hs108|chr18       (2752) 1954 464.4 7.9e-130
CCDS54058.1 PIEZO1 gene_id:9780|Hs108|chr16        (2521) 1244 299.0 4.3e-80

>>CCDS11850.2 PIEZO2 gene_id:63895|Hs108|chr18 (2752 aa)
 initn: 1954 init1: 1954 opt: 1954 Z-score: 2444.3 bits: 464.4 E(32554): 7.9e-130
Smith-Waterman score: 3236; 89.1% identity (89.1% similar) in 576 aa overlap (32-544:2177-2752)

              10        20        30        40        50        60
pF1KE0 EKLQEHLIKAKAFTIKKTLEIYVPIKQFFYNLIHPEYSAVTDVYVLMFLADTVDFIIIVF
                                     ::::::::::::::::::::::::::::::
CCDS11 EKLQEHLIKAKAFTIKKTLEIYVPIKQFFYNLIHPEYSAVTDVYVLMFLADTVDFIIIVF
       2150      2160      2170      2180      2190      2200

              70        80        90       100       110       120
pF1KE0 GFWAFGKHSAAADITSSLSEDQVPGPFLVMVLIQFGTMVVDRALYLRKTVLGKVIFQVIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GFWAFGKHSAAADITSSLSEDQVPGPFLVMVLIQFGTMVVDRALYLRKTVLGKVIFQVIL
       2210      2220      2230      2240      2250      2260

             130       140       150       160       170       180
pF1KE0 VFGIHFWMFFILPGVTERKFSQNLVAQLWYFVKCVYFGLSAYQIRCGYPTRVLGNFLTKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 VFGIHFWMFFILPGVTERKFSQNLVAQLWYFVKCVYFGLSAYQIRCGYPTRVLGNFLTKS
       2270      2280      2290      2300      2310      2320

             190       200       210       220       230       240
pF1KE0 YNYVNLFLFQGFRLVPFLTELRAVMDWVWTDTTLSLSSWICVEDIYAHIFILKCWRESEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 YNYVNLFLFQGFRLVPFLTELRAVMDWVWTDTTLSLSSWICVEDIYAHIFILKCWRESEK
       2330      2340      2350      2360      2370      2380

pF1KE0 ------------------------------------------------------------

CCDS11 RYPQPRGQKKKKVVKYGMGGMIIVLLICIVWFPLLFMSLIKSVAGVINQPLDVSVTITLG
       2390      2400      2410      2420      2430      2440

                250       260       270       280       290
pF1KE0 ---PIFTMSAQQSQLKVMDQQSFNKFIQAFSRDTGAMQFLENYEKEDITVAELEGNSNSL
          :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GYQPIFTMSAQQSQLKVMDQQSFNKFIQAFSRDTGAMQFLENYEKEDITVAELEGNSNSL
       2450      2460      2470      2480      2490      2500

      300       310       320       330       340       350
pF1KE0 WTISPPSKQKMIHELLDPNSSFSVVFSWSIQRNLSLGAKSEIATDKLSFPLKNITRKNIA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 WTISPPSKQKMIHELLDPNSSFSVVFSWSIQRNLSLGAKSEIATDKLSFPLKNITRKNIA
       2510      2520      2530      2540      2550      2560

      360       370       380       390       400       410
pF1KE0 KMIAGNSTESSKTPVTIEKIYPYYVKAPSDSNSKPIKQLLSENNFMDITIILSRDNTTKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KMIAGNSTESSKTPVTIEKIYPYYVKAPSDSNSKPIKQLLSENNFMDITIILSRDNTTKY
       2570      2580      2590      2600      2610      2620

      420       430       440       450       460       470
pF1KE0 NSEWWVLNLTGNRIYNPNSQALELVVFNDKVSPPSLGFLAGYGIMGLYASVVLVIGKFVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NSEWWVLNLTGNRIYNPNSQALELVVFNDKVSPPSLGFLAGYGIMGLYASVVLVIGKFVR
       2630      2640      2650      2660      2670      2680

      480       490       500       510       520       530
pF1KE0 EFFSGISHSIMFEELPNVDRILKLCTDIFLVRETGELELEEDLYAKLIFLYRSPETMIKW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EFFSGISHSIMFEELPNVDRILKLCTDIFLVRETGELELEEDLYAKLIFLYRSPETMIKW
       2690      2700      2710      2720      2730      2740

      540
pF1KE0 TREKTN
       ::::::
CCDS11 TREKTN
       2750

>>CCDS54058.1 PIEZO1 gene_id:9780|Hs108|chr16 (2521 aa)
 initn: 1849 init1: 1239 opt: 1244 Z-score: 1551.4 bits: 299.0 E(32554): 4.3e-80
Smith-Waterman score: 1924; 53.7% identity (74.7% similar) in 574 aa overlap (10-514:1926-2492)

                                    10        20        30
pF1KE0                      MEKLQEHLIKAKAFTIKKTLEIYVPIKQFFYNLIHPEYS
                                     . ..: .. .   : :...::....: .:
CCDS54 EEEGEEEKEAPTGREKRPSRSGGRVRAAGRRLQGFCLSLAQGTYRPLRRFFHDILHTKYR
        1900      1910      1920      1930      1940      1950

      40        50        60        70        80        90
pF1KE0 AVTDVYVLMFLADTVDFIIIVFGFWAFGKHSAAADITSSLSEDQVPGPFLVMVLIQFGTM
       :.::::.::::::.::::::.::::::::::::.:::::::.::::  ::::.::::.::
CCDS54 AATDVYALMFLADVVDFIIIIFGFWAFGKHSAATDITSSLSDDQVPEAFLVMLLIQFSTM
        1960      1970      1980      1990      2000      2010

     100       110       120       130       140       150
pF1KE0 VVDRALYLRKTVLGKVIFQVILVFGIHFWMFFILPGVTERKFSQNLVAQLWYFVKCVYFG
       :::::::::::::::. ::: ::..::.:::::::.:::: :.::.::::::::::.::.
CCDS54 VVDRALYLRKTVLGKLAFQVALVLAIHLWMFFILPAVTERMFNQNVVAQLWYFVKCIYFA
        2020      2030      2040      2050      2060      2070

     160       170       180       190       200       210
pF1KE0 LSAYQIRCGYPTRVLGNFLTKSYNYVNLFLFQGFRLVPFLTELRAVMDWVWTDTTLSLSS
       :::::::::::::.:::::::.::..::::::::::::::.:::::::::::::::::::
CCDS54 LSAYQIRCGYPTRILGNFLTKKYNHLNLFLFQGFRLVPFLVELRAVMDWVWTDTTLSLSS
        2080      2090      2100      2110      2120      2130

     220       230       240
pF1KE0 WICVEDIYAHIFILKCWRESEK--------------------------------------
       :.:::::::.:::.:: ::.::
CCDS54 WMCVEDIYANIFIIKCSRETEKKYPQPKGQKKKKIVKYGMGGLIILFLIAIIWFPLLFMS
        2140      2150      2160      2170      2180      2190

                                      250       260       270
pF1KE0 -------------------------PIFTMSAQQSQLKVMDQQSFNKFIQAFSRDTGAMQ
                                :.::::::: ..  .  :..... . :. .  :::
CCDS54 LVRSVVGVVNQPIDVTVTLKLGGYEPLFTMSAQQPSIIPFTAQAYEELSRQFDPQPLAMQ
        2200      2210      2220      2230      2240      2250

        280       290       300       310       320       330
pF1KE0 FLENYEKEDITVAELEGNSNSLWTISPPSKQKMIHELLDPNSSFSVVFSWSIQRNLSLGA
       :. .:  :::..:..::.:..:: :::::. .: .:: . ...... :.:..::.:. :.
CCDS54 FISQYSPEDIVTAQIEGSSGALWRISPPSRAQMKRELYNGTADITLRFTWNFQRDLAKGG
        2260      2270      2280      2290      2300      2310

        340         350       360       370       380       390
pF1KE0 KSEIATDK--LSFPLKNITRKNIAKMIAGNSTESSKTPVTIEKIYPYYVKAPSDSNSKPI
         : :..:  :..  .. .:...:... :.: .:    :.: ...: :..::.  ...:.
CCDS54 TVEYANEKHMLALAPNSTARRQLASLLEGTSDQS----VVIPNLFPKYIRAPNGPEANPV
        2320      2330      2340          2350      2360      2370

            400       410       420         430       440       450
pF1KE0 KQLLS--ENNFMDITIILSRDNTTKYNS--EWWVLNLTGNRIYNPNSQALELVVFNDKVS
       :::    : ... . : : :..    ..  ::::..:   :    .   : .:.:.::::
CCDS54 KQLQPNEEADYLGVRIQLRREQGAGATGFLEWWVIELQECR---TDCNLLPMVIFSDKVS
            2380      2390      2400      2410         2420

              460       470       480       490       500       510
pF1KE0 PPSLGFLAGYGIMGLYASVVLVIGKFVREFFSGISHSIMFEELPNVDRILKLCTDIFLVR
       ::::::::::::::::.:.::::::::: ::: ::::::::::: :::::::: ::::::
CCDS54 PPSLGFLAGYGIMGLYVSIVLVIGKFVRGFFSEISHSIMFEELPCVDRILKLCQDIFLVR
     2430      2440      2450      2460      2470      2480

              520       530       540
pF1KE0 ETGELELEEDLYAKLIFLYRSPETMIKWTREKTN
       :: :
CCDS54 ETRELELEEELYAKLIFLYRSPETMIKWTREKE
     2490      2500      2510      2520

544 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 06:21:11 2016 done: Fri Nov 4 06:21:11 2016
 Total Scan time: 2.850 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]