Hot springs as a Source for Studying Rhomboid Protease Genes and Inhibitors in Silico Study

 

Belal Al Shomali, Muhd Danish-Daniel*

Bioinformatics, ICAMB Institute of Climate Adaptation and Marine Biotechnology,

University Malaysia Terengganu, Kuala Nerus, Malaysia.

*Corresponding Author E-mail: mdda@umt.edu.my

 

ABSTRACT:

Rhomboid proteases (Rho) are affected by amino acid composition, protein structure, oligomerization, strong contacts, salt bridges, and bonding patterns. Protein tertiary structure can change with a single amino acid substitution. Rhomboids cleave misfolded membrane substrates. They help signal growth factors, maintain mitochondrial homeostasis, regulate protein quality, and invade parasites. Studying these proteins and their inhibitors will improve the medication targeting of this rhomboid protease, which is involved in the pathophysiology of numerous disorders like type II diabetes and Parkinson's. Their importance in eukaryotes is widely known, but their involvement in bacterial physiology is not. Rho genes are studied using Hot Springs metagenomic samples. JGI and IMG provided thermophilic protease sequences. MAFFT-aligned sequences. InterProScan examined every protein domain, whereas ProtParam calculated protease amino acid frequencies. I-TASSER predicts three-dimensional protein structures; CB-Dock, and Discovery Studio simulate and dock. Hot spring isolates in rhomboid gene alignments hindered the protein's evolution at high temperatures. Isolations conserved amino acid composition and active domains. Rhomboids' fundamental structure and functional locations have stayed intact across most life forms, preserving their proteolytic action. Asn 62, Trp 57, Ile 143, Phe (61, 100), Leu 99, and Arg 284 were critical in hot spring Rho genes. The fact that Rho inhibitors are active on hot spring rhomboids suggests that the enzyme has maintained a high degree of structural and functional homogeneity despite its presence in hot environments.

 

KEYWORDS: Hot Springs, Rhomboid Proteases, Thermophiles, Metagenomics, Protolytic.

 

 


1. INTRODUCTION:

Researchers have found a wealth of new information about organisms known as "extremophiles," which are those that are able to survive in environments of extreme cold and heat alkalinity or salinity. Since their discovery, these species have been given a stimulating and challenging environment for study 1,2.

 

Exotic habitats that were more common on the primordial Earth are where these severe microbial growth conditions can be found today. When organic solvents or heavy metals are found to be present is also common in extreme settings. Increasing numbers of studies are being conducted on the evolution and taxonomy of extreme-environment organisms, such as thermophiles and Pscryophiles 3.

 

Warm water seeps from the Earth in the form of a spring. However, no universally agreed-upon definition exists for what constitutes a "hot spring." 4 described hot springs merely as warm groundwater that flows naturally to the ground. described hot springs as springs where the water temperature is substantially above that region's average annual air temperature 5.

 

As a potential source of thermophilic bacteria, several hot springs in the area were investigated. As a result of the wide range of hot springs and the fact that most microbiologists attempt to extract thermophiles from hot environments, beneficial enzyme-generating bacteria have been found. Biochemical and molecular techniques can be used for both the bacteria and their enzymes 6. Consequently, there is a growing need for enzymes capable of operating in a variety of industrial conditions and that can handle various production processes. Throughout the ages, as DNA and protein technology has advanced, many ways have been devised to meet these ever-increasing demands. Bioprospecting is one of these methods, which entails searching for new enzymes in various environments with a high amount of natural variation in microorganisms7. For discovering these enzymes, metagenomics is also an essential new technique, and this mining strategy for the biotechnology and pharmaceutical industries has been revealed as a promising one 8. Metagenomics has two words; Meta(analysis), a statistical analysis method of the outcomes of two different analyses, and Genomics, which is the genetic makeup analysis 9. Genomic DNA is extracted from environmental samples and sequenced using high-throughput methods such as shotgun sequencing and 454 pyrosequencing, which reduces the failure of essential organisms during cultivation. Analysis of gene or species richness and distribution is generally done using sequence-based screening, whereas functional-based screening is used to evaluate the functional capabilities of microbial communities 10.

 

 

The protease enzyme is one of the essential enzymes in the body. Proteases are degradative enzymes that hydrolyze the peptide bond found in the amino acid sequence of polypeptides, allowing the polypeptide to be broken down. Peptides from microorganisms are the most often utilized enzymes in a wide range of industrial applications such as detergents and animal feed and leather, textiles, trash, and therapeutic applications 11. Depending on their pH, they are classified as either acidic, neutral, or alkaline. Proteases are classified as endopeptidases or exopeptidases based on the location of their active site on the protein substrate. Additionally, they can be classed as metal, cysteine, aspartic, or serine proteases based on their catalytic properties 12, 13. The rhomboid type was taken into consideration in our research. Even though it is a large family of associated membrane proteins with a diverse range of biological functions, they all share the same catalytic core domain, composed of six membrane-spanning segments containing several highly conserved sequence motifs that are characteristic of the family. Rhomboid proteases have their active sites incorporated in the lipid bilayer where their substrate is hydrolyzed to or adjacent to the transmembrane domains (TMDs) 14,15.

 

 

The primary purpose of this study is to conduct a thorough examination of the genes linked to Rho activity in bacteria found in hot springs. The ultimate goal is to gain a deeper understanding of the genetic mechanisms that underlie this enzymatic activity. This involves the discovery of distinct genes that are accountable for the function of rhomboid proteases, followed by a comparative analysis of these genes. This study entails the examination of gene sequences in order to determine if differences in enzyme temperature ranges are a result of changes in the general structure of polypeptides or adjustments to catalytic sites. Additionally, it aims to comprehend the importance of amino acid composition in maintaining protein stability. Moreover, the present study aims to assess protein domains across various sequences in order to ascertain putative functional domains. In addition, this study aims to employ computational techniques in order to forecast the three-dimensional configuration of rhomboid proteases and evaluate their secondary structure, solvent accessibility, and B factor. The ultimate goal entails performing docking investigations to predict the protolytic and aminopeptidase functions of certain rhomboid proteases, with the intention of aiding future endeavors in protein engineering to improve psychrophilic and thermophilic properties. This work seeks to provide useful insights into the molecular processes that underlie the function, evolution, and possible therapeutic applications of rhomboid proteases through a series of comprehensive investigations.

 

2. MATERIALS AND METHODS:

2.1 Data collection for Rho from JGI and PATRIC site:

Rho gene of bacteria isolated from hot springs was obtained from the Joint Genome Institute (JGI) data base 16. We considered the gene ID, isolate name, DNA sequence length, and amino acid length. After eliminating the redundancy, non-redundant sequences were obtained using Cluster Database at High Identity with Tolerance (CD-Hit) (http://weizhong-lab.ucsd.edu/cdhit-web-server/cgi-bin/index.cgi?cmd=Server%20home). Ninety percent sequence identity cut-off was considered for clustering the sequences 17.

 

2.2 Gene sequence alignment of Rho using MAFFT and MEGA X:

Rho gene sequences of bacteria isolated from hot springs were aligned with the MAFFT website http://mafft.cbrc.jp/alignment/server/ and phylogenetic tree was constructed by the use of Mega X v 11.0 sequence analysis software and Tree Of Life (http://itol.embl.de) is a web-based tool18 19 20.

 

 

2.3 Prediction of Rho amino acid content using expasy:

The ExPASy server has a utility called ProtParam that may be accessed online. It may analyze a protein sequence obtained from Swiss-Prot or TrEMBL or a user-entered. 27 Rho gene sequences were isolated from hot springs to determine the amino acid structure and composition. FASTA sequences of the selected Rho gene were submitted to a web server for analysis, and the results were viewed in this study 21.

 

2.4 Prediction of Rho active domain using interproscan:

The InterProScan (http://www.ebi.ac.uk/InterProScan) was used to evaluate the protein domains of all sequences 27 Rho gene sequences were isolated from hot springs were submitted to analyze the conserved motifs of the encoded proteins 22.

 

2.5 Prediction of 3d structure for Rho using I-TASSER:

The Rho gene sequence of the hot spring isolated strain was submitted to homology modeling by the I-TASSER Server for 3D-structure prediction. After the structure assembly simulation, I-TASSER uses the TM-align structural alignment program to match the first I-TASSER model to all structures in the PDB library23.

 

2.6 Improvement of 3d structure Rho using modrefiner:

ModRefiner (https://zhanglab.ccmb.med.umich.edu/ModRefiner/) was utilized. In my study, the Rho gene sequences extracted from hot springs was submitted to the ModRefiner server. This process was repeated five times until a tridimensional model corroborated the consensus secondary structure and domain boundary predictions possessed, RMSD and TM-score within range 24,25.

 

2.7 Prediction binding site for Rho using dogsitescorer:

The modeled Rho gene from hot-springs isolates were submitted to DoGSiteScorer (https://bio.tools/dogsitescorer):server to prediction active site and obtained the possible binding sites 26.

 

2.8 Ligand design using chemdsketch and chem3d ultra:

2D structures of the Methionine, Leucine, Rivastigmine, N-carbobenzyloxy-L-alanine and 4- amidino phenyl methanesulfonyl fluoride compounds were generated with chemdsketch v 11.0, a software 27. These ligands were then copied into Chem3D ultra v 16.0, which was used to generate a 3D model. Molecular mechanics was then used to minimize the amount of energy required to run the model (MM2) . These structures with the lowest possible energy expenditure are considered for docking 28, 29.

 

2.9 Energy minimization of Rho using chiron server:

The Rho sequences for hot springs have been uploaded to Chiron (https://dokhlab.med.psu.edu/chiron/processManager.php) The clashes to resolve were selected from relevant work areas, and results were delivered within 24 hours 30, 31.

 

2.10 Substrates for molecular docking studies using CB-DOCK:

The 3D structures of substrates like methionine, leucine, rivastigmine, N-carbobenzyloxy-L-alanine and 4-amidino phenyl methanesulfonyl flurode were docked with the binding sites of modeled Rho from the isolates by using CB-Dock Server (http://clab.labshare.cn/cb-dock/php/blinddock.php) 32, 33.

 

2.11 Prediction of ligand-Rho interactions using discovery studio:

The interactions between methionine, leucine, rivastigmine, N-carbobenzyloxy-L-alanine and 4- amidino phenyl methanesulfonyl fluoride and Rho from the isolates as docked complex were analyzed by Discovery Studio (http://www.adrianomartinelli.it/Fondamenti/Tutorial_0.pdf) 34.

 

3. RESULT:

Hot springs isolates collections and analysis of rhomboid gene:


 

Figure 1. Shows information of Rho gene of hot springs isolates retrieved from JGI.

 

Figure 2. a) Represent conserved area of amino acid sequences of the Rho were isolated from hot-springs by MAFFT software. b) Represent the evolutionary history for hot spring isolates was inferred base on Neighbor-Joining method by using MEGA X 11. Bootstrap frequencies are given for a multiple data set of 100 trials.

 


JGI offers comprehensive high-throughput sequencing, design, and synthesis of DNA, metabolomics, and computational methods that allow system based scientific approaches to these challenges. While (IMG) introduces a way for comparative analysis of predominantly microbial genomes, the program also supports samples from the environment. The purpose is to promote genome visualization and exploration from a practical and evolutionary point of view 16. CD-HIT is a widely used technique for minimizing sequence duplication and is regarded as the most advanced method available35. From the JGI site, 93 Rho gene sequences from bacteria retrieved from hot springs samples, after removing redundancies by the CD Hit program, and according to the number of amino acids, 27 Rho gene sequences were chosen for the study (Please see figure 1). all the genomic sequences of bacteria, archaea, and eukaryotes, except some small genome’s organisms. Based on laboratory experiments done on Drosophila and AarA protein of the bacterium Providencia stuartii to study the developmental regulator rhomboid, rhomboids are believed to be intramembrane serine proteases, the signaling mechanism of which is retained in eukaryotes and prokaryotes 36. According of (Figure 2 (a, b)) shows amino acid sequences of the Rho highly conserved, the highlighted boxes also indicate some samples of conserved areas in amino acid sequences. Phylogenetic tree Using Neighbor-Joining, the evolutionary history of hot springs isolates was inferred. The ideal tree is illustrated. Next to each branch is the percentage of 500 replicate trees in which the relevant taxa clustered together in the bootstrap test. (Alongside the branches) The evolutionary distances were calculated using the Poisson correction method and are expressed in terms of the number of substitutions of amino acids per site. Twenty-six Rho sequences were analyzed based on the hot spring isolates. The final dataset had 562 locations.

 

Comparison of amino acid frequencies in Rho isolated from extremophile bacteria:

In general, thermophilic enzymes alter their amino acid composition to withstand high temperature, and this includes additional charged residues providing more ionic interactions that will stabilize the structure, more aromatic residues that stabilize the protein core, fewer uncharged polar residues, fewer amides amino acids, and more hydrogen bonding and salt bridges. In addition, increase, hydrophobic residues in the protein core maintain tight molecular packing, higher hydrophobic core, lower backbone, and entropy of the side chains 37, 38. In comparison, cold-adapted proteins have a variety of modifications in amino acids that offer increased flexibility, including lower salt bridges and hydrogen bonds, lower proline residue content, lower Arg/(Arg + Lys) ratio, higher serine and glycine availability, more asparagines, increase in the negative net charge, broad accessible hydrophobic surfaces, and longer loops 39,40. In our study, the aliphatic amino acids are predominant (the amino acids that contain alkyl (CH2) group) in rhomboid protease; Leu come in the first place, and Ala in the third place, followed by Ile and Val, these hydrophobic non-polar amino acids play an essential role in the hydrophobic activity, which is the critical factor in maintaining conformational stability within the protein's inner and outer portions (Please see figure 3).  Gly is the second most presented amino acid. It is a hydrophobic amino acid with no side chain, or we can consider the hydrogen ion as its side chain. This gives the Gly more conformational flexibility. Ser is a polar amino acid with a hydroxymethyl group at its active site, which makes it capable of forming hydrogen bonds with a number of polar substrates; it is presented in protein active sites as the classical Asp-His-Ser catalytic triad responsible for the hydrolysis of other molecules 41. Histidine is a polar positively charged amino acid with imidazole side chain, it’s percent 2.09 in hot springs isolates, its presence is widely common in metal-binding sites (like zinc) of the proteins; also it is presented in the catalytic triads in which its basic nitrogen abstract a proton from serine, threonine, or cysteine to activate it as a nucleophile 42.

 

Figure 3. The orange column displays the average amino acid ratio in a Rho Hot-springs isolates.

Identification of active domain for rhomboid protease:

The rhomboid family, S54, has been little explored. Researchers have discovered this phenomenon in bacteria, archaea, and more recently, eukaryotes, beginning with Drosophila melangolaster 43. InterProScan results in (figure 4) have shown various domains of the proteins determined from the isolates; Peptidase S54-rhomboid domain is the predominant domain in Rho isolated from hot springs. The rhomboid family (S54) is a family that has received limited research attention. The presence of this phenomenon has been extensively seen in bacteria, archaea, and more recently, eukaryotic creatures, with the inaugural discovery in Drosophila melangolaster 44.


 

Figure 4. Represent InterPro scan results of rhomboid protease genes isolated from hot springs bacteria.

 


3.2.1 Identifying Rho structure for hot-spring isolate using I-TASSER server:

On the result page of the I-TASSER, we consider several parameters; the first one is the confidence score, which is a number between 0 and 9, where a score of 9 is likely an exact match, while a score of 0 means that no matching answer was found. A higher score means a more confident prediction of secondary structure.  For Rho isolated from Hot spring secondary structure has more helix and no strands, so it is more stable. The stability of these Extremophilic proteins is based on levels of amino acids that promote alpha-helical secondary structures. The second parameter is the Solvent Accessibility (Solvent Accessible Surface Area (SASA)) or briefly Accessible Surface Area (ASA), considered the main feature for determining protein folding and stability and defined as the polar solvent accessible area of a given protein. It is a function of the protein's three-dimensional structure and, based on ASA values, amino acid residues of a protein can be classified as buried or exposed; the values range from 0 to 9, as 0 mean buried residue and 9 means highly exposed residue. For Rho isolated from hot-springs: The results show more buried residues, so the structure is more stable. The stability of thermophilic proteins is based on increased levels of amino acids that promote alpha-helical secondary structures, deletion/shortening of surface loops, and immobilization of terminal ends Proteins also have a higher hydrophobic core, increased polar/charged interactions, i.e., hydrogen bonds, salt bridges around the active site, and more ionic interactions on the surface, Impacts of low temperatures in the microorganisms are distributed along with different parts of the cell; mainly the cytoplasmic membranes and enzymes that appear to stiffen as the temperature decreases, which affects the permeability of the membrane, and the transfer of nutrients and waste materials, and catalysis, since enzymes need a certain resilience to function. 45-47 .Also, proteins can suffer from impaired folding and cold denaturation. Besides, primary biological processes involving nucleic acids such as DNA replication, translation, and transcription can also be impaired by exposure to low temperatures through the development of secondary structures or supercoiled structures, in addition the microbes express cold shock proteins. 48-50 The third parameter is the Predicted normalized B-factor which is a value to indicate the extent of the inherent thermal mobility of residues/atoms in proteins. Negative values mean the residue is relatively more stable in the structure. More results are above zero, so Rho isolated from Hot-springs, fewer values are above zero, so the structure is more stable (please see figure 5). I-TASSER modeling starts from the structure Rho for Candidatus Caldatribacterium (hot-springs) templates identified by LOMETS from the PDB library separately. Every threading program has the capability of producing tens of thousands of template alignments. LOMETS is a meta-server threading technique with numerous threading programs 51. I-TASSER will only use the most important threading alignment templates. The Z-score, the difference between raw scores and average scores reported in standard deviation, determines this. The ten best LOMETS threading program templates are here. The template with the greatest Z-score from each threading program is usually picked. Ranking threading applications by average performance in large-scale benchmark tests. Normalized Z-score >1 indicates alignment confidence. An "Easy" query protein target has at least one template for each threading program with a normalized Z-score greater than one. If not, the protein is a “Hard" target. 52 A C-score based on threading template alignment importance and structure assembly simulation convergence parameters quantifies each model's confidence. C-scores range from -5 to 2, with higher values indicating model confidence and vice versa. The I-TASSER service provides five top models for Rho isolated from hot-springs (3.54: -2.84); we choose the first model for modrefine with the best C-score for the two findings (Figure 6).


 

Figure 5. Represent secondary structure, solvent accessibility and normalized B-factor predicted for the Rho for Candidatus Caldatribacterium (hot-springs).

 

Figure 1. Shows I-TASSER modeling starts from the structure protease for Rho for Candidatus Caldatribacterium (Hot-springs) templates. I-TASSER's top ten most popular threading templates. The Z-score is the standard deviation between the raw alignment score and the mean. It has been frequently used to estimate the significance and quality of template alignments. Lift side shows final top model predict by I-TASSER.

 


RMSD and TM score of Rho for extremophiles after refinement:

The model provided by I-TASSER was not refined; it contained high energy residues or loops of unsymmetrical geometry. Therefore, the model was further refined using ModRefiner 53 the alignment quality was evaluated using RMSD and TM-score. TM-score, which might have values anywhere from (0 to 1). TM scores less than 0.2 suggest that there is no fold similarity; however, scores more than 0.5 indicate that there is a similar fold. On another side, Because of the inherent flexibility of proteins and the resolution constraints of experiments, the RMSD can be anywhere from (0 to 1.2) 54-56. In my study the results were for Rho isolated from Hot springs: RMSD=0.254, TM-score=0.9984 which are within the range (please see figure 7).

 

Figure 7. This figure shows 3D structures for Rho for Candidatus Caldatribacterium; after refinement. RMSD, and TM-Score value (within range).

Prediction of binding site pockets results:

DoGSiteScorer is a grid-based method that uses a Difference of Gaussian filter to detect potential binding pockets solely based on the 3D structure of the protein. 26 Global parameters like size, shape, depth, biophysical, and chemical aspects of projected pockets are calculated. Each (sub) pocket is assigned a simple druggability score (0–1) by default, with higher scores indicating greater druggability. DoGSiteScorer is increasingly used to assess the druggability of pharmacological targets, particularly membrane proteins 57-59. Each discovered pocket's volume, surface, and depth are depicted in figure 8. Descending pocket volume sorts the rows. The last table column contains the determined Simple Score and druggability score. The pocket representation color code matches the pocket descriptor table entry's backdrop color. DoGSiteScorer lists twelve probable pockets on Hot-spring Rho in the result tables and visualizes them with Jmol. Rho for Candidatus Caldatribacterium (hot springs) has three druggable pockets, the largest of which has a volume of 501.38. Druggable pockets have exclusive properties such P0 = 0.79, P1 = 0.83, and P2 = 0.70.

 


Figure 8. Represent server result page. Left: Table containing the main shape descriptors for all. The Rho for Candidatus Caldatribacterium (hot-springs). Right: The twelve detected pockets together with a cartoon representation of Rho for Candidatus Caldatribacterium (hot-springs).

 


Ligand and Rho preparation:

Docking methods seek the minimum of a molecular mechanics-based scoring function that may include realistic solvation terms. The goal is to match two molecules as closely as possible. We focus on finding the binding site; thus, we don't address the problem of estimating binding free energy. Most scoring functions have several local minima, creating extremely rugged energy landscapes. Therefore, most docking techniques include continuous local minimization of the energy function, regardless of the conformational space sampling algorithm. Steric conflicts are eliminated and energy measurements are more accurate 60, 61. When using chem draw to build a model, an atom's position may not match its molecule position. The model may show atom conformational strain or bond high-energy strain. For this reason, the model may oversimplify the molecule to fix the energy minimization computation with MM2 or MMFF94. It then analyzes the model to identify its atoms. The algorithm then repositions each atom to lower model potential energy. Chem3D calculates all permutations to relocate each atom in your model for the lowest energy 62-64  all ligands were minimized using MM2 tools in chem draw. Pink atoms represent hydrogen atoms, which contribute to ligand energy minimization by using the default parameters to generate a dock parameter file. (See figure 9). Chiron generates a clash score, a size-independent parameter calculated by dividing the total VDW repulsion energy by the total number of contacts. Chiron can evaluate if a protein has artifacts (excessive steric clashes) and return the clash score to physiological tolerance (0.02 kcal.mol-1.contact-1) based on data gathered from high-resolution structures.65 Web servers were used to prepare Rho structure for Hot-spring to dock the Chiron (https://dokhlab. med.psu.edu/chiron) used 66. According to (Figure 10) Rho structure was energy minimized using the Chiron web server. The Clash score value in the two graphs shows less than 0.02. Which mean the structures ready for docking.


 

Figure 9. Represent a total of 5 ligand were selected for molecular docking. Ligand structures are drawn using the chemsketch v. 11 software and energy minimization don with chem draw v. 16.

 

Figure 10. Shows Graph shows summary of the chiron clash energy minimization Procedure The color red depicts the class energy for the modeled Rho for Candidatus Caldatribacterium Californiense OP9-Cscg (Hot-springs), green indicates the final structure after minimization, and black indicates a series of high-resolution structures.

 

Figure 11. shows 2D interaction between Rho for Candidatus Caldatribacterium Californiense OP9-Cscg (Hot-springs), with five ligands ((a)4- amidino phenyl methanesulfonyl fluoride, b) Leucine, c) Methionine, d) N-carbobenzyloxy-L-alanine and e) Rivastigmine)) using Discovery Studio.

 

Table 1. Shows interaction vina score for ligands and rhomboid protease isolates from Candidatus Caldatribacterium Californiense OP9-Cscg (Hot-springs).

ligands Name

Vina score

Cavity size

center

size

X

Y

Z

X

Y

Z

4- amidino phenyl methanesulfonyl fluoride

-7.2

88

50

55

65

19

19

19

leucine

-5.3

81

56

66

78

17

17

17

methionine

-4.1

81

56

66

78

17

17

17

N-Carbobenzyloxy-L-Alanine

-7.8

282

59

52

49

21

21

21

rivastigmine

-6.9

282

59

52

49

20

20

20

 


Molecular docking:

Molecular docking is a computational technique in drug design used to predict the binding mode of small drug-like molecules within the active site of an enzyme or cell surface receptor binding pocket. This can be done by placing the molecules in a "docking station," which simulates the enzyme's active site. Through the use of molecular modeling, we can view the protein active site with ligands in a variety of different orientations and conformations. In addition, it enables us to rank the various ligands based on their binding affinity, which is graded according to the ideal binding geometries and energies of the individual molecules. This ranking can then determine which ligands are the most effective. 67, 68 In the current investigation, all docking tests were carried out using the CB-Dock server, discover studio. The docked conformation of 4- Amidino Phenyl Methanesulfonyl Fluoride, Leucine, Methionine, N-Carbobenzyloxy-L-Alanine and Rivastigmine (ligands) with the active conformation of Rho (receptor) isolated hot-springs bacteria (please see figure 11a, 11b, 11c, 11d and 11e) revealed that numerous potential interactions were present. The vina score of ligands compared after interaction with studied Rho for Hot Spring bacteria were present in (Table 1).

 

Evaluation of extremophiles Rho- ligand interaction:

2D interaction of 4- Amidino Phenyl Methanesulfonyl Fluoride with Rho was isolated from hot-spring bacteria (please see figure 11.a). It showed (Gly) 17, TYR 31, and ILE number 220 forming conventional hydrogen bonds represented in dark green dotted lines. In addition, TYR 31, ILE number 220 formed Pi-Pi stacked and Pi-sigma bonds respectively represented in dark pink color and purple color, VAL 148 and PRO number 43 formed Pi-Alkyl bond, GLY 42 formed Pi-sulfur bond represented in dark yellow color. Also, van der Waals non-bonded interactions in light green with a vina score (-7.2). The 2D interaction of leucine with Rho was isolated from hot springs (see figure 11b). ARG 212, LYS 274, MET 157, LEU 256 forming conventional hydrogen bonds represented in dark green dotted lines. On the other hand, GLU 2 forms Attractive charge bonds represented in orange dotted lines. Also, LEU 256, 278, and ARG 260 Alkyl bonds are represented in pink dotted lines. Van der Waals non-bonded interactions in light green with a vina score (-5.3). It shows interaction of methionine interaction with Rho was isolated from hot springs (see figure 11.c). It shows the ARG 260 forming conventional hydrogen bonds represented in dark green dotted lines. On the other hand, ARG 260 and LEU (278, 259) form alkyl bonds represented in pink dotted lines. Also, van der Waals's non-bonded interactions are in light green with a vina score (-4.1). The interaction of N-Carbobenzyloxy-L-Alanine with Rho was isolated from hot springs (see figure 11d). It shows SER 50 forming conventional hydrogen bonds represented in dark green dotted lines. In addition, ILE number 220 forms Pi-sigma bonds represented in purple color, TYR 31 forming Pi-Pi stacked represented dark pink color, VAL 148 and LEU 223 forming Pi-Alkyl bond represented pink color, GLY 42 forming Pi-sulfur bond represented in dark yellow color. Also, van der Waals non-bonded interactions in light green with a vina score (-7.8). The interaction of Rivastigmine with Rho isolated from hot springs (see figure 11e) revealed five Carbon Hydrogen bonds depicted by light green dotted lines: VAL 148, GLN (219, 216), TYR 31, and GLN 151. In addition, ILE number 220 forms Pi-sigma bonds depicted in purple, TYR 31 forms Pi-Pi stacking depicted in dark pink, VAL 51, 148, and PRO 43 form Alkyl and Pi-Alkyl links, respectively, depicted in pink, and GLY 42 creates Pi-sulfur bond depicted in dark yellow. In addition, van der Waals non-bonded interactions are represented in light green with a vina score (-6.9). The aminopeptidase activity was determined by docking the amino acids like Leucine and Methionine. Unfavorable bonds affect the activity stability of the ligand. The formation of any unfavorable bond between/in the protein-ligand complex reduces the stability of the complex as these types of bonds indicate a force of repulsion between 2 molecules and an atom. 69 The Leucine interaction with Rho isolated from a Hot spring form a stable complex structure by forming four conventional hydrogen bonds. Furthermore, the Rho inhibitors such as Rivastigmine, N-carbobenzyloxy-L-alanine, and 4-amidino phenyl methane sulfonyl fluoride 70 were docked by CB-Dock to understand the proteolytic activity of the predicted Rho model from Hot spring bacteria. The docking interactions between the binding site residues and the compounds with their respective docking (vina) scores were given, also visualized in discovery studio, to compare and validate the docking results. The docking output was the free binding energy with which the ligand binds to the pocket of the receptor protein. The highest scoring ligand (4- amidino phenyl methane sulfonyl fluoride, N-Carbobenzyloxy-L-Alanine, and Rivastigmine) gave a better binding affinity score of -7.2, -7.8, and -6.9, respectively, moreover, three ligands demonstrated pi-sulfur bond with GLY 42 and Pi-stacked with TYR 31 for Rho isolated from Hot spring isolate (see Table 1). The docking studies showed that critical amino acid residues in Rho isolated from Hot spring bacteria were VAL (148), ILE (220), GLY (42), and TYR (31). Rho isolated from Hot spring bacteria  demonstrated conventional hydrogen bond, Carbon Hydrogen bond, Pi-sulfur with inhibitor ligands. Probably these structures are the most stable interaction energy.

 

CONCLUSION:

In conclusion, the rhomboid gene sequence showed a lot of similarities between the hot spring isolates. It is not clear how this protein has changed at high and low temperatures, but the fact that this family is found in almost all bacteria, archaea, and eukaryotes suggests that this protein is part of the last common ancestor. Phylogenetic tree analysis studies indicate a likely bacterial or archaeal origin. Different amino acids have different functions, and the presence of active domains determines the protein's function. By looking at the isolates' amino acids and active domains, we found that their structure and function are very similar to those of other living things. In fact, the main structure and functional sites of rhomboids stay the same in almost all living things. Good selection may have retained the protein's proteolytic activity. Rhomboid proteases are released by many bacteria, but their biological activity is unknown due to the lack of endogenous substrates. In vitro, various bacterial rhomboids have been shown to process eukaryotic rhomboids using a typical enzyme mechanism. Rhomboid intramembrane proteases are gaining popularity in infectious diseases and cancer. Rhomboid intramembrane proteases rapidly link to Parkinson's disease, cancer, type II diabetes, and bacterial and malaria infections. Recent discoveries about rhomboids' structure and enzyme activity, as well as efforts to identify inhibitors, are shedding light on their potential as medicinal targets.

 

CONFLICT OF INTEREST:

The authors have no conflicts of interest regarding this investigation.

 

ACKNOWLEDGMENTS:

I like to extend my sincere appreciation to the esteemed faculty members and academic advisors who have offered invaluable assistance, constructive critique, and unwavering support throughout the entirety of my academic endeavors. The individual's proficiency, sagacity, and commitment to instruction and guidance have played a pivotal role in molding my aptitude for research and academic endeavors.

 

REFERENCE:

1.      Yin W, Wang Y, Liu L, He J. Biofilms: the microbial “protective clothing” in extreme environments. International Journal of Molecular Sciences. 2019; 20(14): 3423.

2.      Devi R, Kaur T, Kour D, Rana KL, Yadav A, Yadav AN. Beneficial fungal communities from different habitats and their roles in plant growth promotion and soil health. Microbial Biosystems. 2020; 5(1): 21-47.

3.      Gizaw B. Potential Microbial Ecologyin Ethiopia and Ex-situ Conservation Effort. 2018.

4.      Waugh DW. Fluid Dynamics of Polar Vortices on Earth, Mars, and Titan. Annual Review of Fluid Mechanics. 2022; 55.

5.      Erfurt P. The Geology of Hot Springs. The Geoheritage of Hot Springs: Springer; 2021. p. 17-50.

6.      Zabulica M. Modelling and Genetic Correction of Liver Genetic Diseases: Karolinska Institutet (Sweden); 2021.

7.      Arbige MV, Shetty JK, Chotani GK. Industrial enzymology: the next chapter. Trends in biotechnology. 2019; 37(12): 1355-66.

8.      Prayogo FA, Budiharjo A, Kusumaningrum HP, Wijanarka W, Suprihadi A, Nurhayati N. Metagenomic applications in exploration and development of novel enzymes from nature: a review. Journal of Genetic Engineering and Biotechnology. 2020; 18(1): 1-10.

9.      Chong J, Liu P, Zhou G, Xia J. Using MicrobiomeAnalyst for comprehensive statistical, functional, and meta-analysis of microbiome data. Nature Protocols. 2020; 15(3): 799-821.

10.   Lear G, Dickie I, Banks J, Boyer S, Buckley HL, Buckley TR, et al. Methods for the extraction, storage, amplification and sequencing of DNA from environmental samples. New Zealand Journal of Ecology. 2018; 42(1): 10-50A.

11.   Cruz-Casas DE, Aguilar CN, Ascacio-Valdés JA, Rodríguez-Herrera R, Chávez-González ML, Flores-Gallegos AC. Enzymatic hydrolysis and microbial fermentation: The most favorable biotechnological methods for the release of bioactive peptides. Food Chemistry: Molecular Sciences. 2021; 3: 100047.

12.   Naveed M, Nadeem F, Mehmood T, Bilal M, Anwar Z, Amjad F. Protease—a versatile and ecofriendly biocatalyst with multi-industrial applications: an updated review. Catalysis Letters. 2021; 151(2): 307-23.

13.   Matkawala F, Nighojkar S, Kumar A, Nighojkar A. Microbial alkaline serine proteases: Production, properties and applications. World Journal of Microbiology and Biotechnology. 2021; 37(4): 1-12.

14.   Began J. Rhomboid family intramembrane proteases in prokaryotes: mechanism, substrate repertoires and biological functions in the Gram-positive bacterium Bacillus subtilis. 2020.

15.   Almeida Hernandez Y. Integrative analysis of the dynamics of rhomboid protease GlpG from Escherichia coli: Staats-und Universitätsbibliothek Hamburg Carl von Ossietzky; 2019.

16.   Chen I-MA, Chu K, Palaniappan K, Pillay M, Ratner A, Huang J, et al. IMG/M v. 5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes. 2019;47(D1):D666-D77.

17.   Li W, Fu L, Niu B, Wu S, Wooley JJBib. Ultrafast clustering algorithms for metagenomic sequence analysis. 2012; 13(6): 656-68.

18.   Letunic I, Bork PJNar. Interactive Tree Of Life v2: online annotation and display of phylogenetic trees made easy. 2011; 39(suppl_2):W475-W8.

19.   Chowdhury B, Garai GJG. A review on multiple sequence alignment from the perspective of genetic algorithm. 2017; 109(5-6): 419-31.

20.   Chatzou M, Magis C, Chang J-M, Kemena C, Bussotti G, Erb I, et al. Multiple sequence alignment modeling: methods and applications. 2016; 17(6): 1009-23.

21.   Garg VK, Avashthi H, Tiwari A, Jain PA, Ramkete PW, Kayastha AM, et al. MFPPI–multi FASTA ProtParam interface. 2016; 12(2): 74.

22.   Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, et al. InterProScan: protein domains identifier. 2005; 33(suppl_2): W116-W20.

23.   Liu Y, Li J, Yang J, Zeng H, Guo L, Ren S, et al. Emergence of Different Recombinant Porcine Reproductive and Respiratory Syndrome Viruses, China. Scientific Reports. 2018; 8(1): 4118.

24.   Cueno ME, Ueno M, Iguchi R, Harada T, Miki Y, Yasumaru K, et al. Insights on the Structural Variations of the Furin-Like Cleavage Site Found Among the December 2019–July 2020 SARS-CoV-2 Spike Glycoprotein: A Computational Study Linking Viral Evolution and Infection. Frontiers in Medicine. 2021; 8: 240.

25.   Xu D, Zhang Y. Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization. Biophysical Journal. 2011; 101(10): 2525-34.

26.   Volkamer A, Kuhn D, Rippmann F, Rarey M. DoGSiteScorer: a web server for automatic binding site prediction, analysis and druggability assessment. Bioinformatics. 2012; 28(15): 2074-5.

27.   Weininger D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. Journal of Chemical Information and Computer Sciences. 1988; 28(1): 31-6.

28.   Khatik GL. Identification of Possible Molecular Targets of Potential Anti-Alzheimer Drugs by Predicting their Binding Affinities Using Molecular Docking Technique. International Journal of Green Pharmacy (IJGP). 2018; 12(02).

29.   Avupati VR, Kurre PN, Bagadi SR, Muthyala MK, Yejella RP. De novo based ligand generation and docking studies of PPARδ agonists: Correlations between predicted biological activity vs. biopharmaceutical descriptors. Chem-Bio Informatics Journal. 2010; 10: 74-86.

30.   Rowaiye AB, Olubiyi J, Bur D, Uzochukwu IC, Akpa A, Esimone CO. In silico screening and molecular dynamic simulation studies of potential small molecule immuno-modulators of the KIR2DS2 receptor. Journal of Phytomedicine and Therapeutics. 2021; 20(1): 542–67-–67.

31.   Ohanyan S, Grabski H, Rshtuni L, Tiratsuyan S, Hovhannisyan A, editors. Improvement of the Antibacterial Activity of Benzylpenicillin in combination with Green Silver Nanoparticles against Staphylococcus aureus. International Conference on Nanotechnologies and Biomedical Engineering; 2019: Springer.

32.   Liu Y, Grimm M, Dai W-t, Hou M-c, Xiao Z-X, Cao Y. CB-Dock: a web server for cavity detection-guided protein–ligand blind docking. Acta Pharmacologica Sinica. 2020; 41(1): 138-44.

33.   Dharmasiri B, Senanayake G, Perera P, Halmillawewa A, Attanayake R, editors. Evaluation of Lignin Degrading Ability and In-silico Analysis and Molecular Docking of Laccase of Schizophyllum commune with Lignin. Proceedings of International Forestry and Environment Symposium; 2022.

34.   Sharma S, Sharma A, Gupta U. Molecular Docking studies on the Anti-fungal activity of Allium sativum (Garlic) against Mucormycosis (black fungus) by BIOVIA discovery studio visualizer 21.1. 0.0. Annals of Antivirals and Antiretrovirals. 2021; 5(1): 028-32.

35.   Qingyu C, Yu W, Yang L, Zobel J, Verspoor K, editors. Evaluation of CD-HIT for constructing non-redundant databases. 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); 2016; 15-18 Dec. 2016.

36.   Koonin EV, Makarova KS, Rogozin IB, Davidovic L, Letellier M-C, Pellegrini LJGb. The rhomboids: a nearly ubiquitous family of intramembrane serine proteases that probably evolved by multiple Ancient Horizontal gene Transfers. 2003; 4(3): 1-12.

37.   Kumar S, Dangi AK, Shukla P, Baishya D, Khare SKJBt. Thermozymes: Adaptive strategies and tools for their Biotechnological Applications. 2019; 278: 372-82.

38.   Hait S, Mallik S, Basu S, Kundu SJPS, Function,, Bioinformatics. Finding the generalized molecular principles of protein thermal stability. 2020; 88(6): 788-808.

39.   Raymond-Bouchard I, Goordial J, Zolotarov Y, Ronholm J, Stromvik M, Bakermans C, et al. Conserved genomic and amino acid traits of cold adaptation in subzero-growing Arctic permafrost bacteria. FEMS Microbiology Ecology. 2018; 94(4).

40.   Xie B-B, Bian F, Chen X-L, He H-L, Guo J, Gao X, et al. Cold adaptation of zinc metalloproteases in the thermolysin family from deep sea and arctic sea ice bacteria revealed by catalytic and structural properties and molecular dynamics new insights into relationship between conformational flexibility and hydrogen Bonding. 2009; 284(14): 9257-69.

41.   Betts MJ, Russell RBJBfg. Amino acid properties and consequences of substitutions. 2003; 317: 289.

42.   Ingle RAJTABASoPB. Histidine Biosynthesis. 2011;9.

43.   García-Lorenzo M, Sjödin A, Jansson S, Funk C. Protease gene families in Populus and Arabidopsis. BMC Plant Biology. 2006; 6(1): 30.

44.   Wang Y, Zhang Y, Ha YJN. Crystal structure of a rhomboid family intramembrane Protease. 2006; 444(7116): 179-80.

45.   Marx J, Collins T, D’Amico S, Feller G, Gerday CJMb. Cold-adapted enzymes from marine Antarctic Microorganisms. 2007; 9(3): 293-304.

46.   Jongruja N, You DJ, Angkawidjaja C, Kanaya E, Koga Y, Kanaya S. Structure and characterization of RNase H3 from Aquifex aeolicus. The FEBS Journal. 2012; 279(15): 2737-53.

47.   Siliakus MF, van der Oost J, Kengen SW. Adaptations of archaeal and bacterial membranes to variations in temperature, pH and pressure. Extremophiles. 2017; 21(4): 651-70.

48.   D'Amico S, Collins T, Marx JC, Feller G, Gerday C, Gerday CJEr. Psychrophilic microorganisms: challenges for life. 2006; 7(4): 385-9.

49.   Ratkowsky DA, Olley J, Ross TJJotb. Unifying temperature effects on the growth rate of bacteria and the stability of globular proteins. 2005; 233(3): 351-62.

50.   Tribelli PM, López NIJL. Reporting key features in cold-adapted bacteria. 2018; 8(1): 8.

51.   Xu D, Zhang J, Roy A, Zhang Y. Automated protein structure modeling in CASP9 by I‐TASSER pipeline combined with QUARK‐based ab initio folding and FG‐MD‐based structure refinement. Proteins: Structure, Function, and Bioinformatics. 2011; 79(S10): 147-60.

52.   Yang J, Zhang Y. I-TASSER server: new development for protein structure and function predictions. Nucleic Acids Research. 2015; 43(W1): W174-W81.

53.   Iqrar U, Javaid H, Ashraf N, Ahmad A, Latief N, Shahid AA, et al. Structural and functional analysis of pullulanase type 1 (PulA) from Geobacillus thermopakistaniensis. Molecular Biotechnology. 2020; 62(8): 370-9.

54.   Li Y, Zhang C, Zheng W, Zhou X, Bell EW, Yu DJ, et al. Protein inter‐residue contact and distance prediction by coupling complementary coevolution features with deep residual networks in CASP14. Proteins: Structure, Function, and Bioinformatics. 2021; 89(12): 1911-21.

55.   Sharma P, Joshi T, Joshi T, Mathpal S, Maiti P, Nand M, et al. In silico screening of natural compounds to inhibit interaction of human ACE2 receptor and spike protein of SARS-CoV-2 for the prevention of COVID-19. Journal of Biomolecular Structure and Dynamics. 2021: 1-13.

56.   Rudden LS, Degiacomi MT. Protein docking using a single representation for protein surface, electrostatics, and local dynamics. Journal of Chemical Theory and Computation. 2019; 15(9): 5135-43.

57.   Aretz J, Wamhoff E-C, Hanske J, Heymann D, Rademacher C. Computational and experimental prediction of human C-type lectin receptor druggability. Frontiers in Immunology. 2014; 5: 323.

58.   Lima AN, Philot EA, Trossini GHG, Scott LPB, Maltarollo VG, Honorio KM. Use of machine learning approaches for novel drug discovery. Expert Opinion on Drug Discovery. 2016; 11(3): 225-39.

59.   Volkamer A, Eid S, Turk S, Jaeger S, Rippmann F, Fulle S. Pocketome of human kinases: prioritizing the ATP binding sites of (yet) untapped protein kinases for drug discovery. Journal of Chemical Information and Modeling. 2015; 55(3): 538-49.

60.   Mirzaei H, Zarbafian S, Villar E, Mottarella S, Beglov D, Vajda S, et al. Energy Minimization on Manifolds for Docking Flexible Molecules. J Chem Theory Comput. 2015; 11(3): 1063-76.

61.   Cournia Z, Allen BK, Beuming T, Pearlman DA, Radak BK, Sherman W. Rigorous free energy simulations in virtual screening. Journal of Chemical Information and Modeling. 2020; 60(9): 4153-69.

62.   Drexler KE. Molecular machinery and manufacturing with applications to computation: Massachusetts Institute of Technology; 1991.

63.   Harper PM, Gani R, Kolar P, Ishikawa T. Computer-aided molecular design with combined molecular modeling and group contribution. Fluid Phase Equilibria. 1999; 158: 337-47.

64.   Chandra A, Ghate MV, Aithal K, Lewis SA. In silico prediction coupled with in vitro experiments and absorption modeling to study the inclusion complex of telmisartan with modified beta-cyclodextrin. Journal of Inclusion Phenomena and Macrocyclic Chemistry. 2018; 91(1): 47-60.

65.   Rowaiye AB, Oni S, Uzochukwu IC, Akpa A, Esimone CO. The Structure-Based Virtual Screening for Natural Compounds that Bind with the Activating Receptors of Natural Killer Cells. bioRxiv. 2020:2020.06.19.160861.

66.   Claranciaa PS, Naveenarania M, Valarmathia R, Sureshaa G, Hemaprabhaa G, Rama B, et al. Isolation, characterization and expression analysis of novel water deficit stress-responsive DEEPER ROOTING 1 (DRO1) gene from drought-tolerant Erianthus arundinaceus. Journal of Sugarcane Research. 2020; 10(1): 1-11.

67.   Torres PH, Sodero AC, Jofily P, Silva-Jr FP. Key topics in molecular docking for drug design. International Journal of Molecular Sciences. 2019; 20(18): 4574.

68.   Sepay N, Saha PC, Shahzadi Z, Chakraborty A, Halder UC. A crystallography-based investigation of weak interactions for drug design against COVID-19. Physical Chemistry Chemical Physics. 2021; 23(12): 7261-70.

69.   Dhorajiwala TM, Halder ST, Samant L. Comparative in silico molecular docking analysis of l-threonine-3-dehydrogenase, a protein target against African trypanosomiasis using selected phytochemicals. Journal of Applied Biotechnology Reports. 2019; 6(3): 101-8.

70.   Kandasamy S, Duraisamy S, Chinnappan S, Balakrishnan S, Thangasamy S, Muthusamy G, et al. Molecular modeling and docking of protease from Bacillus sp. for the keratin degradation. Biocatalysis and Agricultural Biotechnology. 2018; 13: 95-104.

 

 

 

Received on 24.04.2024      Revised on 30.07.2024

Accepted on 06.10.2024      Published on 24.12.2024

Available online from December 27, 2024

Research J. Pharmacy and Technology. 2024;17(12):5890-5900.

DOI: 10.52711/0974-360X.2024.00894

© RJPT All right reserved