Application of computational methods in elucidating the isomerization step in the biosynthesis of coumarins
- Authors: Tshiwawa, Tendamudzimu
- Date: 2019
- Subjects: Coumarins , Isomerization , Biosynthesis , Organic compounds -- Synthesis , Cinnamic acid
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/67646 , vital:29124
- Description: The identity of the enzyme(s) responsible for the biosynthetic transformation of cinnamic acid derivatives to important, naturally occurring coumarins has yet to be established. This study constitutes a high-level theoretical analysis of the possibility that a recently reported molecular mechanism of the synthesis of coumarins from Baylis-Hillman adducts, may provide a viable model for three critical phases in the biosynthetic pathway Particular attention has been given to the first of these phases: i) E→Z isomerisation of the cinnamic acid precursor; ii) Cyclisation (lactonisation) to the hemi-acetal intermediate; and ii) Dehydration to afford the coumarin derivative. In order to accomplish this analysis, an enzyme capable, theoretically, of effecting this E→Z isomerisation required identification, and its potential involvement in the transformation mechanism explored. Combined Molecular Mechanics and high-level Quantum Mechanical/DFT calculations were used to access complementary models of appropriate complexes and relevant processes within the enzyme active sites of a range of eleven Chalcone Isomerase (CHI) enzyme candidates, the structures of which were downloaded from the Protein Data Bank. Detailed B3LYP/6-31+G(d,p) calculations have provided pictures of the relative populations of conformations within the ensemble of conformations available at normal temperatures. Conformations of several protonation states of cinnamic acid derivatives have been studied in this way, and the results obtained showed that coupled protonation and deprotonation of (E)-o-coumaric acid provides a viable approach to achieve the E→Z isomerization. In silico docking of the B3LYP/6-31+G(d,p) optimized (E)-o-coumaric acid derivatives in the active sites of each of the candidate CHI enzymes (CHI) revealed that (E)-o-coumaric acid fits well within the active sites of Medicago Sativa CHI crystallographic structures with 1FM8 showing best potential for not only accommodating (E)-o-coumaric acid , but also providing appropriate protein active site residues to effect the simultaneous protonation and deprotonation of the substrate , two residues being optimally placed to facilitate these critical processes. Further exploration of the chemical properties and qualities of selected CHI enzymes, undertaken using High Throughput Virtual Screening (HTVS), confirmed 1FM8 as a viable choice for further studies of the enzyme-catalysed E→Z isomerization of (E)-o-coumaric acid. A molecular dynamics study, performed to further evaluate the evolution of (E)-o-coumaric acid in the CHI active site over time, showed that the ligand in the 1FM8 active site is not only stable, but also that the desired protein-ligand interactions persist throughout the simulation period to facilitate the E→Z isomerization. An integrated molecular orbital and molecular mechanics (ONIOM) study of the 1FM8-(E)-o-coumaric acid complex, involving the direct protonation and deprotonation of the ligand by protein residues; has provided a plausible mechanism for the E → Z isomerization of (E)-o-coumaric acid within the 1FM8 active site; a transition state complex (with an activation energy of ca. 50 kCal.mol-1) has been located and its connection with both the (E)- and (Z)-o-coumaric acid isomer has been confirmed by Intrinsic Reaction Coordinate (IRC) calculations. More realistic models of the 1FM8-(E)-o-coumaric acid complex, with the inclusion of water solvent molecules, have been obtained at both the QM/MM and adaptive QM/MM levels which simulate the dynamic active site at the QM level. The results indicate that the simultaneous protonation and deprotonation of (E)-o-coumaric acid within the CHI enzyme is a water-mediated process – a conclusion consistent with similar reported processes. Visual inspection of the 1FM8-(Z)-o-coumaric acid complex reveals both the necessary orientation of the phenolic and carboxylic acid moieties of the (Z)-o-coumaric acid and the presence of appropriate, proximal active site residues with the potential to permit catalysis of the subsequent lactonisation and dehydration steps required to generate coumarin.
- Full Text:
- Date Issued: 2019
- Authors: Tshiwawa, Tendamudzimu
- Date: 2019
- Subjects: Coumarins , Isomerization , Biosynthesis , Organic compounds -- Synthesis , Cinnamic acid
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/67646 , vital:29124
- Description: The identity of the enzyme(s) responsible for the biosynthetic transformation of cinnamic acid derivatives to important, naturally occurring coumarins has yet to be established. This study constitutes a high-level theoretical analysis of the possibility that a recently reported molecular mechanism of the synthesis of coumarins from Baylis-Hillman adducts, may provide a viable model for three critical phases in the biosynthetic pathway Particular attention has been given to the first of these phases: i) E→Z isomerisation of the cinnamic acid precursor; ii) Cyclisation (lactonisation) to the hemi-acetal intermediate; and ii) Dehydration to afford the coumarin derivative. In order to accomplish this analysis, an enzyme capable, theoretically, of effecting this E→Z isomerisation required identification, and its potential involvement in the transformation mechanism explored. Combined Molecular Mechanics and high-level Quantum Mechanical/DFT calculations were used to access complementary models of appropriate complexes and relevant processes within the enzyme active sites of a range of eleven Chalcone Isomerase (CHI) enzyme candidates, the structures of which were downloaded from the Protein Data Bank. Detailed B3LYP/6-31+G(d,p) calculations have provided pictures of the relative populations of conformations within the ensemble of conformations available at normal temperatures. Conformations of several protonation states of cinnamic acid derivatives have been studied in this way, and the results obtained showed that coupled protonation and deprotonation of (E)-o-coumaric acid provides a viable approach to achieve the E→Z isomerization. In silico docking of the B3LYP/6-31+G(d,p) optimized (E)-o-coumaric acid derivatives in the active sites of each of the candidate CHI enzymes (CHI) revealed that (E)-o-coumaric acid fits well within the active sites of Medicago Sativa CHI crystallographic structures with 1FM8 showing best potential for not only accommodating (E)-o-coumaric acid , but also providing appropriate protein active site residues to effect the simultaneous protonation and deprotonation of the substrate , two residues being optimally placed to facilitate these critical processes. Further exploration of the chemical properties and qualities of selected CHI enzymes, undertaken using High Throughput Virtual Screening (HTVS), confirmed 1FM8 as a viable choice for further studies of the enzyme-catalysed E→Z isomerization of (E)-o-coumaric acid. A molecular dynamics study, performed to further evaluate the evolution of (E)-o-coumaric acid in the CHI active site over time, showed that the ligand in the 1FM8 active site is not only stable, but also that the desired protein-ligand interactions persist throughout the simulation period to facilitate the E→Z isomerization. An integrated molecular orbital and molecular mechanics (ONIOM) study of the 1FM8-(E)-o-coumaric acid complex, involving the direct protonation and deprotonation of the ligand by protein residues; has provided a plausible mechanism for the E → Z isomerization of (E)-o-coumaric acid within the 1FM8 active site; a transition state complex (with an activation energy of ca. 50 kCal.mol-1) has been located and its connection with both the (E)- and (Z)-o-coumaric acid isomer has been confirmed by Intrinsic Reaction Coordinate (IRC) calculations. More realistic models of the 1FM8-(E)-o-coumaric acid complex, with the inclusion of water solvent molecules, have been obtained at both the QM/MM and adaptive QM/MM levels which simulate the dynamic active site at the QM level. The results indicate that the simultaneous protonation and deprotonation of (E)-o-coumaric acid within the CHI enzyme is a water-mediated process – a conclusion consistent with similar reported processes. Visual inspection of the 1FM8-(Z)-o-coumaric acid complex reveals both the necessary orientation of the phenolic and carboxylic acid moieties of the (Z)-o-coumaric acid and the presence of appropriate, proximal active site residues with the potential to permit catalysis of the subsequent lactonisation and dehydration steps required to generate coumarin.
- Full Text:
- Date Issued: 2019
Enumeration, conformation sampling and population of libraries of peptide macrocycles for the search of chemotherapeutic cardioprotection agents
- Authors: Sigauke, Lester Takunda
- Date: 2019
- Subjects: Peptides -- Synthesis , Macrocyclic compounds , Drug development , Drug discovery , Cardiovascular system -- Diseases -- Prevention , Proteins -- Synthesis
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116056 , vital:34293
- Description: Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is non‐trivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an in‐silico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardio‐protective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilin‐D perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardio‐protective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilin‐D showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilin‐D target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used.
- Full Text:
- Date Issued: 2019
- Authors: Sigauke, Lester Takunda
- Date: 2019
- Subjects: Peptides -- Synthesis , Macrocyclic compounds , Drug development , Drug discovery , Cardiovascular system -- Diseases -- Prevention , Proteins -- Synthesis
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116056 , vital:34293
- Description: Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is non‐trivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an in‐silico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardio‐protective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilin‐D perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardio‐protective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilin‐D showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilin‐D target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used.
- Full Text:
- Date Issued: 2019
- «
- ‹
- 1
- ›
- »