Activity of diverse chalcones against several targets: statistical analysis of a high-throughput virtual screen of a custom chalcone library
- Authors: Sarron, Arthur F D
- Date: 2020
- Subjects: Acetophenone , Benzaldehyde , Ketones , Pyruvate kinase , Drug development , Aromatic compounds , Heat shock proteins
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116028 , vital:34291
- Description: Chalcone family molecules are well known to have therapeutic proprieties (anti-inflammatory, anti-microbial or anti-cancer, etc). However the mechanism of action in some cases is not well known. A virtual library of this family of compounds was constructed using custom scripts, based on the aldol condensation, and this library was modified further to analogues by expansion of the α,β-unsaturated ketone linker. Acetophenone and benzaldehyde derivatives which are available and purchasable were used as a base to design the chalcone virtual library. 8063 chalcones were constructed and geometrically optimized with Gaussian 09. Their physicochemical characteristics linked to the Lipinski rules were analyzed with Knime and CDK. The entire library was after docked against several targets including HIV-1 integrase, MRSA pyruvate kinase, HSP90, COX-1, COX-2, ALR2, MAOA, MAOB, acetylcholinesterase, butyrylcholinesterase and PLA2. With the exception of MAOA, which does not have a crystal structure ligand, all dockings were validated by redocking the original ligand provided by the literature. These targets are known in the literature to be inhibited by chalcone-derivatives. However, specificity of the particular known chalcone inhibitors to the particular targets is not known. To this end the performance of the generated chalcone library against the list of targets was of interest. The binding energy of ligand-protein complexes was generally good across the library. Statistical analysis including principal component analysis and hierarchical clustering analysis were made in order to investigate for any physical/chemical characteristics which might explain what chalcone features affect the binding energy of the ligand-protein complexes. The spherical polar coordinates defining the orientation of the binding poses were also calculated and used in the statistical analysis. The statistical analysis has allowed us to hypothesize the importance of these radial distances and the polar angles of key atoms in the chalcones in binding to the pyruvate kinase crystal structure. This was validated by the docking of another small library of compound models in which the α,β-unsaturated ketone chain of the chalcone was replaced by incrementally longer conjugated chains. Further studies on the chalcones themselves reveal rotameric systems in both cis and trans-configurations (which may impact binding), and also studied was the effect of Topliss-based modification and its impact of binding to HSP90. Molecular dynamics confirmed good binding of identified chalcone hits.
- Full Text:
- Date Issued: 2020
- Authors: Sarron, Arthur F D
- Date: 2020
- Subjects: Acetophenone , Benzaldehyde , Ketones , Pyruvate kinase , Drug development , Aromatic compounds , Heat shock proteins
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116028 , vital:34291
- Description: Chalcone family molecules are well known to have therapeutic proprieties (anti-inflammatory, anti-microbial or anti-cancer, etc). However the mechanism of action in some cases is not well known. A virtual library of this family of compounds was constructed using custom scripts, based on the aldol condensation, and this library was modified further to analogues by expansion of the α,β-unsaturated ketone linker. Acetophenone and benzaldehyde derivatives which are available and purchasable were used as a base to design the chalcone virtual library. 8063 chalcones were constructed and geometrically optimized with Gaussian 09. Their physicochemical characteristics linked to the Lipinski rules were analyzed with Knime and CDK. The entire library was after docked against several targets including HIV-1 integrase, MRSA pyruvate kinase, HSP90, COX-1, COX-2, ALR2, MAOA, MAOB, acetylcholinesterase, butyrylcholinesterase and PLA2. With the exception of MAOA, which does not have a crystal structure ligand, all dockings were validated by redocking the original ligand provided by the literature. These targets are known in the literature to be inhibited by chalcone-derivatives. However, specificity of the particular known chalcone inhibitors to the particular targets is not known. To this end the performance of the generated chalcone library against the list of targets was of interest. The binding energy of ligand-protein complexes was generally good across the library. Statistical analysis including principal component analysis and hierarchical clustering analysis were made in order to investigate for any physical/chemical characteristics which might explain what chalcone features affect the binding energy of the ligand-protein complexes. The spherical polar coordinates defining the orientation of the binding poses were also calculated and used in the statistical analysis. The statistical analysis has allowed us to hypothesize the importance of these radial distances and the polar angles of key atoms in the chalcones in binding to the pyruvate kinase crystal structure. This was validated by the docking of another small library of compound models in which the α,β-unsaturated ketone chain of the chalcone was replaced by incrementally longer conjugated chains. Further studies on the chalcones themselves reveal rotameric systems in both cis and trans-configurations (which may impact binding), and also studied was the effect of Topliss-based modification and its impact of binding to HSP90. Molecular dynamics confirmed good binding of identified chalcone hits.
- Full Text:
- Date Issued: 2020
Application of computational methods in elucidating the isomerization step in the biosynthesis of coumarins
- Authors: Tshiwawa, Tendamudzimu
- Date: 2019
- Subjects: Coumarins , Isomerization , Biosynthesis , Organic compounds -- Synthesis , Cinnamic acid
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/67646 , vital:29124
- Description: The identity of the enzyme(s) responsible for the biosynthetic transformation of cinnamic acid derivatives to important, naturally occurring coumarins has yet to be established. This study constitutes a high-level theoretical analysis of the possibility that a recently reported molecular mechanism of the synthesis of coumarins from Baylis-Hillman adducts, may provide a viable model for three critical phases in the biosynthetic pathway Particular attention has been given to the first of these phases: i) E→Z isomerisation of the cinnamic acid precursor; ii) Cyclisation (lactonisation) to the hemi-acetal intermediate; and ii) Dehydration to afford the coumarin derivative. In order to accomplish this analysis, an enzyme capable, theoretically, of effecting this E→Z isomerisation required identification, and its potential involvement in the transformation mechanism explored. Combined Molecular Mechanics and high-level Quantum Mechanical/DFT calculations were used to access complementary models of appropriate complexes and relevant processes within the enzyme active sites of a range of eleven Chalcone Isomerase (CHI) enzyme candidates, the structures of which were downloaded from the Protein Data Bank. Detailed B3LYP/6-31+G(d,p) calculations have provided pictures of the relative populations of conformations within the ensemble of conformations available at normal temperatures. Conformations of several protonation states of cinnamic acid derivatives have been studied in this way, and the results obtained showed that coupled protonation and deprotonation of (E)-o-coumaric acid provides a viable approach to achieve the E→Z isomerization. In silico docking of the B3LYP/6-31+G(d,p) optimized (E)-o-coumaric acid derivatives in the active sites of each of the candidate CHI enzymes (CHI) revealed that (E)-o-coumaric acid fits well within the active sites of Medicago Sativa CHI crystallographic structures with 1FM8 showing best potential for not only accommodating (E)-o-coumaric acid , but also providing appropriate protein active site residues to effect the simultaneous protonation and deprotonation of the substrate , two residues being optimally placed to facilitate these critical processes. Further exploration of the chemical properties and qualities of selected CHI enzymes, undertaken using High Throughput Virtual Screening (HTVS), confirmed 1FM8 as a viable choice for further studies of the enzyme-catalysed E→Z isomerization of (E)-o-coumaric acid. A molecular dynamics study, performed to further evaluate the evolution of (E)-o-coumaric acid in the CHI active site over time, showed that the ligand in the 1FM8 active site is not only stable, but also that the desired protein-ligand interactions persist throughout the simulation period to facilitate the E→Z isomerization. An integrated molecular orbital and molecular mechanics (ONIOM) study of the 1FM8-(E)-o-coumaric acid complex, involving the direct protonation and deprotonation of the ligand by protein residues; has provided a plausible mechanism for the E → Z isomerization of (E)-o-coumaric acid within the 1FM8 active site; a transition state complex (with an activation energy of ca. 50 kCal.mol-1) has been located and its connection with both the (E)- and (Z)-o-coumaric acid isomer has been confirmed by Intrinsic Reaction Coordinate (IRC) calculations. More realistic models of the 1FM8-(E)-o-coumaric acid complex, with the inclusion of water solvent molecules, have been obtained at both the QM/MM and adaptive QM/MM levels which simulate the dynamic active site at the QM level. The results indicate that the simultaneous protonation and deprotonation of (E)-o-coumaric acid within the CHI enzyme is a water-mediated process – a conclusion consistent with similar reported processes. Visual inspection of the 1FM8-(Z)-o-coumaric acid complex reveals both the necessary orientation of the phenolic and carboxylic acid moieties of the (Z)-o-coumaric acid and the presence of appropriate, proximal active site residues with the potential to permit catalysis of the subsequent lactonisation and dehydration steps required to generate coumarin.
- Full Text:
- Date Issued: 2019
- Authors: Tshiwawa, Tendamudzimu
- Date: 2019
- Subjects: Coumarins , Isomerization , Biosynthesis , Organic compounds -- Synthesis , Cinnamic acid
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/67646 , vital:29124
- Description: The identity of the enzyme(s) responsible for the biosynthetic transformation of cinnamic acid derivatives to important, naturally occurring coumarins has yet to be established. This study constitutes a high-level theoretical analysis of the possibility that a recently reported molecular mechanism of the synthesis of coumarins from Baylis-Hillman adducts, may provide a viable model for three critical phases in the biosynthetic pathway Particular attention has been given to the first of these phases: i) E→Z isomerisation of the cinnamic acid precursor; ii) Cyclisation (lactonisation) to the hemi-acetal intermediate; and ii) Dehydration to afford the coumarin derivative. In order to accomplish this analysis, an enzyme capable, theoretically, of effecting this E→Z isomerisation required identification, and its potential involvement in the transformation mechanism explored. Combined Molecular Mechanics and high-level Quantum Mechanical/DFT calculations were used to access complementary models of appropriate complexes and relevant processes within the enzyme active sites of a range of eleven Chalcone Isomerase (CHI) enzyme candidates, the structures of which were downloaded from the Protein Data Bank. Detailed B3LYP/6-31+G(d,p) calculations have provided pictures of the relative populations of conformations within the ensemble of conformations available at normal temperatures. Conformations of several protonation states of cinnamic acid derivatives have been studied in this way, and the results obtained showed that coupled protonation and deprotonation of (E)-o-coumaric acid provides a viable approach to achieve the E→Z isomerization. In silico docking of the B3LYP/6-31+G(d,p) optimized (E)-o-coumaric acid derivatives in the active sites of each of the candidate CHI enzymes (CHI) revealed that (E)-o-coumaric acid fits well within the active sites of Medicago Sativa CHI crystallographic structures with 1FM8 showing best potential for not only accommodating (E)-o-coumaric acid , but also providing appropriate protein active site residues to effect the simultaneous protonation and deprotonation of the substrate , two residues being optimally placed to facilitate these critical processes. Further exploration of the chemical properties and qualities of selected CHI enzymes, undertaken using High Throughput Virtual Screening (HTVS), confirmed 1FM8 as a viable choice for further studies of the enzyme-catalysed E→Z isomerization of (E)-o-coumaric acid. A molecular dynamics study, performed to further evaluate the evolution of (E)-o-coumaric acid in the CHI active site over time, showed that the ligand in the 1FM8 active site is not only stable, but also that the desired protein-ligand interactions persist throughout the simulation period to facilitate the E→Z isomerization. An integrated molecular orbital and molecular mechanics (ONIOM) study of the 1FM8-(E)-o-coumaric acid complex, involving the direct protonation and deprotonation of the ligand by protein residues; has provided a plausible mechanism for the E → Z isomerization of (E)-o-coumaric acid within the 1FM8 active site; a transition state complex (with an activation energy of ca. 50 kCal.mol-1) has been located and its connection with both the (E)- and (Z)-o-coumaric acid isomer has been confirmed by Intrinsic Reaction Coordinate (IRC) calculations. More realistic models of the 1FM8-(E)-o-coumaric acid complex, with the inclusion of water solvent molecules, have been obtained at both the QM/MM and adaptive QM/MM levels which simulate the dynamic active site at the QM level. The results indicate that the simultaneous protonation and deprotonation of (E)-o-coumaric acid within the CHI enzyme is a water-mediated process – a conclusion consistent with similar reported processes. Visual inspection of the 1FM8-(Z)-o-coumaric acid complex reveals both the necessary orientation of the phenolic and carboxylic acid moieties of the (Z)-o-coumaric acid and the presence of appropriate, proximal active site residues with the potential to permit catalysis of the subsequent lactonisation and dehydration steps required to generate coumarin.
- Full Text:
- Date Issued: 2019
Enumeration, conformation sampling and population of libraries of peptide macrocycles for the search of chemotherapeutic cardioprotection agents
- Authors: Sigauke, Lester Takunda
- Date: 2019
- Subjects: Peptides -- Synthesis , Macrocyclic compounds , Drug development , Drug discovery , Cardiovascular system -- Diseases -- Prevention , Proteins -- Synthesis
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116056 , vital:34293
- Description: Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is non‐trivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an in‐silico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardio‐protective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilin‐D perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardio‐protective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilin‐D showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilin‐D target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used.
- Full Text:
- Date Issued: 2019
- Authors: Sigauke, Lester Takunda
- Date: 2019
- Subjects: Peptides -- Synthesis , Macrocyclic compounds , Drug development , Drug discovery , Cardiovascular system -- Diseases -- Prevention , Proteins -- Synthesis
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116056 , vital:34293
- Description: Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is non‐trivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an in‐silico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardio‐protective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilin‐D perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardio‐protective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilin‐D showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilin‐D target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used.
- Full Text:
- Date Issued: 2019
The investigation of type-specific features of the copper coordinating AA9 proteins and their effect on the interaction with crystalline cellulose using molecular dynamics studies
- Authors: Moses, Vuyani
- Date: 2018
- Subjects: Copper proteins , Cellulose , Molecular dynamics , Cellulose -- Biodegradation , Bioinformatics
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/58327 , vital:27230
- Description: AA9 proteins are metallo-enzymes which are crucial for the early stages of cellulose degradation. AA9 proteins have been suggested to cleave glycosidic bonds linking cellulose through the use of their Cu2+ coordinating active site. AA9 proteins possess different regioselectivities depending on the resulting cleavage they form and as result, are grouped accordingly. Type 1 AA9 proteins cleave the C1 carbon of cellulose while Type 2 AA9 proteins cleave the C4 carbon and Type 3 AA9 proteins cleave either C1 or C4 carbons. The steric congestion of the AA9 active site has been proposed to be a contributor to the observed regioselectivity. As such, a bioinformatics characterisation of type-specific sequence and structural features was performed. Initially AA9 protein sequences were obtained from the Pfam database and multiple sequence alignment was performed. The sequences were phylogenetically characterised and sequences were grouped into their respective types and sub-groups were identified. A selection analysis was performed on AA9 LPMO types to determine the selective pressure acting on AA9 protein residues. Motif discovery was then performed to identify conserved sequence motifs in AA9 proteins. Once type-specific sequence features were identified structural mapping was performed to assess possible effects on substrate interaction. Physicochemical property analysis was also performed to assess biochemical differences between AA9 LPMO types. Molecular dynamics (MD) simulations were then employed to dynamically assess the consequences of the discovered type-specific features on AA9-cellulose interaction. Due to the absence of AA9 specific force field parameters MD simulations were not readily applicable. As a result, Potential Energy Surface (PES) scans were performed to evaluate the force field parameters for the AA9 active site using the PM6 semi empirical approach and least squares fitting. A Type 1 AA9 active site was constructed from the crystal structure 4B5Q, encompassing only the Cu2+ coordinating residues, the Cu2+ ion and two water residues. Due to the similarity in AA9 active sites, the Type force field parameters were validated on all three AA9 LPMO types. Two MD simulations for each AA9 LPMO types were conducted using two separate Lennard-Jones parameter sets. Once completed, the MD trajectories were analysed for various features including the RMSD, RMSF, radius of gyration, coordination during simulation, hydrogen bonding, secondary structure conservation and overall protein movement. Force field parameters were successfully evaluated and validated for AA9 proteins. MD simulations of AA9 proteins were able to reveal the presence of unique type-specific binding modes of AA9 active sites to cellulose. These binding modes were characterised by the presence of unique type-specific loops which were present in Type 2 and 3 AA9 proteins but not in Type 1 AA9 proteins. The loops were found to result in steric congestion that affects how the Cu2+ ion interacts with cellulose. As a result, Cu2+ binding to cellulose was observed for Type 1 and not Type 2 and 3 AA9 proteins. In this study force field parameters have been evaluated for the Type 1 active site of AA9 proteins and this parameters were evaluated on all three types and binding. Future work will focus on identifying the nature of the reactive oxygen species and performing QM/MM calculations to elucidate the reactive mechanism of all three AA9 LPMO types.
- Full Text:
- Date Issued: 2018
- Authors: Moses, Vuyani
- Date: 2018
- Subjects: Copper proteins , Cellulose , Molecular dynamics , Cellulose -- Biodegradation , Bioinformatics
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/58327 , vital:27230
- Description: AA9 proteins are metallo-enzymes which are crucial for the early stages of cellulose degradation. AA9 proteins have been suggested to cleave glycosidic bonds linking cellulose through the use of their Cu2+ coordinating active site. AA9 proteins possess different regioselectivities depending on the resulting cleavage they form and as result, are grouped accordingly. Type 1 AA9 proteins cleave the C1 carbon of cellulose while Type 2 AA9 proteins cleave the C4 carbon and Type 3 AA9 proteins cleave either C1 or C4 carbons. The steric congestion of the AA9 active site has been proposed to be a contributor to the observed regioselectivity. As such, a bioinformatics characterisation of type-specific sequence and structural features was performed. Initially AA9 protein sequences were obtained from the Pfam database and multiple sequence alignment was performed. The sequences were phylogenetically characterised and sequences were grouped into their respective types and sub-groups were identified. A selection analysis was performed on AA9 LPMO types to determine the selective pressure acting on AA9 protein residues. Motif discovery was then performed to identify conserved sequence motifs in AA9 proteins. Once type-specific sequence features were identified structural mapping was performed to assess possible effects on substrate interaction. Physicochemical property analysis was also performed to assess biochemical differences between AA9 LPMO types. Molecular dynamics (MD) simulations were then employed to dynamically assess the consequences of the discovered type-specific features on AA9-cellulose interaction. Due to the absence of AA9 specific force field parameters MD simulations were not readily applicable. As a result, Potential Energy Surface (PES) scans were performed to evaluate the force field parameters for the AA9 active site using the PM6 semi empirical approach and least squares fitting. A Type 1 AA9 active site was constructed from the crystal structure 4B5Q, encompassing only the Cu2+ coordinating residues, the Cu2+ ion and two water residues. Due to the similarity in AA9 active sites, the Type force field parameters were validated on all three AA9 LPMO types. Two MD simulations for each AA9 LPMO types were conducted using two separate Lennard-Jones parameter sets. Once completed, the MD trajectories were analysed for various features including the RMSD, RMSF, radius of gyration, coordination during simulation, hydrogen bonding, secondary structure conservation and overall protein movement. Force field parameters were successfully evaluated and validated for AA9 proteins. MD simulations of AA9 proteins were able to reveal the presence of unique type-specific binding modes of AA9 active sites to cellulose. These binding modes were characterised by the presence of unique type-specific loops which were present in Type 2 and 3 AA9 proteins but not in Type 1 AA9 proteins. The loops were found to result in steric congestion that affects how the Cu2+ ion interacts with cellulose. As a result, Cu2+ binding to cellulose was observed for Type 1 and not Type 2 and 3 AA9 proteins. In this study force field parameters have been evaluated for the Type 1 active site of AA9 proteins and this parameters were evaluated on all three types and binding. Future work will focus on identifying the nature of the reactive oxygen species and performing QM/MM calculations to elucidate the reactive mechanism of all three AA9 LPMO types.
- Full Text:
- Date Issued: 2018
- «
- ‹
- 1
- ›
- »