Prediction of mass spectra for natural products using an ab initio approach
- Authors: Novokoza, Yolanda
- Date: 2020
- Subjects: Molecular dynamics , Molecular dynamics -- Computer simulation , Mass spectroscopy , Electron impact ionization
- Language: English
- Type: text , Thesis , Masters , MSc
- Identifier: http://hdl.handle.net/10962/167166 , vital:41443
- Description: Mass spectrometry (MS) is a technique that measures the fragmentation of molecules, dependent on the molecule’s chemical composition and structure, by first introducing a charge on the molecules. The instrument records the mass to charge ratio, but the energy from the ionization process causes the molecule to fragment. The resultant mass spectrum is highly indicative of not only the molecule analyzed, but also its chemical composition. MS is used in research and industry for both routine and research purposes. One such way to ionize molecules for MS is by bombarding the molecule with electrons which is the basis of electron impact mass spectrometry (EIMS). Although EIMS is widely used, prediction of electron impact mass spectra from first principles is a challenging problem due to a need to accurately determine the probability of different fragmentation pathways of a molecule. Ab initio molecular dynamics based methods are able to explore in an automatic fashion the energetically available fragmentation paths thus give reaction mechanisms in an unbiased way. The mass spectra of five molecules have been explored in work-flows leading to the prediction of mass spectra. These molecules include three natural products alpha-hispanolol, PFB oxime derivative and boronolide (for which experimental mass spectra were not available) and two compounds from the NIST database (for which experimental mass spectra were available). For each of these systems many random conformations were generated using the RDKit library. To all conformations random velocities were applied to each atom. Ab initio molecular dynamics was performed on each conformer, using these initial random velocities using CP2K software, at DFTB+ level at a variety of highly raised temperatures (to accelerate the formation of fragments) Fragmentation was monitored by iterating through all bonds, and identifying bond breakages during dynamics. Graph theoretical packages were used then to track distinct fragments generated. For each of these fragments, charges were determined from Mulliken analysis for all atoms on the fragment from the QM calculations and sum of atomic spin densities per fragment was also plotted. The fragment with the greatest charge (corresponding to the formation of a cation fragment) was taken for plotting on the mass spectrum. Finally, from the mass of the fragment and its elemental composition, the isotopic distribution for the fragment was determined, and this distribution was included by addition in to the mass spectrum. For all trajectories, the sum of all isotopic distributions determined the final mass spectrum.
- Full Text:
- Date Issued: 2020
- Authors: Novokoza, Yolanda
- Date: 2020
- Subjects: Molecular dynamics , Molecular dynamics -- Computer simulation , Mass spectroscopy , Electron impact ionization
- Language: English
- Type: text , Thesis , Masters , MSc
- Identifier: http://hdl.handle.net/10962/167166 , vital:41443
- Description: Mass spectrometry (MS) is a technique that measures the fragmentation of molecules, dependent on the molecule’s chemical composition and structure, by first introducing a charge on the molecules. The instrument records the mass to charge ratio, but the energy from the ionization process causes the molecule to fragment. The resultant mass spectrum is highly indicative of not only the molecule analyzed, but also its chemical composition. MS is used in research and industry for both routine and research purposes. One such way to ionize molecules for MS is by bombarding the molecule with electrons which is the basis of electron impact mass spectrometry (EIMS). Although EIMS is widely used, prediction of electron impact mass spectra from first principles is a challenging problem due to a need to accurately determine the probability of different fragmentation pathways of a molecule. Ab initio molecular dynamics based methods are able to explore in an automatic fashion the energetically available fragmentation paths thus give reaction mechanisms in an unbiased way. The mass spectra of five molecules have been explored in work-flows leading to the prediction of mass spectra. These molecules include three natural products alpha-hispanolol, PFB oxime derivative and boronolide (for which experimental mass spectra were not available) and two compounds from the NIST database (for which experimental mass spectra were available). For each of these systems many random conformations were generated using the RDKit library. To all conformations random velocities were applied to each atom. Ab initio molecular dynamics was performed on each conformer, using these initial random velocities using CP2K software, at DFTB+ level at a variety of highly raised temperatures (to accelerate the formation of fragments) Fragmentation was monitored by iterating through all bonds, and identifying bond breakages during dynamics. Graph theoretical packages were used then to track distinct fragments generated. For each of these fragments, charges were determined from Mulliken analysis for all atoms on the fragment from the QM calculations and sum of atomic spin densities per fragment was also plotted. The fragment with the greatest charge (corresponding to the formation of a cation fragment) was taken for plotting on the mass spectrum. Finally, from the mass of the fragment and its elemental composition, the isotopic distribution for the fragment was determined, and this distribution was included by addition in to the mass spectrum. For all trajectories, the sum of all isotopic distributions determined the final mass spectrum.
- Full Text:
- Date Issued: 2020
In silico study of Plasmodium 1-deoxy-dxylulose 5-phosphate reductoisomerase (DXR) for identification of novel inhibitors from SANCDB
- Authors: Diallo, Bakary N'tji
- Date: 2018
- Subjects: Plasmodium 1-deoxy-dxylulose 5-phosphate reductoisomerase , Isoprenoids , Plasmodium , Antimalarials , Malaria -- Chemotherapy , Molecules -- Models , Molecular dynamics , South African Natural Compounds Database
- Language: English
- Type: text , Thesis , Masters , MSc
- Identifier: http://hdl.handle.net/10962/64012 , vital:28523
- Description: Malaria remains a major health concern with a complex parasite constantly developing resistance to the different drugs introduced to treat it, threatening the efficacy of the current ACT treatment recommended by WHO (World Health Organization). Different antimalarial compounds with different mechanisms of action are ideal as this decreases chances of resistance occurring. Inhibiting DXR and consequently the MEP pathway is a good strategy to find a new antimalarial with a novel mode of action. From literature, all the enzymes of the MEP pathway have also been shown to be indispensable for the synthesis of isoprenoids. They have been validated as drug targets and the X-ray structure of each of the enzymes has been solved. DXR is a protein which catalyses the second step of the MEP pathway. There are currently 255 DXR inhibitors in the Binding Database (accessed November 2017) generally based on the fosmidomycin structural scaffold and thus often showing poor drug likeness properties. This study aims to research new DXR inhibitors using in silico techniques. We analysed the protein sequence and built 3D models in close and open conformations for the different Plasmodium sequences. Then SANCDB compounds were screened to identify new potential DXR inhibitors with new chemical scaffolds. Finally, the identified hits were submitted to molecular dynamics studies, preceded by a parameterization of the manganese atom in the protein active site.
- Full Text:
- Date Issued: 2018
- Authors: Diallo, Bakary N'tji
- Date: 2018
- Subjects: Plasmodium 1-deoxy-dxylulose 5-phosphate reductoisomerase , Isoprenoids , Plasmodium , Antimalarials , Malaria -- Chemotherapy , Molecules -- Models , Molecular dynamics , South African Natural Compounds Database
- Language: English
- Type: text , Thesis , Masters , MSc
- Identifier: http://hdl.handle.net/10962/64012 , vital:28523
- Description: Malaria remains a major health concern with a complex parasite constantly developing resistance to the different drugs introduced to treat it, threatening the efficacy of the current ACT treatment recommended by WHO (World Health Organization). Different antimalarial compounds with different mechanisms of action are ideal as this decreases chances of resistance occurring. Inhibiting DXR and consequently the MEP pathway is a good strategy to find a new antimalarial with a novel mode of action. From literature, all the enzymes of the MEP pathway have also been shown to be indispensable for the synthesis of isoprenoids. They have been validated as drug targets and the X-ray structure of each of the enzymes has been solved. DXR is a protein which catalyses the second step of the MEP pathway. There are currently 255 DXR inhibitors in the Binding Database (accessed November 2017) generally based on the fosmidomycin structural scaffold and thus often showing poor drug likeness properties. This study aims to research new DXR inhibitors using in silico techniques. We analysed the protein sequence and built 3D models in close and open conformations for the different Plasmodium sequences. Then SANCDB compounds were screened to identify new potential DXR inhibitors with new chemical scaffolds. Finally, the identified hits were submitted to molecular dynamics studies, preceded by a parameterization of the manganese atom in the protein active site.
- Full Text:
- Date Issued: 2018
The investigation of type-specific features of the copper coordinating AA9 proteins and their effect on the interaction with crystalline cellulose using molecular dynamics studies
- Authors: Moses, Vuyani
- Date: 2018
- Subjects: Copper proteins , Cellulose , Molecular dynamics , Cellulose -- Biodegradation , Bioinformatics
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/58327 , vital:27230
- Description: AA9 proteins are metallo-enzymes which are crucial for the early stages of cellulose degradation. AA9 proteins have been suggested to cleave glycosidic bonds linking cellulose through the use of their Cu2+ coordinating active site. AA9 proteins possess different regioselectivities depending on the resulting cleavage they form and as result, are grouped accordingly. Type 1 AA9 proteins cleave the C1 carbon of cellulose while Type 2 AA9 proteins cleave the C4 carbon and Type 3 AA9 proteins cleave either C1 or C4 carbons. The steric congestion of the AA9 active site has been proposed to be a contributor to the observed regioselectivity. As such, a bioinformatics characterisation of type-specific sequence and structural features was performed. Initially AA9 protein sequences were obtained from the Pfam database and multiple sequence alignment was performed. The sequences were phylogenetically characterised and sequences were grouped into their respective types and sub-groups were identified. A selection analysis was performed on AA9 LPMO types to determine the selective pressure acting on AA9 protein residues. Motif discovery was then performed to identify conserved sequence motifs in AA9 proteins. Once type-specific sequence features were identified structural mapping was performed to assess possible effects on substrate interaction. Physicochemical property analysis was also performed to assess biochemical differences between AA9 LPMO types. Molecular dynamics (MD) simulations were then employed to dynamically assess the consequences of the discovered type-specific features on AA9-cellulose interaction. Due to the absence of AA9 specific force field parameters MD simulations were not readily applicable. As a result, Potential Energy Surface (PES) scans were performed to evaluate the force field parameters for the AA9 active site using the PM6 semi empirical approach and least squares fitting. A Type 1 AA9 active site was constructed from the crystal structure 4B5Q, encompassing only the Cu2+ coordinating residues, the Cu2+ ion and two water residues. Due to the similarity in AA9 active sites, the Type force field parameters were validated on all three AA9 LPMO types. Two MD simulations for each AA9 LPMO types were conducted using two separate Lennard-Jones parameter sets. Once completed, the MD trajectories were analysed for various features including the RMSD, RMSF, radius of gyration, coordination during simulation, hydrogen bonding, secondary structure conservation and overall protein movement. Force field parameters were successfully evaluated and validated for AA9 proteins. MD simulations of AA9 proteins were able to reveal the presence of unique type-specific binding modes of AA9 active sites to cellulose. These binding modes were characterised by the presence of unique type-specific loops which were present in Type 2 and 3 AA9 proteins but not in Type 1 AA9 proteins. The loops were found to result in steric congestion that affects how the Cu2+ ion interacts with cellulose. As a result, Cu2+ binding to cellulose was observed for Type 1 and not Type 2 and 3 AA9 proteins. In this study force field parameters have been evaluated for the Type 1 active site of AA9 proteins and this parameters were evaluated on all three types and binding. Future work will focus on identifying the nature of the reactive oxygen species and performing QM/MM calculations to elucidate the reactive mechanism of all three AA9 LPMO types.
- Full Text:
- Date Issued: 2018
- Authors: Moses, Vuyani
- Date: 2018
- Subjects: Copper proteins , Cellulose , Molecular dynamics , Cellulose -- Biodegradation , Bioinformatics
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/58327 , vital:27230
- Description: AA9 proteins are metallo-enzymes which are crucial for the early stages of cellulose degradation. AA9 proteins have been suggested to cleave glycosidic bonds linking cellulose through the use of their Cu2+ coordinating active site. AA9 proteins possess different regioselectivities depending on the resulting cleavage they form and as result, are grouped accordingly. Type 1 AA9 proteins cleave the C1 carbon of cellulose while Type 2 AA9 proteins cleave the C4 carbon and Type 3 AA9 proteins cleave either C1 or C4 carbons. The steric congestion of the AA9 active site has been proposed to be a contributor to the observed regioselectivity. As such, a bioinformatics characterisation of type-specific sequence and structural features was performed. Initially AA9 protein sequences were obtained from the Pfam database and multiple sequence alignment was performed. The sequences were phylogenetically characterised and sequences were grouped into their respective types and sub-groups were identified. A selection analysis was performed on AA9 LPMO types to determine the selective pressure acting on AA9 protein residues. Motif discovery was then performed to identify conserved sequence motifs in AA9 proteins. Once type-specific sequence features were identified structural mapping was performed to assess possible effects on substrate interaction. Physicochemical property analysis was also performed to assess biochemical differences between AA9 LPMO types. Molecular dynamics (MD) simulations were then employed to dynamically assess the consequences of the discovered type-specific features on AA9-cellulose interaction. Due to the absence of AA9 specific force field parameters MD simulations were not readily applicable. As a result, Potential Energy Surface (PES) scans were performed to evaluate the force field parameters for the AA9 active site using the PM6 semi empirical approach and least squares fitting. A Type 1 AA9 active site was constructed from the crystal structure 4B5Q, encompassing only the Cu2+ coordinating residues, the Cu2+ ion and two water residues. Due to the similarity in AA9 active sites, the Type force field parameters were validated on all three AA9 LPMO types. Two MD simulations for each AA9 LPMO types were conducted using two separate Lennard-Jones parameter sets. Once completed, the MD trajectories were analysed for various features including the RMSD, RMSF, radius of gyration, coordination during simulation, hydrogen bonding, secondary structure conservation and overall protein movement. Force field parameters were successfully evaluated and validated for AA9 proteins. MD simulations of AA9 proteins were able to reveal the presence of unique type-specific binding modes of AA9 active sites to cellulose. These binding modes were characterised by the presence of unique type-specific loops which were present in Type 2 and 3 AA9 proteins but not in Type 1 AA9 proteins. The loops were found to result in steric congestion that affects how the Cu2+ ion interacts with cellulose. As a result, Cu2+ binding to cellulose was observed for Type 1 and not Type 2 and 3 AA9 proteins. In this study force field parameters have been evaluated for the Type 1 active site of AA9 proteins and this parameters were evaluated on all three types and binding. Future work will focus on identifying the nature of the reactive oxygen species and performing QM/MM calculations to elucidate the reactive mechanism of all three AA9 LPMO types.
- Full Text:
- Date Issued: 2018
Structural studies on yeast eIF5A using biomolecular NMR and molecular dynamics
- Authors: Sigauke, Lester Takunda
- Date: 2015
- Subjects: Molecular dynamics , Reverse transcriptase , HIV (Viruses) , HIV infections , Eukaryotic cells , Yeast
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4547 , http://hdl.handle.net/10962/d1017927
- Description: Eukaryotic initiation factor 5A, eIF5A, is a ubiquitous eukaryotic protein that has been shown to influence the translation initiation of a specific subset of mRNAs. It is the only protein known to undergo hypusination in a two-step post translational modification process involving deoxyhypusine synthase (DHS) and deoxyhypusine hydroxylase (DOHH) enzymes. Hypusination has been shown to influence translation of HIV-1 and HTLV-1 nuclear export signals, while the involvement of active hypusinated eIF5A in induction of IRES mediated processes that initiate pro-apoptotic process have inspired studies into the manipulation of eIF5A in anti-cancer and anti-diabetic therapies. eIF5A oligomerisation in eukaryotic systems has been shown to be influenced by hypusination and the mechanism of dimerisation is RNA dependent. Nuclear magnetic resonance spectroscopy approaches were proposed to solve the structure of the hypusinated eIF5A in solution in order to understand the influence of hypusination on the monomeric arrangement which enhances dimerisation and activates the protein. Cleavage of the 18 kDa protein monomer by introduction of thrombin cleavage site within the flexible domain was thought to give rise to 10 kDa fragments accessible to a 600 MHz NMR spectrometer. Heteronuclear single quantum correlation experiments of the mutated isotopically labelled protein expressed in E. coli showed that the eIF5A protein with a thrombin cleavage insert, eIF5AThr (eIF5A subscript Thr), was unfolded. In silico investigations of the behaviour of eIF5A and eIF5AThr (eIF5A subscript Thr) models in solution using molecular dynamics showed that the mutated model had different solution dynamics to the native model. Chemical shift predictors were used to extract atomic resolution data of solution dynamics and the introduction of rigidity in the flexible loop region of eIF5A affected solution behaviour consistent with lack of in vivo function of eIF5AThr (eIF5A subscript Thr) in yeast. Residual dipolar coupling and T₁ relaxation times were calculated in anticipation of the extraction of experimental data from RDC and relaxation dispersion experiments based on HSQC measurable restraints.
- Full Text:
- Date Issued: 2015
- Authors: Sigauke, Lester Takunda
- Date: 2015
- Subjects: Molecular dynamics , Reverse transcriptase , HIV (Viruses) , HIV infections , Eukaryotic cells , Yeast
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4547 , http://hdl.handle.net/10962/d1017927
- Description: Eukaryotic initiation factor 5A, eIF5A, is a ubiquitous eukaryotic protein that has been shown to influence the translation initiation of a specific subset of mRNAs. It is the only protein known to undergo hypusination in a two-step post translational modification process involving deoxyhypusine synthase (DHS) and deoxyhypusine hydroxylase (DOHH) enzymes. Hypusination has been shown to influence translation of HIV-1 and HTLV-1 nuclear export signals, while the involvement of active hypusinated eIF5A in induction of IRES mediated processes that initiate pro-apoptotic process have inspired studies into the manipulation of eIF5A in anti-cancer and anti-diabetic therapies. eIF5A oligomerisation in eukaryotic systems has been shown to be influenced by hypusination and the mechanism of dimerisation is RNA dependent. Nuclear magnetic resonance spectroscopy approaches were proposed to solve the structure of the hypusinated eIF5A in solution in order to understand the influence of hypusination on the monomeric arrangement which enhances dimerisation and activates the protein. Cleavage of the 18 kDa protein monomer by introduction of thrombin cleavage site within the flexible domain was thought to give rise to 10 kDa fragments accessible to a 600 MHz NMR spectrometer. Heteronuclear single quantum correlation experiments of the mutated isotopically labelled protein expressed in E. coli showed that the eIF5A protein with a thrombin cleavage insert, eIF5AThr (eIF5A subscript Thr), was unfolded. In silico investigations of the behaviour of eIF5A and eIF5AThr (eIF5A subscript Thr) models in solution using molecular dynamics showed that the mutated model had different solution dynamics to the native model. Chemical shift predictors were used to extract atomic resolution data of solution dynamics and the introduction of rigidity in the flexible loop region of eIF5A affected solution behaviour consistent with lack of in vivo function of eIF5AThr (eIF5A subscript Thr) in yeast. Residual dipolar coupling and T₁ relaxation times were calculated in anticipation of the extraction of experimental data from RDC and relaxation dispersion experiments based on HSQC measurable restraints.
- Full Text:
- Date Issued: 2015
- «
- ‹
- 1
- ›
- »