Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual...

96
Biochemical Engineering Symposium Proceedings Chemical and Biological Engineering 9-8-2001 Proceedings of the irty-first Annual Biochemical Engineering Symposium Larry E. Erickson Kansas State University Follow this and additional works at: hp://lib.dr.iastate.edu/bce_proceedings Part of the Biochemical and Biomolecular Engineering Commons is Book is brought to you for free and open access by the Chemical and Biological Engineering at Iowa State University Digital Repository. It has been accepted for inclusion in Biochemical Engineering Symposium Proceedings by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Recommended Citation Erickson, Larry E., "Proceedings of the irty-first Annual Biochemical Engineering Symposium" (2001). Biochemical Engineering Symposium Proceedings. 27. hp://lib.dr.iastate.edu/bce_proceedings/27

Transcript of Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual...

Page 1: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Biochemical Engineering Symposium Proceedings Chemical and Biological Engineering

9-8-2001

Proceedings of the Thirty-first Annual BiochemicalEngineering SymposiumLarry E. EricksonKansas State University

Follow this and additional works at: http://lib.dr.iastate.edu/bce_proceedings

Part of the Biochemical and Biomolecular Engineering Commons

This Book is brought to you for free and open access by the Chemical and Biological Engineering at Iowa State University Digital Repository. It hasbeen accepted for inclusion in Biochemical Engineering Symposium Proceedings by an authorized administrator of Iowa State University DigitalRepository. For more information, please contact [email protected].

Recommended CitationErickson, Larry E., "Proceedings of the Thirty-first Annual Biochemical Engineering Symposium" (2001). Biochemical EngineeringSymposium Proceedings. 27.http://lib.dr.iastate.edu/bce_proceedings/27

Page 2: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

TP2483 GEN .8511

c.1

Proceedings of the Thirty-first Annual Biochentical

Engineering Syntposiunt Septentber 8, 2001

I 'APR 2 9 £.~' 'l L Q>, ~l ~ • I J

.... i _,, .. ' ... - - -· -

Larry E. Erickson Editor

Department of Chemical Engineering Kansas State University

Manhattan, Kansas 66506

Page 3: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Proceedings of the Thirty-First Annual Biochemical Engineering Symposium

The Thirty-First Annual Biochemical Engineering Symposium was held on September 7 and 8, 2001, at Kansas State University. The program included 10 oral presentations and 3 posters; however the paper by Boyack and Gilcrease was not presented because the presenter was ill and unable to come. Some of the papers describe work that is in progress while others describe completed projects. Many of the authors intend to submit their work for publication elsewhere in a more complete form. A listing of those who attended is given below. The activities began on Friday evening with an indoor picnic because of rain and wind.

List of Participants

Iowa State University: Jeffrey Gahan, Tony Hill, Matt Kipper, Alain Laederach, Chandrika Mulakala, Balaji Narasimhan, Peter Reilly, Jackie Shanks, Murali Subramanian, Melissa Swarts, and Ganesh Sriram

University of Oklahoma: Kemi Harris, Ulli Nollert, Elizabeth Phan, Omid Tawackoli, Tony Iran, Regina Visser, Heath Williams, and Marci Williams

University of Kansas: Wendy Sun

Kansas State University: Akkawit Aimdilokwong, Sigifredo Castro, Naveen Chennubhotla, Sasivimon Chittrakom, Kenneth Dokken, Larry E. Erickson, L.T. Fan, Stevin Gehrke, Sharon Hagan, Brian Lindsay, Sameer Khaitan, and Kaila Young

Page 4: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

31st Annual Biochemical Engineering Symposium Fiedler Hall Auditorium

Saturday, September 8, 2001

8:30a.m. Introduction and Welcome

8:35 Bioreduction of Chromium (Vn by Inactivated Medicago sativa (Alfalfa) Biomass, Jorge L. Gardea-Torresdey, Kenneth M. Dokken, Kirk J. Tiemann, Jason G. Parsons, and Gerardo Gomez, Univ. of Texas at EIPaso, Univ. of Indiana, and Kansas State Univ.

9:00 TNT Transformation by Plants: Role ofHydroxylamines, Murali Subramanian and Jacqueline V. Shanks, Iowa State Univ.

9:25 Plant Uptake and Transformation of Benzotriazoles, Sigifredo Castro, Lawrence C. Davis, and Larry E. Erickson, Kansas State Univ.

9:50 Break to Put Up Posters and View Posters

10:20 Biological Sulfate Reduction in Alkaline Waters for the Reprocessing of Trona Tailings Ponds, Leigh Ann Boyack and Pat Gilcrease, Univ. of Wyoming

10:45 Microbial Degradation of Methyl Benzotriazole, Kaila Young, Lawrence C. Davis, and Larry E. Erickson, Kansas State Univ.

11:10 New Insights into the Catalytic Mechanism ofthe Family 47 Class I Alpha-1,2 Mannosidases by Computational Docking of Mannosyl Substrates, Chandrika Mulakala and Peter J. Reilly, Iowa State Univ.

11:35 Lunch and Poster Viewing

1:00 p.m. A Mathematical Model for Carbon Bond Labeling NMR Experiments: Analytical Solutions and Sensitivity Analysis for the Effect ofReaction Reversibilities on Estimated Fluxes, Ganesh Sriram and Jacqueline V. Shanks, Iowa State Univ.

1:25 Platelet Derived Nitric Oxide (NO) Inhibits Thrombus Formation on Collagen: The Role of Insulin, R. H. Williams and M.U. Nollert, Univ. of Oklahoma

1:50 Molecular Mechanics Calculations to Quantify Segmental Interactions in Bioerodible Polyanhydrides: Consequences for Drug Delivery, Matt Kipper and Balaji Narasimhan, Iowa State Univ.

2: 15 Poster Viewing

2:30 Three-Dimensional Hydrophobic Cluster Analysis: Use of a Virtual Environment to Display Protein Cluster Information, Tony Hill, Alain Laederach, and Peter J. Reilly, Iowa. State Univ.

Poster Presentations in Durland 129

1. Mechanical Properties of Elastin-Mimetic Hydrogels, Eder D. Oliviera, Sharon A. Hagan, and Stevin H. Gehrke, Federal University of Minas Gerais, Brazil and Kansas State Univ.

2. The Use of Patterned Films for Directional Optic Nerve Growth, Jeffrey C. Gahan, Jennifer A. Behr, Surya K. Mallapragada, and Srdija Jeftinija, Iowa State Univ.

3. Multiple Sequence Alignment and Phylogenetic Analysis of a Family of Beta-Glycosidases, Tony Hill, Alain Laederach, and Peter J. Reilly, Iowa State Univ.

Page 5: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

TABLE OF CONTENTS

Bioreduction of Chromium(VI) by Inactivated Medicago Sativa (Alfalfa) Biomass Kenneth M Dokken, Jorge L. Gardea-Torresdey, Kirk J. Tiemann, Jason G. Parsons, and Gerardo Gamez ....................................................... 1

TNT Transformation by Plants: Role of Hydroxylamines in the Pathway Murali Subramanian and Jacqueline V. Shanks ................................. 11

Plant Uptake and Transformation of Benzotriazoles Sigifredo Castro, Lawrence C. Davis, and Larry E. Erickson ...................... 21

Microbial Degradation of 5-Methyl Benzotriazole Kaila Young, Larry Erickson, Lawrence Davis, and Sigifredo Castro Diaz ........... 29

Understanding Protein Structure-Function Relationships in Family 47 a-1,2-Mannosidases through Computational Docking of Ligands

Chandrika Mulakala and Peter J. Reilly ...................................... 35

A Mathematical Model for Carbon Bond Labeling Experiments: Analytical Solutions and Sensitivity Analysis for the Effect of Reaction Reversibilities on Estimated Fluxes

Ganesh Sriram and Jacqueline V. Shanks ..................................... 45

Platelet Derived Nitric Oxide (NO) Inhibits Thrombus Formation: The Role of Insulin R.H. Williams and M U. Nollert ............................................ 55

Molecular Mechanics Calculations to Quantify Segmental Interactions in Bioerodible Polyanhydrides: Consequences for Drug Delivery

Matt Kipper and Balaji Narasimhan ......................................... 65

Three-Dimensional Hydrophobic Cluster Analysis: The Use of a Virtual Environment for Protein Sequence Analysis: HELIX v0.3

Anthony D. Hill, Alain Laederach, and Peter J. Reilly ............................ 73

Mechanical Performance of Elastin-Mimetic Hydrogels Eder D. Oliveira, Sharon A. Hagan, and Stevin H. Gehrke ........................ 76

Multiple Sequence Alignment and Phylogenetic Analysis of Family 1 ~-Giycosidases Anthony D. Hill, Alain Laederach, and Peter J. Reilly ............................ 82

Page 6: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

BIOREDUCTION OF CHROMIUM(VI) BY INACTIVATED MEDICA GO

SATIVA (ALFALFA) BIOMASS

Kenneth M. Dokken8, Jorge L. Gardea-Torresdey*b, Kirk J. Tiemannb, Jason G. Parsonsb,

and Gerardo Gamezc 8Present address: Department ofBiochemistry, Kansas State University, Manhattan,

Kansas 66506 ~epartment of Chemistry, University ofTexas at El Paso, El Paso, Texas 79968.

*Corresponding author phone: (915) 747-5359; fax: (915)747-5748; e-mail: [email protected]

cpresent address: Department of Chemistry, Indiana University at Bloomington, Bloomington, Indiana 47405.

Abstract

Previous studies have demonstrated the ability of Medicago sativa (alfalfa) to adsorb chromium(III) ions in aqueous solution. However, the use alfalfa to remove chromium(VI) from aqueous solution has not been considered. Thus, batch studies were conducted to determine the behavior of chromium(VI) binding to alfalfa biomass. Optimum chromium(VI) binding was observed at pH 2.0 with increased binding over a 48 hour period. Modification experiments using esterified and hydrolyzed alfalfa biomass were carried out to determine the participation of carboxyl groups in chromium(VI) binding. The unmodified, hydrolyzed, and esterified alfalfa biomasses showed binding capacities of 5.9, 10.4, and 0.0 mg Cr(Vl) per gram of biomass, respectively. Thus providing evidence that carboxyl groups may play a role in chromium(VI) binding. X-ray absorption spectroscopic studies were used to characterize chromium(VI) adsorption by alfalfa biomass through X-ray Absorption Near Edge Structure (XANES) and Extended X-ray Absorption Near Edge Structure (EXAFS). The results of the batch and XAS studies indicate that alfalfa biomass bioreduces chromium(VI) to chromium(IIl) and subsequently binds chromium(lll) predominantly through carboxyl groups.

Introduction

Chromium(VI) is commonly found in wastewater from metal finishing, refractory, and metallurgical industries[l-3]. It is also commonly used as a pigment and a corrosion inhibitor. The increased demand for these products has led to a rise in wastewater production and contamination of natural waters. Chromium(VI) is the most toxic form of chromium and is highly soluble and mobile in the aquatic environment. The most common method for remediating chromium(VI) involves the reduction of chromium(VI) to chromium(IIl) by chemical or electrochemical means, followed by precipitation, and then filtration [3]. Other conventional technologies include adsorption by activated carbon or strongly basic anion exchange resins [ 4-8]. However, these techniques are extremely costly, utilize harsh chemicals in their fabrication, are energy and time extensive, and are subject to fouling in hard water conditions. Bioremediation has been the dominant alternative method of choice for the past twenty years. Losi et al. conducted a study on a wide variety of microorganisms for their potential to uptake chromium(VI) [3]. Live plants can adsorb chromium(VI) from contaminated waters

1

Page 7: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

[9, 10]. However, live systems require constant maintenance, nutritional supplementation and can fall prey to the toxicological effects of chromium(VI). This presents a need for the production of techniques that can overcome these problems.

The use of inactivated (dead) plant or algal biosorbents and industrial by­products, such as sawdust and fly ash, has presently generated much interest due to their cost effectiveness, low maintenance, and high durability as compared to live systems [11-16]. Alfalfa (Medicago sativa) possesses these much wanted characteristics and has shown the potential to adsorb chromium(III) and other heavy metals at high levels in previous studies [ 17 -19]. It is known that some plants have the ability to bioreduce chromium(VI) to chromium(III), and it is known that plants possess functional groups with reducing properties [10,11,20-22]. Alfalfa biomass may possess these reducing groups and may be able to function as a bioreduction/adsorption system for chromium(VI) [23].

Experimental A!falfa Collection

The alfalfa plant tissues used in this study were acquired from controlled agricultural field studies at New Mexico State University in Las Cruces, New Mexico. The species of alfalfa (Medicago sativa) used was the Malone variety. The plants were removed from the soil and washed thoroughly using deionized water to remove any soil or debris. The roots were separated from the shoots and oven dried at 90°C for a week. The samples were ground to pass through a 100-mesh sieve using a Wiley mill. The shoot biomass was utilized for this study due to its ease in agricultural availability and considerable adsorption capacity for aqueous heavy metal ions [17, 18].

pH Profile for Chromium(Vl) Binding Batch laboratory techniques were used for pH studies as previously reported by

Gardea-Torresdey et al. [17, 18]. Samples of unmodified alfalfa biomass was placed into deionized water at a concentration of 5 mg of biomass per mL. The pH of the suspensions were adjusted to pH 2.0 and allowed to equilibriate, and 2-mL aliquots were transferred to three 5-mL plastic tubes. The pH was then adjusted to 3.0, 4.0, 5.0, 6.0, and 7.0, and 2-mL aliquots of the suspension at each pH were transferred to three clean 5-mL tubes. The suspensions were centrifuged at 3000 rpm for 5 min. A 0.1 mM chromium(VI) solution was prepared from potassium dichromate (K2Cr201) salt and the pH was adjusted to 2.0, 3.0, 4.0, 5.0, 6.0, and 7.0. At each pH, 2 mL of chromium(VI) solution were added to its respective pH biomass pellet. Each pH had a control chromium(VI) solution to compare for metal analyses. All of the samples were equilibrated for one hour on a rocker. The samples were then centrifuged for five minutes at 3000 rpm. The supernatants were transferred to clean tubes and kept for metal analyses by flame atomic absorption spectroscopy (F AAS). Experiments were conducted in triplicate for quality control purposes.

Time Dependent Study for Chromium(Vl) Binding The time dependent study was performed according to studies previously

performed [19]. The protocol utilized an alfalfa biomass concentration of 5 mg per ml of solution. A 0.3mM Cr(VI) solution at pH 2.0 was prepared. The time intervals used in

2

Page 8: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

---------- -----~- ---- --

this study were: 5, 10, 15, 20, 25, 30, 45, 60, 90, 120, 360, 720, 1440, and 2880 minutes. The alfalfa biomass was reacted with the Cr(VI) solution for the aforementioned time intervals. After the indicated time intervals, the samples were centrifuged for five minutes at 3000 rpm. The final pHs were recorded and the chromium concentration of the supernatants were determined by F AAS.

Esterification Q/ Alfalfa Biomass The esterification of the alfalfa biomass was conducted as pre:viously reported

[19]. Nine grams of Malone shoot biomass was suspended in acidic methanol solution containing 63 3 mL of 99.9% methanol and 5.4 mL of concentrated hydrochloric acid (HCl). The solution was continuously stirred for eight hours at 60°C. After eight hours, the alfalfa was removed from the heat and allowed to cool to room temperature. The alfalfa was pelleted by centrifugation at 3000 rpm for five minutes. The pelleted alfalfa biomass was washed three times with deionized water to stop the esterification reaction and remove any excess acidic methanol. The esterified alfalfa biomass was lyophilized overnight in a Labconco freeze dryer.

Hydrolyzation of Alfalfa Biomass Hydrolyzation of the alfalfa biomass was performed according to previously

described protocol [ 19]. Nine grams of Malone shoot biomass were reacted with 100 mL of 0.1 M sodium hydroxide (NaOH) for one hour. The biomass was then pelleted by centrifugation at 3000 rpm for five minutes and washed three times with deionized water. The hydrolyzed alfalfa was lyophilized overnight in a Labconco freeze dryer.

Chromium(V/) Binding Capacities Batch laboratory methods utilized in previous studies conducted by Gardea­

T orresdey et al. were used to determine the chromium(VI) binding capacity of the unmodified and modified alfalfa biomasses [17]. 100 mg of each sample were washed twice with 0.1 M HCl and twice with deionized water to remove any debris or soluble compounds. The washings were collected and weighed to account for any biomass or resin loss. The washed samples were resuspended into 10 mL of deionized water, and the pH was adjusted to pH 2.0. A 0.3mM chromium solution was prepared at pH 2.0 from the potassium chromate (K2Cr20 7) salt. The samples were reacted for ten fifteen minute intervals with the chromium(VI) solution or until saturation was achieved. The samples were then centrifuged at 3000 rpm for five minutes. The supernatants were placed into clean tubes and analyzed for chromium concentration using F AAS.

Flame Atomic Absorption 5ipectroscopy (FAAS) A Perkin Elmer model 3110 Atomic Absorption Spectrometer was used to

determine the chromium content in the batch pH and capacity studies. The analytical wavelength used was 359.8nm with a slit width of 0.7nm. The chromium lamp current was 25mA. An impact bead was used to improve sensitivity. A calibration curve with a correlation coefficient of 0.98 or better was obtained. Known standards were used to check the instrument response. The samples were analyzed three times and the mean value and relative standard deviation were determined. In order to work within the linear

3

Page 9: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

range, some samples were diluted using 0.1M hydrochloric acid. The amount of accumulated chromium by the alfalfa biomass was determined by mass balance.

X-ray Absorption Spectroscopy (XAS) The X-ray Absorption spectroscopic studies were conducted at Stanford

Synchrotron Laboratories (SSRL) at beamline 7-3. The Cr K edges (5989 eV) were collected using standard operating conditions of 3 GeV and 60-100 rnA [12]. The chromium(VI) loaded biomass samples were ran at 20K usirig a liquid helium cryostat to reduce any dampening or thermal disorder. Fluorescence spectra measurements of the biomass samples were acquired using a Canberra 13-element array germanium detector. The transmission measurements for the model compounds were taken using argon filled ionization chambers. The model compounds, potassium dichromate (K2Cr20 7) and chromium(III) phosphate (CrP04), were ground and diluted using boron nitride. Samples of unmodified alfalfa biomass and Diaion CRB02 amino, Diaion WTO 1 S carboxyl, and cellulose phosphate resins (Supelco, Belafonte, PA) were saturated with a 1000 ppm Cr(VI) solution[19mM K2Cr201] prior to analysis at pH 2.0 [24]. The samples and model compounds were packed into aluminum sample holders with x-ray transparent Kapton tape with a 1 mm path length. The packed samples were measured as solids. A Si(220) double crystal monochromator with an entrance slit of 1 mm was used to conduct the measurements. The monochromator was detuned by 50% in order to reject higher order harmonics. All spectra were calibrated against the edge position (5989 eV) of a chromium foil internal standard. To improve the signal to noise ratio, the scans of XANES spectra were averaged.

Results and Discussion Batch Studies

The binding trend of chromium(VI) with respect to pH can be observed in Figure 1. It can be seen that the optimum chromium(VI) binding pH is 2.0 which is similar to others studies with seaweed and water hyacinth which display optimum chromium(VI) binding at acidic pH [10,11]. Gardea-Torresdey et al. also observed good binding at low pH for chromium(Vl) with oat byproducts [25]. A low binding pH suggests the involvement of positively charged functional groups such as amino groups to provide protons for a bioreduction process described by Kratochvil et al [11]. These investigators proposed that the adsorption of chromium(VI) is similar to that of a weak anion exchange resin. At low pH, protons are available to bind to the chromate. Then protonated chromate can oxidize the alfalfa biomass and be reduced to chromium(III) ions. These chromium(III) ions may then adsorb to the alfalfa biomass similar to a cation exchange method. However, it is evident that the bioreduction and adsorption process consists of several steps. Further indication that a stepwise process is occurring can be seen in Figure 2 which depicts the amount of chromium bound with respect to time. The amount of chromium bound to the alfalfa biomass increases over a 48 hour period from 30% to almost 60%.

Table 1 shows the chromium(VI) binding capacities for unmodified and modified alfalfa biomasses. It can be observed that the hydrolyzed alfalfa produced a 46% enhancement in chromium(VI) binding from 5.9 to 10.4 mg chromium per gram of alfalfa biomass. Chromium(VI) adsorption was not observed for the esterified biomass.

4

Page 10: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

These results indicate that carboxyl groups are playing a vital role in the adsorption/reduction of chromium(VI) by the alfalfa biomass. Similar results were observed by Gardea-Torresdey et al. for the trivalent species of chromium using unmodified and modified alfalfa biomasses [19]. This suggests that chromium(VI) is binding to the alfalfa biomass as chromium(III).

X-ray Absorption Spectroscopy (XAS) Figure 3 depicts the XANES spectra for model compounds potassium dichromate

and chromium(III) phosphate and chromium(VI) reacted with the alfalfa biomass at pH 2.0. It can be observed that the XANES spectrum for the chromium(VI) reacted with alfalfa biomass resembles that of the XANES spectrum for the chromium(III) phosphate. The difference between the XANES spectra for trivalent and hexavalent chromium can be seen by the prominent pre-edge for chromium(VI) at about 6 ke V. This pre-edge is due to a bound-state Is to 3d transition which is forbidden for octahedral Cr(HI)06 but allowed for noncentrosymmetric tetrahedral Cr(VI)04 molecules[26]. The empty 3d orbitals of Cr(VI) allow for Is to 3d transitions and therefore enhance the intensity of the pre-edge peak. The almost identical XANES spectra for the chromium(III) phosphate model compound and the alfalfa biomass reacted with chromium(Vl) indicate that the chromium is binding to the biomass as chromium(III) and further provides evidence that a bioreduction process has taken place. Small pre-edge features can be observed for the chromium(III) phosphate and the chromium(VI) reacted with the alfalfa biomass at about 6 keY. The pre-edges for chromium(III) are at slightly higher energy positions than chromium(VI). This phenomenom can be explained in terms of the characteristics of the Cr-CO bond [27,28]. Figure 3 and Table I provide evidence that chromium(VI) is bioreduced to chromium(III) and subsequently binds to the alfalfa biomass by way of an oxygen ligand, which is most likely a carboxyl ligand as indicated by the batch modification studies.

To determine which functional group(s) may be capable of bioreducing chromium(VI) to chromium(III), several ion exchange resins were reacted with chromium(VI) at pH 2.0. Figure 4 depicts the XANES spectra for chromium(VI) reacted with amino, phosphate, and carboxyl ion exchange resins. The carboxyl and phosphate resins share similar spectra because no chromium was bound to either of them yielding spectra without an edge. However, the spectrum for the amino resin reacted with chromium(VI) resembles that of a chromium(III) spectrum. This possibly indicates that amino groups may bioreduce chromium(VI) to chromium(III). This is possible because at a low pH of 2.0 the amino groups are protonated and thus available to bind the chromium(VI) oxyanions (which are negatively charged) and then reduce chromium(VI) to chromium(Ill). This follows the proposed anion exchange mechanism described earlier by Kratochvil et a! [II].

Conclusions

The ability of the alfalfa biomass to bioreduce and adsorb chromium(VI) ions from aqueous solution was observed through the batch and XAS studies. The bioreduction/adsorption process occurs mainly at pH of 2.0 and increases over time. This

5

Page 11: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

indicates that chromium(VI) adsorption is a multistep process consisting of the initial binding of chromium(VI), bioreduction from chromium(VI) to chromium(III), and binding of chromium(III). The first step requires a positively charged functional group, most likely an amino group, to bind chromium(VI) and donate protons for the bioreduction process. Next, bioreduction of chromium(VI) to chromium(III) by amino groups on the surface of the biomass may occur. After the bioreduction has taken place, the chromium binds to the alfalfa biomass in the trivalent state. The chromium binds to the alfalfa biomass by way of oxygen ligands as observed by the small pre-edge for Cr­CO coordination. The modification studies indicated that hydrolysis enhanced chromium(VI) binding while esterification yielded no binding of chromium(VI). This suggests that the carboxyl group is the predominant oxygen ligand involved in the process. Alfalfa biomass has the potential to be a environmentally safe, cost effective system for removal of chromium(VI) from contaminated waters.

References

I. Nriagu JO, 1988. Chromium in Natural and Human Environments. Wiley Interscience, New York. 2. Kotas J, Stasicka Z. 2000. Chromium occurrence in the environment and methods of its speciation. Environ. Pollut., 107, 263. 3. Losi ME, Amrhein C, Frankenberger WT. 1994. Environmental biochemistry of chromium. Rev. Environ. Toxicol., 136, 91. 4. Lalvani SB, Wiltowski T, Hubner A, Weston A, Mandich N. 1998. Removal of hexavalent chromium and metal cations by a selective and novel carbon adsorbent. Carbon 36, 1219. 5. Park SJ, Park BJ, Ryu SK. 1999. Electrochemical treatment on activated carbon fibers for increasing the amount and rate ofCr(VI) adsorption. Carbon 37, 1223. 6. Aggarwal M, Goyal M, Bansal RC. 1999. Adsorption of chromium by activated carbon from aqueous solution. Carbon. 37, 1989. 7. Sapari A, Idris N, Hisham Ab. Hamid N. 1996. Total removal of heavy metal from mixed plating rinse wastewater. Desalination. I 06, 419. 8. Sule PA, Ingle JD. 1996. Determination of the speciation with an automated two­column ion-exchange system. Anal. Chim. Acta, 326, 85. 9. Micera G, Dessi A 1988. Chromium adsorption by plant roots and formation of long­lived Cr(V) species: An ecological hazard?.!. Jnorg. Biochem. 34, 157. 10. Lytle CM, Lytle FW, Yang N, Qian JH, Hansen D, Zayed A, Terry N. 1988. Reduction of Cr(III) to Cr(VI) by wetland plants: Potential for in situ heavy metal detoxification. Environ. Sci. Techno!. 32, 3087. 11. Kratochvil D, Pimentel P, Volesky B. 1998. Removal of trivalent and hexavalent chromium by seaweed biosorbent. Environ. Sci. Techno!. 32, 2693. 12. Prakasham RS, Merrie JS, Sheela R, SaswathiN, Ramakrishna NV. 1999. Biosorption of chromium VI by free and immobilized Rhizopus arrhizus. Environ. Pollut. 104, 421. 13. Pappas CP, Randall ST, Sneddon J. 1990. An atomic-emission study ofthe removal and recovery of chromium from solution by an algal biomass (Chiarella vulgaris). Talanta 37, 707.

6

Page 12: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

~--------------- ---- ---

14. Gupta VK, Mohan D, Sharma S, Park KT. 1999. Removal of chromium(VI) from electroplating industry wastewater using bagasse fly ash-a sugar industry waste material. The Environmentalist 19, 129. 15. Singh RP, Vaishya RC, Prasad SC. 1992. A study of removal of hexavalent chromium by adsorption on sawdust. Indian J. Environ. Protect. 12, 916. 16. Raji C, Anirudhan TS. 1998. Batch Cr(VI) removal by polyacrylamide-grafted sawdust: Kinetics and thermodynamics. Wat. Res. 32, 3772. 17. Gardea-Torresdey JL, Gonzalez ffi, Tiemann JK, Rodriguez 0, Gamez G. 1998. Phytofiltration of hazardous cadmium, chromium, lead and zinc ion by biomass of Medicago sativa (alfalfa). J. Hazard Mater. 57, 29. 18. Gardea-Torresdey JL, Tiemann KJ, Gamez G, Dokken K. 1999. Effects of chemical competition for multi-metal binding by Medicago sativa (alfalfa). J. Hazard Mater. 69, 41. 19. Tiemann JK, Gardea-Torresedey JL, Gamez G, Dokken K, Sias S. 1999. Use of X­ray absorption spectroscopy and esterification to investigate Cr(III) and Ni(II) ligands in alfalfa biomass. Environ. Sci. Techno/. 33, 150. 20.Gessa C, De Cherch ML, Dessi A, Deiana S, Micera G. 1983. The reduction ofFe(III) to Fe(II) and V(V) to V(IV) by polygalacturonic acid : a reduction and complexation mechanism ofbiochemical significance. Inorg. Chim. Acta 80, L53. 21. Romheld V, Marschner H. 1983. Mechanism of iron uptake by peanut plants. Plant Physiol. 71, 949. 22. Hernandez A, Hernandez T, Martinez C. 1998. Production and chemical composition of alfalfa protein concentrate obtained by freezing. Animal Feed Sci. Techno/. 72, 169. 23. Dushenkov V, Nonda Kumar PBA, Motto H, Raskin I. 1995. Rhizofiltration: The use of plants to remove heavy metals from aqueous streams. Environ. Sci. & Techno/. 29, 1239. 24. Salt DE, Pickering IJ, Prince RC, Gleba D, Dushenkov V, Smith RD, Raskin I. 1997. Metal Accumulation by aquacultured seedlings oflndian Mustard. Environ. Sci. Techno/. 31, 1636. 25. Gardea-Torresdey JL, Tiemann KJ, Armendariz V, Bess-Oberto L, Chianelli RR, Rios J, Parsons JG, Gamez G. 2000. Characterization of Cr(VI) binding and reduction to Cr(III) by the agricultural byproducts of Avena monida (Oat) biomass .. !. Hazard Mater., B80, 175. 26. Peterson ML, Brown GE, Parks GA, Stein CL. 1997. Differential redox and sorption of Cr(IIINI) on natural silicate and oxide minerals: EXAFS and XANES results. Geochim. et Cosmochim. Acta. 61, 3399. 27. Peterson L, White AF, Brown GE, Parks GA. 1997. Surface passivation of magnetite by reaction with aqueous Cr(VI): XAFS and TEM results. Environ. Sci. Techno!. 31, 1573. 28. Engemann C, Hormes J, Longen A, Dotz KH. 1998. A XANES study on organochromium complexes at the Cr K-edge. Chemical Physics. 23 7, 4 71.

7

Page 13: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

'0 c:::: :;, 0 m E :;, ·-E e .c u ';/e.

'0 c:::: :;, 0 m E :;, ·-E 0 .... .c u

70

60

50

40

30

20

10

0 2 3 4 5 6 7

Solution pH

Figure 1: Effect of solution pH on the adsorption of chromium(VI) by inactivated alfalfa biomass

70 60 50 40 30 20 10

~ 0 0 ';) ~ ~ ~ OJ~ t;o~ ~

~ "~ Time (min)

Figure 2: Percent of chromium bound by inactivated alfalfa biomass at different intervals ranging from 0 min to 48hours.

8

Page 14: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

'"'":' :l

~ c

1.0

.5! 0.7 .... c. '-0 Cl) .Q

C'IS

E o.5 '-0 c

0.2

Table 1: Chromium(VI) Binding Capacities for Unmodified and Modified Alfalfa Biomasses

Type of Biomass mg Chromium/ gram biomass

Unmodified 5.9±1.1

Hydrolyzed 10.4±1.4

Esterified 0.0±0.1

. ······ ··... l "'· -~· ... - ....... ····· c. .·~ .... ~. . .. , ~·~

. IE

! ·= fo/ I

'------'---~~-~·-• 0---'.

0 0 ~~~;::i _L_c~-•-- -~-. ~-·~~---~--~- -". _j 5.9 6 6.1 6.2

photon energy [keV]

Figure 3: XANES spectra for chromium model compounds and chromium(VI) reacted with the alfalfa biomass. Potassium dichromate (K2Cr201) (-), Chromium(III) phosphate (CrP04) (- -), Chromium(VI) reacted with the alfalfa biomass at pH 2.0 (•••).

9

Page 15: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

1.5

..... ::;

.!!. 1.0 t: 0 a ... 0 Cl) .c CIS

E o.5 ... 0 c:

... ·····---··., ..... . -~--~-~-~-·~·~·~·~·~·~··~·~·-~·~· --

6 LO 6.1

photon energy [keV1

Figure 4: XANES spectra of ion exchange resins reacted with chromium(Vl) at pH 2.0. Diaion CRB02 Amino resin(-), Diaion WTOIS Carboxyl resin(--), Cellulose phosphate (•••).

10

Page 16: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

TNT Transformation by Plants: Role of Hydroxylamines in the Pathway Murali Subramanian and Prof. Jacqueline V Shanks Chemical Engineering, Iowa State University, Ames, fA 50011

Introduction Phytoremediation is the science of using plants to clean up pollutant wastes. It offers many advantages over conventional methods of cleaning up soil wastes, such as incineration, since it is much less expensive (EPA 2000) and more ecologically friendly. 2,4,6-trinitrotoluene, TNT, a widely used explosive has contaminated around 40 anny land sites in the US, and many more worldwide (Spain 2000). Clean up of such sites before the deleterious effects of TNT in the food chain manifest themselves is important (Nagel eta!. 1999, Palazzo and Leggett 1986, Won eta!. 1976, Yinon 1990).

TNT transformation by plants has been sufficiently researched to generate a hypothetical pathway for its transformation. A 'Green Liver' model (Bhadra eta!. 1999a, Bhadra eta!. 1999b, Hughes eta!. 1997, Sandermann 1994, Vanderford eta!. 1997) has been proposed and verified for TNT transformation. The TNT transformative experiments were performed with Catharanthus roseus hairy roots and Myriophyllum aquaticum, aquatic plants. Under this model, TNT is initially reduced or oxidized, and then conjugated with plant molecules, and finally the conjugates polymerize to form inextricable bound residues, covalently bonded to the plant. A hypothetical TNT transformation pathway has been proposed (figure 1) (Burken eta!. 2000), which encompasses all these stages. In the hypothetical TNT -transformation pathway hydroxylamines occupy a central position, and are acted on by different enzymes to form varying metabolites. The types of metabolites formed possibly include reductive, conjugative, oxidative and diazoxy compounds. In addition, hydroxylamines are highly unstable in the presence of oxygen and in aqueous media (Ahmad and Hughes 2000, Burken eta!. 2000, Wang, C. Y. eta!. 2000). These factors make the detection ofhydroxylamines a challenge in the aqueous, aquatic media that conventional phytoremediation experiments are carried out in.

Hydroxylamines are the central focus of this paper, since their presence in TNT metabolism in plants has been speculated on but never proved. It was attempted to identify hydroxylamines in the C. roseus system, prove its importance in the pathway, and develop a stabilization scheme for it. Confirming the presence ofhydroxylami.nes, and realizing their role in TNT -metabolism is very significant, given both the toxic nature ofhydroxylamines (Fu 1990), and the fact that they are the most reactive compounds in the pathway. Until now, hydroxylamines have not been positively identified in the C. roseus system, although theoretical evidence points to its formation. The explanation for the non-observance ofhydroxylamines lie in probably in the fact that their instability in aerobic and aqueous media make them undetectable by conventional HPLC. Stabilizing the hydroxylamines and positively identifying them in the Cathranthus roseus system will help in further understanding of TNT metabolism by plants.

11

Page 17: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

2HA4,6DNT

NH-R3 02N

Conjugates

TNT-1

TNT

N02

! NH2 0 2N

N02

! NH-R4 0 2N

N02

t

Conjugates Higher Degree of Polymerization

a

!

NH2

! CH3

NH-R1

t

.

N02 •••••

.. . -

4RA2,6DNT

' '

' . ' ' ' ' '

' . . . N02

. \

.

N02 0 2N

Conjugates

4A-1

' ' ' ' '

' '

' ' ,.

. '

CH3

NH-R2

Figure 1: Hypothetical TNT transformative pathway. Adapted from Burken et al, 2001.

12

N02

Page 18: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Hydroxylamine Feeding Studies

Motivation: To understand the role and position ofhydroxylamines in the pathway, a feeding study was performed, where 2-hydroxylamino-4,6-dinitrotoluene was fed to two-week-old C. roseus roots, in their late exponential phase, at a concentration of0.02 mM. As a comparison, 0.05mM 2-amino-4,6-dinitrotoluene was fed to another flask of two-week­old C. roseus roots. Temporal product analysis was done until all the metabolites formed disappeared using a PDA equipped HPLC with a C8 column.

Results and Discussion: 2-hydroxylamino-4,6-dinitrotoluene fed flasks showed the formation of two types of conjugates, previously identified as 2A-1 and TNT-1 (Bhadra et al. 1999b, Wayment et al. 1999), whereas the 2-amino-4,6-dinitrotoluene fed flask produced only the 2A-1 conjugate. These results demonstrated the fact that hydroxylamines do not need to be reduced to conjugate, and plant organic molecules can add either directly to the hydroxylamine group, or can be added to the amine group. Hence, this experiment verifies the position of the hydroxylamines in the pathway (figure 1), and proves their versatility. It also proves that there may exist many conjugative schemes in the plcmt, depending on the substrate.

Stability of hydroxylamines

Motivation: The poor stabilities ofhydroxylamines in aqueous systems were probed by carrying out stability runs under different pH conditions and monitoring the hydroxylamine concentrations in the system. A normal pH (pH approx 7) and a pH <1 were used, the lower pH being introduced to investigate the stability ofhydroxylamines under oxidative conditions.

Results and Discussion: Figure 2 has the concentration profile of 2-hydroxylamino-4,6-dinitrotoluene in water, while figure 3 has the same for the 4-hydroxylamine isomer. Large degradation amounts and pH dependence were observed for both the hydroxylamines. By 40 hours, 50 percent of the 2HADNT had disappeared, and 80 percent of the 4HADNT was missing, at normal pH. Conditions of pH < 1 had a destabilizing effect on the hydroxylamines, quite contrary to what was expected. This could be due to either faults in the experimental set-up or execution, or could imply that the hydroxylamines under low pH conditions in aqueous media get oxidized by the electron accepting water. Water can behave as a weak acid or base; hence under low pH conditions it probably behaves as an acid. The hydroxylamines could be oxidized back to their nitroso functionalities. No degradation products were observed, indicating that the hydroxylamines did not reduce to their diamines, or ~my other identifiable metabolite, thereby also proving that all metabolites seen in the C. roseus system were obtained by enzymatic transformations.

13

Page 19: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

C)

.51 2 c . •2HADNT,NonnalpH,Aqueous ·-cu 1 E ! 1- 0.8 • • • 2HADNT, pH=1, Aqueous

z c 0.6 <C • • ~ 0.4 + r2HADNTh = 0.05 mM c 0 0.2 ;; (,)

0 f! I

• • • LL

0 10 20 30 40 50 time (hours)

Figure 2: Transient concentration profile of 2-hydroxylamino-4,6-dinitrotoluene in water, due to natural degradation.

C) 1.2 c .4HADNT, pH=1, Aq. ·-c 1 ·-cu E f 0.8 • •4HADNT,NonnalpH,Aq

1-z 0.6 c

<C :::z:::

0.4 • • "'t c 0 ;; 0.2 ( 4HADNT]I = 0.05 mM (,) cu • ...

LL 0 0 10 20 30 40 50

time (hours)

Figure 2: Transient concentration profile of 4-hydroxylamino-2,6-dinitrotoluene in water, due to natural degradation.

While these results appear to contrast with those published by Wang et al, who showed good stability for hydroxylamines under aqueous conditions, the one important difference is starting concentration ofhydroxylamines. While they used 50 mg!L concentrations, we used maximum concentrations of 5 mg/L in an effort to mimic our in-

14

Page 20: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

vivo systems. Hence, at such low concentrations, even small amounts of degradation are reflected as large percentage drops in the amount remaining. These experiments showed the need for good stabilizing procedure for hydroxyhunine detection, since both isomers showed different effects to lowered pH.

High TNT Concentration Experiments Motivation: Since hydroxylamines occupy a central position in the TNT -metabolism pathway, they are reacted on by a number of enzymes, and hence have very fleeting lives in the C. roseus system. In an effort to prolong their presence, high concentration TNT experiments were performed on late-exponential phase hairy roots. The effect expected was two-fold; the high concentrations of TNT would cause a higher flux through the pathway, and therefore higher concentration ofhydroxylamines, and the hence, their presence may be observed for longer. In addition, the toxicity caused by high concentrations of TNT may inhibit efficient transformation of the TNT, and hence the entire pathway may slow down in rate. This would imply that the rate of degradation of the hydroxylamines would be slowed down too, and hence an HPLC analysis may prove its presence. Late exponential phase sterile hairy roots were kept in 250 ml flasks, and the volume of the media in them was equalized to 50 ml, using Gamborg's B-5 media, with sucrose. Three different levels of TNT were added to the flasks, ranging from 0.35 to 0.45 mM. Sterile TNT solution in methanol was used. Samples were taken periodically for immediate analysis via HPLC.

Results and Discussion: Both 2-hydroxylamino, and its 4-hydroxylamino isomer were seen in the extracellular media. This is the first positive identification of these compounds in a C. roseus system, and their appearance might be contingent on the deceleration of the pathway. Figure 4 has the concentration of 2-hydroxylamine formed during the high concentration experiment, while Figure 5 has the tabular values for 4-hydroxylamine production. As the experiment progressed the roots began to look visibly stressed, as their color changed from pale white to a pinkish brown. This indicated the toxicity effect of the high TNT concentrations on the roots. The transient TNT profile indicated an uptake of TNT by the roots, although in a less efficient manner, since the rate of uptake was slower. This indicated that the transformative pathway as a whole was progressing slower. 4-hydroxylamino-2,6-DNT was seen in brief quantities for small periods oftime, with all of it disappearing within 4 hours. 2-hydroxylamino-4,6-DNT, by contrast, displayed more stability and present for at-least 9 hours before falling below detection levels. The transient profile of its concentration shows a rise in concentration, followed by a rapid decline in levels.

15

Page 21: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

16

14 -+-0.37 mM (80 mg/L) TNT .-. "' .. 12 = c; ---0.47 mM (100 mg/L) TNT E

10 Q .. .::! E 8 ._

--....-0.33 mM (70 mg/L) TNT

c Q ; 6 = .. .... c " 4 <J c Q u 2

0

0 5 10 15 20 25 30 35 40

Time (hours)

Figure 4: Transient extracellular concentrations of2-hydroxylamine in the high-TNT concentration experiment.

Initial TNT 0.33 mM 0.37mM 0.47mM

Maximum4HA 3.6 7.85

0

Hours since TNT addition 1

4.5 1

Figure 5: Extracellular 4-hydroxylamine levels in the high-TNT concentration experiment.

45

Amongst the other metabolites formed were the usual monoamines and a couple of the conjugates. These metabolites were formed at low-levels (below 5 mg/L), and this seems indicative of the fact that enzymes transforming these compounds were performing at their usual rate, and were not significantly affected by the high TNT concentration. Since hydroxylamine presence was proven in the transformation of TNT, its stabilization was next attempted in order to standardize its identification in phytoremediation systems.

Derivatization of hydroxylamines

Motivation: Optimization of a derivatization procedure for hydroxylamine stability (Wang, C. and Hughes 1998) was done, in an attempt to prevent the natural degradation ofhydroxylamines in in-vitro conditions. Derivatization stabilizes the hydroxylamine group, thereby making it more detectable by conventional HPLC schemes. The resultant products, a 2,6-dinitro-4-(N)-acetoxyacetamido toluene (figure 6), or its 2,4-isomer, are most stable than the parent hydroxylamines and hence application of this procedure was expected to improve the identification ofhydroxylamines in the C. roseus system.

The modified-derivatization procedure starts off with acidying the sample containing the hydroxylamine to a pH < 1, since the lower pH implies oxidative conditions, and hence the potential for the natural conversion of the hydroxylamines to

16

Page 22: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

their diazoxy and nitroso functionalities reduces. Methanol is added to aqueous samples to further increase the stability of the hydroxylamines and then acetic anhydride is added slowly drop-wise. The entire mixture is chilled in an ice-bath, and the products are allowed to stand for 5 to I 0 minutes, after which Sodium Acetate is added to quench the reaction. Sodium Acetate also acts to stabilize the product formed by, probably, reacting with any of the unreacted acetic anhydride.

~N ~N ~N

+ CJi-c-o---c-CI-J.i ---. II II 0 0

NHOH

Figure 6: Derivatization of 4-hydroxylamine.

0

II

~N

+ unidentified byproduct

N--c-CI-J.i I 0

I c 0

I c~

Optimization of Derivatization: Since the derivatization procedure had quite a few parameters, and was potentially going to be used to quantify hydroxylamines in the in­vivo C. roseus system, it was going to be necessary to first optimize the procedure to obtain conditions of maximum yield and stability of the hydroxylamines. Towards this end, optimization with respect to amount of acetic anhydride, sodium acetate, and delay time (time gap between addition of acetic anhydride and sodium acetate) were performed. Optimization with respect to the amount of water the derivatization procedure could successfully handle, and the minimum amount of methanol required are important parameters for which derivatization runs need to be performed. Water content in the system is especially important since all the in-vivo samples are aqueous. Important: All optimization runs were carried out using 2,6-diamino-4-hydroxylamino since the UV -vis spectra for the corresponding derivatization product was obtained from Dr. Chaunyue Wang in the Hughes lab at Rice Univ.

Acetic Anhydride Optimization: Since Acetic Anhydride is the primary reactant with the hydroxylamines, the derivatization was optimized with the amount of Acetic Anhydride added. 100, 500, 1000, 2000 and 5000 microliters of Acetic Anhydride was added to the cocktail of 4HADNT, methanol and water, acidified to pH<l. The amounts ofthese components were 75 microliters (0.1 mg/ml concentration), 0.1 ml, 0.1 ml and 25 microliters respectively. 500 microliters of Sodium Acetate was added to stop the reaction after 5 minutes of reaction time. When two distinct aqueous and organic phases were visible, the sample for HPLC analysis was taken from the organic phase. A transient concentration measurement of the derivatization product was also done, to show the stability of the derivatized product, and to show its rate of formation.

17

Page 23: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Sodium Acetate Optimization: A procedure similar to the adopted for Acetic Anhydride optimization was used. Three different amounts of sodium acetate were used to stop the reaction- 250, 1000 and 2000 microliters. 50 microliters 4HADNT, 0.1 ml Methanol, 0.1 ml water, 1 ml Acetic Anhydride and 25 microliters ofHCl were used in the derivatization reactions.

Results and Discussion: The derivatized products when analyzed using the HPLC (C8 column, 78:22, Water: IPA- mobile phase) showed two distinct peaks- one at 18 minutes, and the other at 41 minutes. In addition, a few minor peaks were seen, but the levels were very low, and hence they were not analyzed. The product eluting at 18 minutes was identified, using UV-vis match, as 4(N)-acetoxyacetamido-2,6-DNT, the derivatized product of interest. The other peak (41 minutes) was an unidentified byproduct; it appeared to be a modification of 4-hydroxylamino-2,6-DNT. The transient concentrations ofboth products are presented in figure 7. The highest concentration attained is that of the 3000 microliter Acetic Anhydride sample, after 50 hours. The general trend seems to indicate that higher amounts of Acetic Anhydride will results in higher amounts of derivatized product being formed, and the amount formed increases with time, probably reaching a maximum by 50 hours. The rationale for assuming that the maximum amount is formed in 50 hours is that the concentration of the chiefby-product formed falls to zero by 50 hours, in the case of 2000 and 3000 microliters Acetic Anhydride samples. Therefore, for Acetic Anhydride, the most suitable option is probably a large amount, in excess, and a gestation period of several hours. Comparing the concentration profiles of the by-product formed and the derivatization product reveals the fall of concentration of the formed is accompanied in a rise in concentration of the latter, and vice-versa. This indicates that an equilibrium exists between the two products, and the by-product, with time eventually reacts/degrades to become the (N)-acetoxyacetamido substituent. This is a significant finding, since it implies that the presence of the by-product means that all the hydroxylamines have not fully reacted, and more reaction time is required. As in the case of Acetic Anhydride optimization, the amount of product formed increases with the amount of Sodium Acetate added to the system. The transient concentration profiles for the derivatization product show an increase in amount of product formed with passage oftime, and a corresponding decline in concentration of by-product with time. This is again indicative of the fact that the by-product is in equilibrium with the derivatization product, and in time gets completely converted to the derivatization product. Therefore, using high amounts of sodium acetate (2 ml) and a reaction time of around 35 hours will probably give best results for the derivatization procedure. The two optimization runs of the derivatization procedure seem to indicate non-limiting, high amounts of acetic anhydride and sodium acetate are probably the best option. Also, a reaction time of around 3 5 hours is optimum for completion of reaction.

Conclusions and Future Directions The role ofhydroxylamines in the TNT transformative pathway was demonstrated by the feeding studies, where hydroxylamine fed roots produced two kinds of conjugates, as opposed to amino-fed roots that produced only one kind of conjugate. In addition, hydroxylamines were identified in high concentration TNT experiments, thereby further confirming their role in the TNT metabolism. The poor stability ofhydroxylamines was shown in aqueous studies where they disappeared within 40 hours. An optimized

18

Page 24: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

1.2

c 1 0 ~0.8 CJ 1!0.6 u. 00.4

::Eo.2

0

Transient concentration of 4-acetoxyacetamido-2,6-DNT

0 10 20 30 40

Time (hours)

-+-1000 ul AcAn ---100 ul AcAn -lk- 500 ul AcAn -*- 2000 ul AcAn ~ 3000 ul AcAn

50

Transient concentration of by-product

0.5 or------------------, 0.45

c 0.4

~ 0.35 A-__,~--.j~--CJ I!! u. Gl 0 ::!

0 10 20 30

Time (hours)

40 50

Figure 7: Transient concentration of 4-acetoxyacetamido-2,6-DNT and the byproduct in optimization w.r.t Acetic Anhydride.

19

Page 25: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

derivatization procedure was developed to stabilize the hydroxylamines, which is expected to be useful in further in-vivo TNT metabolism studies.

References: Ahmad, F. and J. B. Hughes (2000) Anaerobic Transformation of TNT by Clostridium in Knackmuss, H. J., ed., Biodegradation ofNitroaromatic Compounds and Explosives. Lewis Publishers, 185-212. Bhadra, R., R. J. Spanggord, D. G. Wayment, J. B. Hughes and J. V. Shanks (1999a) Characterization of Oxidation Products of TNT Metabolism in Aquatic Phytoremediation Systems of Myriophyllum aquaticum. Environmental Science and Technology, 33 3354-3361. Bhadra, R., D. G. Wayment, J. B. Hughes and J. V. Shanks (1999b) Confirmation of Conjugation Processes during TNT Metabolism by Axenic Roots. Environmental Science and Technology, 33 446-452. Burken, J. G., J. V. Shanks and P. L. Thompson (2000) Phytoremediation and Plant Metabolism of Explosives and Nitroaromatic Compounds in Knackmuss, H. J., ed., Biodegradation ofNitroaromatic Compounds and Explosives. CRC Press LLC, Lewis Publishers, Boca Raton, FL, 239-275. Fu, P. P. (1990) Metabolism of nitro-polycyclic aromatic hydrocarbons. Drug Metabolism Reviews, 22 (2&3): 209-268. Hughes, J. B., J. V. Shanks, M. Vanderford, J. Lauritzen and R. Bhadra (1997) Transformation of TNT by aquatic plants and plant tissue cultures. Environmental Science and Technology, 31 (1): 266-271. Nagel, D. B., S. Scheffer, B. Casper, H. Gam, 0. Drzyzga, E. V. Low and D. Gemsa (1999) Effect of 2,4,6-Trinitrotoluene and its Metabolites on Human Monocytes. Environmental Science and Technology, 33 2566-2570. Palazzo, A. J. and D. C. Leggett (1986) Effect and disposition of TNT in terrestrial plants. Journal of Environmental Quality, 15 (1): 49-52. Sandermann, J., H. (1994) Higher plant metabolism ofxenobiotics: the 'green liver' concept. Pharmacogenetics, 4 225-241. Spain, J. C. (2000) Introduction in Knackmuss, H. J., ed., Biodegradation of Nitroaromatic Compounds and Explosives. CRC Press LLC, Lewis Publishers, Boca Raton, FL, 1-5. Vanderford, M., J. V. Shanks and J. B. Hughes (1997) Phytotransformation of trinitrotoluene (TNT) and distribution of metabolic products in Myriophyllum aquaticum. Biotechnology Letters, 19 (3): 277-280. Wang, C. and J. B. Hughes (1998) Derivatization and separation of2,4,6-trinitrotoluene metabolic products. Biotechnology Techniques, 12 (11): 839-842. Wang, C. Y., Z. D. D. and J. B. Hughes (2000) Stability ofhydroxylamino- and amino­intermediates from reduction of 2,4,6-trinitrotoluene, 2,4-dinitrotoluene and 2,6-dinitrotoluene. Biotechnology Letters, 22 (1): 15-19. Wayment, D. G., R. Bhadra, J. Lauritzen, J. B. Hughes and J. V. Shanks (1999) A Transient Study of Formation of Conjugates during TNT Metabolism by Plant Tissues. International Journal ofPhytoremediation, 1 (3): 227-239. Won, W. D., L. H. DiSalvo and J. Ng (1976) Toxicity and mutagenicity of2,4,6-trinitrotoluene and its microbial metabolites. Applied Environmental Microbiology, 31 575-580. Yinon, J. (1990) Toxicity and metabolism of explosives CRC Press Boca Raton, FL

20

Page 26: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Plant Uptake and Transformation of Benzotriazoles

Sigifredo Castro 0 >, Lawrence C. Davis (2), and Larry E. Erickson <1>. KANSAS STATE UNIVERSITY

(l) Department of Chemical Engineering, KSU, Manhattan, KS 66506, Phone: 785-532-5584, Fax: 785-532-7372. (2) Department of Biochemistry, KSU, Manhattan, KS 66506,

Phone: 785-532-6121, Fax: 785-535-7278.

ABSTRACT

Benzotriazoles are used in three main applications: to prevent the corrosion of metals, to stabilize plastics against UV decomposition, and to improve photographic characteristics offilms. Recently the 1-hydroxy derivative is being considered as an alternative laccase mediator in biopulping for paper production. The majority of benzotriazoles produced go into anticorrosion applications, such as aircraft deicing fluids, automobile antifreeze solutions, and recirculating water systems. Benzotriazoles are stable compounds, toxic to microorganisms and plants, irritant to humans, and potentially carcinogenic. The release of these compounds to the environment represents an important environmental concern. Given that the structure of benzotriazoles is similar to some of the components of the lignin fraction in plants, we have studied the potential of using plants to remediate soil contaminated with these compounds. A hydroponic culture of young sunflowers (Helianthus annuus) under continuous lighting, and constantly fed at different concentrations ofbenzotriazole (BT) or hydroxy benzotriazole (HBT), was designed to determine the toxicity threshold and to gain a better understanding of the process by which the plants interact with these compounds, in an effort to elucidate the driving force ofthe process, determine the kinetics, and identify relevant parameters.

INTRODUCTION

The high stability of the chemical structure derived from the condensation of a benzene ring with a three nitrogen ring (USEPA, 1977) is one of the reasons for the use ofbenzotriazoles as corrosion inhibitors in aircraft deicing formulations (ADFs); however, the collected but not properly treated and the uncollected runoff generated from deicing processes at airports, represent a continuous source ofbenzotriazole-enriched waste that is undesirably persistent in the environment. Benzotriazole and its derivatives (except for 5-methyl-benzotriazole, 5-MBT) have been classified as resistant to bacterial degradation (Rollinson and Call ely, 1986, Castro et al, 2000, Young et al., 2001 ). The presence ofbenzotriazole derivatives has been reported in monitoring wells at some airports (Cancilla et al., 1998), because of their significant water solubility and non­reactive character (USEPA, 1977). Because ofthe known toxicity ofbenzotriazoles to microorganisms, aquatic fauna, and plants (Pillard, 1995, Castro et al., 2000), and potential carcinogenicity to mammals (NCI, 1978), suitable strategies for the environmental management ofbenzotriazole waste should be found.

We have proposed (Castro et al., 2000) the use of plants for the in situ immobilization of benzotriazole (BT) and its 4 and 5 methyl derivatives (5-methyl-

21

Page 27: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

benzotriazole and tolyltriazole (TT), which is a mixture of the 4 and 5-methyl benzotriazoles). Three factors provide the rationale for using vegetation as a strategy for triazole-waste remediation. First, triazoles are structurally similar to some substructures of the lignin in plants (see Figure 1). Second, the relatively small molecular size, their appreciable water solubility and most- important, their LogKow values (See Table 1 ), put them in the appropriate range for compounds to be mobile in the xylem and immobile in the phloem of the plants vascular system (Narayanan et al., 1999). Third, triazoles have been shown to be susceptible to react with peroxidases under specific conditions (Fenton reaction, in horseradish plant culture, or in cultures of the. fungus Phanerochaete chrysosporium, Wu et al., 1998). The mechanism by which higher plants strengthen their cell walls, known as lignification, is probably a free-radical catalyzed polymerization of methoxylated aromatic alcohols with laccase and lignin peroxidases as the catalysts (Buchanan et al., 2000). It is reasonable to think that benzotriazoles may be reactive in plants where lignin is being synthesized and be covalently bonded into plant lignin. However, as it will be shown in this paper, triazoles are toxic to plants at concentrations in the liquid phase surrounding the roots of about 75 mg!L of triazole. Higher concentrations severely damage the root system and plants are dramatically stunted or killed. Nevertheless, not only can plants grow in properly fertilized solutions with triazole concentrations lower than 75 mg!L, but they also can make BT, 5-MBT and TT disappear from the contaminated solution to be irreversibly bound to the plant material (Castro et al., 2000). Another benzotriazole derivative, 1-hydroxy-benzotriazole (HBT), with a value of LogK0 w less than I (See Table 1), shows different interactions with plants compared to the BT, 5-MBT and TT. The exact mechanism of triazole phytotoxicity and their ultimate fate within the plants remain unknown. In an effort to understand the process by which benzotriazoles can be taken up and transformed by plants, a hydroponic experiment with sunflowers (Helianthus annuus) was designed and is described in the following section. Given that 5-MBT is suspected to be degraded by bacteria (Young et al., 2001), only BT and HBT were studied.

T bl 1 S a e orne properties o

Compound

1-H-benzotriazole 1-Hydroxy-benzotriazole 5-methyl-benzotriazole

Tolyltriazole a From USEP A, 1977 b From SRC, 2001

fb enzotnazo es se ecte

Solubility in water (Wt % at 25°C)

1.98a 12c

N/A 0.55a

c From Advanced Chern Tech, 2001

22

dfi d or stu 1y Log octanol-water partition

coefficient _(L01 Kowl Experimental Predicted

1.44a 1.1711

NIA 0.11 b

N/A I. 71 b

NIA 1. 71 b

Page 28: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

---------------~- --- ----- -- ---- --

H OH I I

~ ~ 2 N 2 N

3/ 3/ N N

1-H-Benzotriazole (BT) 1-Hydroxy-Benzotriazole (HBT)

5-Methyi-Benzotriazole (5-MBT)

Figure 1 : Chemical structure of benzotriazole and some derivatives

MATERIALS AND l\.1ETHODS Pure benzotriazole (BT), and 1-hydroxy-benzotriazole (HBT) were purchased

from Aldrich I Sigma Chemical Co. The compounds were kept as aqueous stock solutions (2 giL) for later dilution to treat plants and/or prepare calibration standards for chromatography. Stored solutions appeared to be stable for at least two years. For separation and quantification of triazoles, detection at 275 nm was used with liquid chromatography on a Hamilton PRP-1 Column (150 x 4. I mm). The eluent consisted of a solution of 1 mg/L 5-MBT and 1mM potassium phosphate monobasic in methanol (50-70 Vol.%) + water at a flow rate of 1.0 mL/min. The low level of :MBT in the eluent allowed a uniform distribution of the injected sample throughout the column, which in turn resulted in a stable baseline in the chromatographic chart and more reproducible results with greater sensitivity. The phosphate was used to buffer the solution at a neutral ·pH. The methanol level was adjusted to optimize resolution of compounds of interest in each experiment. The detection limit of triazole was 2 mg!L at a range of 0.2 absorbance units for a full scale in the UV detector.

For the growth of plants, lighting was continuous with 40-watt cool white fluorescent light tubes (about I 0 four-foot tubes per m2

) at 25°C. The light energy available for photosynthetic activity fell in the range of 150 - 200 !J.mol/m2

, measured with a quantum meter (Apogee instruments).

Twenty sunflower seedlings that had been kept in nutrient-enriched moist vermiculite for fifteen days were transferred to 50 mL glass bottles with Hoagland's IX solution. The bottles were placed in a rack in such a way that light was prevented from

23

Page 29: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

reaching most part of the solution to avoid the growth of photosynthetic algae. The plants were held by an open cell polyurethane foam assuring that only the roots were in contact with the water phase. After a four-day period for adaptation in the liquid system, the solution was changed to triazole-contaminated solution, where they were kept for 4 more days. Four of the plants were kept as controls, and two groups of eight were treated with either BT or HBT at concentration levels of 10, 20, 30, 50, 75, 100, 150, and 200 mg I L. Their initial fresh weights varied from 2.45 to 3.37 g right before starting the treatment and the triazole level for each plant was randomly assigned. The solution taken up by the plants was measured and brought back to the original 50 mL volume daily with fresh triazole-amended Hoagland's solution, after which a 1 mL sample was taken for HPLC analysis. The plant fresh weight was also measured daily.

RESULTS With respect to water uptake, control plants showed an increase in the volume

taken up daily since the plants were growing (see Figure 2). The same occurred for plants treated at low concentrations of triazole. For plants treated with benzotriazole, concentrations from 50 to 75 mg/L seemed to have affected the water uptake, but for concentrations of 200 mg/L, progressively diminished dailY- water consumption was observed (see Figure 2).

2 ""'" ~ "-"'

Cl) .::.: co -0.. ::s I-. Cl) -co ~

35

30 .1 0 0 0 30 • 75

25

20

15

10

5

0 +-----------.-----------.------------,----------~

0 50 100

Elapsed time (h)

150 200

Figure 2: Water uptake for sunflowers before and after treatment. with benzotriazole. Error bars show the standard deviation for the average of 4 control plants. Dotted lines show the data after treatment started.

In the case of hydroxy benzotriazole, concentrations equal or less than 75 mg/L did not seem to severely affect the water uptake; however, at a concentration of I 00 mg/L and higher, the water uptake was dramatically diminished the next day (see Figure 3).

24

Page 30: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

2 ~

-~ '-" ~

~ -c. = 1-o

~ ~

----------------

35

30 j 0 0 .& 75 0 100 X 200 mg HBT I L /.. .................................................... .

-. . . . -

25

20

15

10

5

························································:··~:·~·:··~x~·:·:·:·:-•· ... -~ .......................... . X· ·······~:: .. -···g

.................................................................................... £1··'"''~.:.:.~ .. : ............................................................................ .

0 +-----------~------------.------------.----------~

0 50 100

Elapsed time (h)

150 200

Figure 3: Water uptake for sunflowers before and after treatment with hydroxy benzotriazole. Error bars show the standard deviation for the average of 4 control plants. Dotted lines show the data after treatment started.

To confirm the observations from the water uptake, the percentage change in the fresh weight during the four days of treatment was calculated as shown in Equation 1:

01 r:-. h r:rr . h Ch Final weight -Initial weight * 1 0001 /orres rrerg t . ange = /o

· Initial weight Equation 1

Figure 4 shows that the percent change in fresh weight for plants treated with benzotriazole was in the lower range for the· control plants for concentration between 20 and 75 mg/L and started to be less than the controls with a concentration of 100 mg/L. The effect seemed progressive, being more pronounced as the initial concentration was higher. For the hydroxy benzotriazole it seemed that the plants were able to tolerate and grow as the control plants at concentrations less than 75 mg/L; however, the plant treated with a concentration of l 00 mg/L immediately showed a dramatic effect, similar to the plants with concentrations of 150 and 200 mg/L. Thus: it was concluded that a concentration of 75 mg/L of any of the triazoles was the maximum level tolerable by the plant without a severe effect in the water uptake and growth of the plant. Concentrations of benzotriazole higher than 75 mg/L affect the plant progressively, whereas concentrations of hydroxy benzotriazole higher than 75 mg!L may quickly lead to plant death.

25

Page 31: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

100 . .

80 ........... .. ....... :~,~·- .. :+: ··~,,*_·········· , ... :.......... .. ......... , ........... : ............... . 60 • !:., t ' • + .

~ 40 .! u ';t. 20

. : : .................. ~ ................... ; .................. i .................. 'i" ............. :t ................ L............. . = ................. ; ................. T ................ .

0

-20

i : : i ·, i . l ................. ~.: .................. ,.t"""""'"'""+"·"""""""""•i-................ ~.: .......... ~-~-'+~~----......... : ............ ~·~·~,.t:·~·;·,····· ................ .

...... : ... · .................. f .................................... f ................. -[-................ L ................ ; ................ :..._.: ............. J .................. j ................ ..

-40 +---~-----r----+---~-----r----~---T----,_----r---~

-25 0 25 50 75 100 125 150 175 200 225

TriaZDie concentration (mg!L}

• BT · · · •· · · HBT X Control I

Figure 4: Percent fresh weight change for young sunflowers treated with BT and HBT for 4 days (initial fresh weights varied from 2.45 to 3.37 g).

With respect to the interactions of triazole with plants, the amount that was "transformed" by the plant between sampling periods could be estimated by a mass balance, as shown in Equation 2, since the concentrations at the beginning and at the end of each sampling period and the volume and concentration of the solution added were known.

Initial amount + Amount added - Amount transformed = Final amount Equation 2

In terms ofthe known data, Equation 2 will take the form ofEquation 3:

Equation 3

Where C is the initial concentration for interval I, V is the total volume of solution, SU is the volume of solution taken up by the plant, Cs is the concentration of the solution added, and C+J is the initial concentration for interval i+ 1.

Thus, the amount of triazole transformed can be found for each interval; the total amount lost for the 4 days of the experiment will be the sum of the amounts lost during

26

Page 32: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

each interval. Figure 5 shows the total amount transformed for each case for both treatments at all levels; it was clear that the plants were taking up the benzotriazole and the higher the initial concentration the more triazole was being transformed. However, the hydroxy benzotriazole-treated plants did not show the same behavior, and a particular trend could not be identified. The hydroxy benzotriazole seemed to be transformed, but the rate for the process was slower than that one for the benzotriazole.

12

p I +--------------------------------------------------~~~ ~ 10

'""' 0 E-<

8 m llii!l: ff

'-"' '"0 6 (!)

§ ..9

r:/l c:: 4 c:e '""' ..... § 0 2 ~ • 0

~

1[-1~ 10 20 30 50 75 100 150 200

Initial concentration (rng!L)

jsBT DHBTj

Figure 5: Total amount of triazole (mg) that was transformed by sunflowers treated with BT or HBT for 4 days

CONCLUSIONS A concentration less than 75 mg/L ofBT seems to affect slightly the water uptake

and growth rate of young sunflowers. A toxic effect, corresponding to plant weakening, stunted growth and reduction in water uptake is observed when the input BT concentration is greater than 75 mg/L; the effect is highly toxic at 200 mg/L. In the case of HBT, concentrations equal or less than 75 mg/L did not seem to severely affect the plants; concentrations equal or greater than 100 mg!L of HBT impact the plant severely rather quickly. HBT does not seem to be phytotransformed at the same rate as the BT and it does not seem to be taken up with the water coming into the plant, i.e., it may be excluded by the plant or phytotransformed very slowly. The differences in the phytotransformation of these compounds could be because plants can interact with them through different enzymatic pathways. Parameters such as LogKow for these compounds

27

Page 33: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

are different and may be correlated with the observed phenomena; thus LogK0 w could be used in future research to predict plant response to different triazole compounds.

ACKNOWLEDGMENTS This research was partially supported by the U.S. E.P.A. and the U.S. Air Force

under assistance agreements R-819653, R-825549, and R-825550 to the Great Plains Rocky Mountain Hazardous Substance Research Center for regions 7 and 8 under projects 94-27 and 98-3. It has not been submitted to the EPA for peer review and, therefore, may not necessarily reflect views of the agency and no official endorsement should be inferred. The Center for Hazardous Substance Research also provided partial funding.

REFERENCES

Advanced ChemTech, 2001. Advanced ChemTech Inc website. Material Safety Data Sheet. b.t.t.P.J!.w.w.w..J2~P.ti4.~,.g_Qm/.MSR.S/.g~f~.Y.It.~.~P.

Buchanan B.B., Gruissem, W., and Jones, R.L. (2000). Biochemistry and molecular ·biology qfplants. ASPP, Rockville, MD. 1294-1300.

Cancilla, D.A., J. Martinez, and G.C Van Aggelen, 1998. "Detection of aircraft deicing/anti-icing fluid additives in a perched water table monitoring well at an international airport." Environ. Sci. Techno!., 32, 3834-3835.

Castro S., L. Davis, and L. Erickson, (2000). "Experimental study of phytodegradation kinetics of methyl benzotriazole" Proc., 30th Annual Biochemical Engineering Symposium''. Dhinakar S. Kampala (Editor). University of Colorado, Boulder, CO.

Narayanan, M., L.E. Erickson, L.C. Davis, 1999. Simple plant-based design strategies for volatile organic pollutants. Environ. Prog., 18 (4), p. 231-242.

N.C.I. (1978). "Bioassay of benzotriazole for possible carcinogenicity" National Cancer Institute, Bethesda, MD, DHEW Pub No. NIH 78-1338.

Pillard, D.A., 1995. "Comparative toxicity offormulated glycol deicers and pure ethylene and propylene glycol to Ceriodaphnia duhia and Pimephales promelas." Environ. Toxicol. Chem., 14, 311-315.

Rollinson, G. and A.G. Callely, 1986. "No evidence for the biodegradation of benzotriazole by elective culture or continuous enrichment." Biotech. Lett. 8, 303-304.

SRC, 2001. Syracuse Research Corporation Website, Environmental Science Division. Interactive LogKow Demo; J:mp;//.~§.9.,.~Y!I~~,.~Qm/..i.nt~IkQw/..kQ.W.~t~m.9.,.h.tm

USEPA, 1977. "Investigation of selected potential environmental contaminants: benzotriazoles." EPA 560/2-77-001.

Young, K., Erickson L.E., Davis, L.C, and Castro S. (2001) "Microbial degradation of 5-Methyl benzotriazole" Proc., 3rt Annual Biochemical Engineering Symposium". Larry E. Erickson (Editor). Kansas State University, Manhattan, KS.

Wu, X., Chou, N.C., Lupher, D., and Davis, L.C. (1998). "Benzotriazoles: Toxicity and biodegradation." Proc. 13th Annual Conf Hazardous Waste Research, Kansas State University, Manhattan, KS, 374-384.

28

Page 34: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Abstract

Microbial Degradation of 5-Methyl Benzotriazole Kaila Young, Larry Erickson, Lawrence Davis, and Sigifredo Castro Diaz

Kansas State University Manhattan, Kansas 66506

Methyl benzotriazole (MBz) is used for several industrial applications and this contaminant is currently under study as a candidate for removal by phytoremediation. However, recent observation has shown that 5-MBz degradation occurs in the liquid remaining in the test vessels after the removal of the plants used for experimentation. Additionally, 5-MBz degradation has been observed in solutions containing samples of soil that had been previously exposed to the contaminant. Subsequent purification and separation of the microbial consortiums present in the liquid and soil media led to the isolation of several morphologically distinct organisms believed to be responsible for the observed degradation. However, cultures of the separated organisms showed no ability to degrade the 5-MBz even when exposed to a variety of growth media and conditions. It was hypothesized that the organisms may exist in a symbiotic relationship so that 5-MBz degradation cannot occur in the absence of a bacterial consortium. Further experimentation will be conducted to investigate conditions under which the isolated organisms can degrade the contaminant. Work has also been done to elucidate the pathway by which the microbes degrade the 5-MBz. When both toluene and 5-MBz are present in solution containing the mixed culture, the degradation of toluene, which has a structure similar to a segment of the 5-MBz molecule, occurs at a slower rate than the rate observed when only toluene is present. This suggests that both substances may be degraded by similar mechanisms or that 5-MBz may inhibit toluene biodegradation.

Background

Federal Aviation Administration (FAA) requires that, for the safety of passengers, airplanes be subject to rigorous deicing standards in the winter months. To comply with the regulations set by the FAA, airports across the country routinely use massive amounts of deicing solution each year to remove ice from plane wings before takeoff, so that the plane is not dangerously weighted down. However, the deicing solutions generally contain compounds that can potentially cause corrosion damage to the plane. Therefore, a mixture of 4- and 5-methyl benzotriazole (MBz, Figure 1 ), a corrosion inhibitor, is commonly added as a preventative measure. Another common use for MBz derivatives is protection of plastics against ultraviolet decomposition, as MBz is highly UV stable. H As indicated by the UV stability of the compound, I MBz is highly persistent in the environment and · N \\ has shown toxic effects to plants and other N organisms. Studies have found that prolonged ;.1

yl exposure to MBz caused paralysis/central nervous 1 l

system disorders in mice, and the compound is CH believed to inhibit protein synthesis (USEP A, 3 1977). Therefore, studies have been underway for Figure 1. 5-Methyl benzotriazole (MBz). several years in the search for an effective

29

Page 35: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

removal technique for the contaminant. A literature search revealed that degradation of MBz by microbes had not been reported (Rollinson, 1986). Instead, phytoremediation was under study as a means to remove the MBz contamination (Castro, et. al, 2000).

Experimentation and Results

Many phytoremediation studies are done under hydroponic conditions, where the plants are grown in solution, without soil, so that sampling can be easily done (Castro, et. al, 2001). In January of 2000, 5-MBz degradation was observed in the absence of vegetation in solutions previously used for these hydroponic studies. Subsequent 5-MBz feeds were also degraded in the absence of plants (Figure 2) and the growth of bacterial colonies was noted in the solution. The reader should note that the sharp increase of 5-MBz concentration on Day 3 corresponds to a

100%

90%

80% Cl 70% c: ·c:

60% "iii E 50% Q)

0:: 40% N

CD :lE 30% cfl 20%

10%

\ 1\.. \ '\. \ '\. \ '\.

\ .~ \ \.

\ ~ \ ~

\ .............. 1;:1

0%

0 2 4 6 8 10

Time, Days

Figure 2. Decrease in 5-MBz concentration in hydroponic liquid.

"reloading" of the sample with additional 5-MBz so that a degradation rate could be estimated. The work here applies only to the 5-methyl derivatives. The 4-methyl form is not metabolized by the bacteria described herein (unpublished observation).

Bacterial Isolation

In the effort to isolate and determine the identity of the 5-MBz degrading organisms, streak plates were prepared using Hoagland's Solution, the nitrogen-rich fertilizer used for hydroponic studies, as a base. Half of the prepared plates contained varying concentrations of 5-MBz in the agar support as a feed source for the organisms. The rest of the feed plates were simply an agar base with a crystal of 5-MBz placed in the center, so that the 5-MBz could diffuse through the agar and a zone of optimum 5-MBZ concentration could be observed. After 72 hours, bacterial growth appeared on the plates in the form of 2 distinct colony morphologies: ( 1) small, clear and drop like and (2) large, yellow smears with blurred edges. Gram staining revealed that both colony types were gram (+). Zones of inhibition and optimum 5-MBz concentration were evident around the solid 5-MBz. The two colonies also grew on nitrogen-free media, which raises the point that 5-MBz may serve as a nitrogen source. However, in this case, the nitrogen source may have been the agar support. Excessive growth after only 24 hours was observed on plates that were rich in nutrients, indicating that 5-MBz is not a good sole source of energy.

30

Page 36: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

In order to ascertain plausible sources of the 5-MBz-degrading organisms, soil from planted zones-channels that had been planted with alfalfa and cylinders that had been planted with grass-that had been treated previously with 5-MBz was removed and suspended in a fertilizer/5-MBz solution. A degradation of 98% of the initial 5-MBz concentration was noted after only 48 hours in all samples except the soil that had never been treated with 5-MBz (Figure 3). Note that no bacterial degradation of 5-MBz occurred in the solutions containing soil samples that had not been exposed to the contaminant, which indicates that the degradation is caused by an inducible enzyme or an increased population of microbes selected for MBz use. The morphology of isolated soil bacteria from these samples-small, clear, and drop like colonies-was similar to that of colony type ( 1) isolated from original hydroponics liquid.

100% •••••••••••••••• G •••••••••••••••••••••

90%

80% C) s::: 70% c: "i 60% E G) 50% a: N 40% aJ :!:

30% ~

20%

10%

0%

0 2 Time, Days

Figure 3. Degradation of 5-MBz by ubiquitous soil bacteria found in planted zones previously treated with MBz: Cylinder 1 (L\.), Cylinder 2 (x, dashed line), Cylinder 3 (* , dashed line), Channell(*), Channel2 (+),Soil untreated with MBz (o, dashed line).

The isolated bacteria from the hydroponic solution and the soil samples were grown under a variety of conditions and on an assortment of media in the presence of 5-MBz. Semi-solid agar, liquid media, and flooded plates in which a layer of 5-MBz-containing solution was laid over a lawn of bacteria were all tried, however, even after several months, no degradation of the 5-MBz in the cultures was ever observed. Even recombining several isolated bacterial colonies into a single liquid culture did not lead to degradation of the 5-MBz. It was hypothesized that the oxygen or nutrient needs for the cultures were not being met. In original liquid culture, the bacteria appeared to exist in suspended "clumps", where the observed flocculations possibly allowed for nutrients to be available in the necessary concentrations. There is also the possibility that a consortium is necessary for degradation of the 5-MBz. The ineffectiveness of re­combining the bacteria indicates that, after several doubling times in the absence of other members of the symbiotic relationship, the bacteria may lose the ability to break down the 5-MBz even when they are again in the presence of other bacteria of the consortium. Alternatively, the effective organisms might not be culturable under the tested conditions. Although further experimentation will be conducted to investigate conditions under which the isolated organisms can degrade the contaminant, perhaps isolated colonies will never be able to perform the desired degradation.

31

Page 37: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Kinetics Studies

In addition to experimentation on the isolated bacterial colonies, work has also been done to ascertain the pathway by which the original consortium degrades the 5-MBz. Because of the similarity in molecular structure between toluene and 5-MBz, the kinetics of the degradation of both species were studied under the hypothesis that the mechanism of 5-MBz breakdown is related to that of the toluene degradation. In addition, the bacteria seem unable to degrade the MBz-related compound benzotriazole, which lacks the methyl group on the benzene ring. These kinetics experiments involved exposing the bacterial culture to both substrates and evaluating the effects of the presence of each substrate on the degradation of the other. Bottles were prepared with bacterial culture and 4. 7 1-1moles toluene, for each sample, with varying concentrations of 5-MBz. Each solution was diluted to either 11.5 or 15 mL, so that the head space in each bottle was 52.5 or 49 mL. The bottles were subsequently placed on a shaker. At the highest concentration of 5-MBz, about 90 mg/L, there were approximately four 1-1moles of 5-MBz per 1-1mole of toluene in the solution phase; at the lowest concentration of 5-MBz, 3 mg!L, there were approximately 0.15 1-1moles of 5-MBz per 1-1mole of toluene. The concentrations of each component were measured over 20 hours (Figures 4 and 5). The reader should note that the scales on the y-axes are not consistently delineated from 0 -100%.

100%

90%

80% Cl c

70% c iii E 60% CD 0::

50% CD c CD 40% ;:, 0 1- 30% :::e 0

20%

10%

0%

~··· ~ ~··· ... ~- "

. . . . . . . ~- "

. . . . . . ~-~

T

""· " ~-"'-"-~

~"" ~ 0 5 10 15 20 25

Time, Hours

Figure 4. Bacterial degradation of toluene in the presence of 5-MBz. The initial concentrations of 5-MBz in solution are: 86 mg!L (x), 28 mg!L (+, dashed line), 8.6 mg!L (.6), 2.8 mg/L (*, dashed line), and 0 mg/L (o). The initial toluene molar concentration is approximately 0.18 mmolar in the liquid phase.

Inspection of Figure 4 reveals that 5-MBz in large concentrations inhibits the consumption of toluene as a substrate. This may mean that 5-MBz is degraded by the same mechanism as that for toluene, where competitive inhibition may be occurring between the two substrates.

32

Page 38: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

100%

95%

Cl c 'E 90% iii E Cll It: N m 85% :::!: :.e 0

80%

75% 0 5

---------

----·------------------------

10 Time, Hours

--------------·-···

15 20

Figure 5. Bacterial degradation of 5-MBz in the presence of toluene. The concentrations of 5-MBz in solution are: 86 mg/L (x), 28 mg/L (+,dashed line), 8.6 mg/L (6), and 2.8 mg!L (*, dashed line). The initial toluene molar concentration is approximately 0.18 mmolar in the liquid phase.

Figure 5 illustrates that low levels of 5-MBz in the presence of toluene are not rapidly degraded, indicating that the bacterial consortium simply feeds on the component in the highest concentration.

In order to determine if the presence of toluene affects the degradation of similar concentrations of 5-MBz, the original culture was exposed to two conditions: 5-MBz +toluene and 5-MBz alone. The concentrations of toluene and 5-MBz were measured over 50 hours (Figure 6).

100%

90%

80%

Q 70% c 'E iii 60% E Cll 50% It: N m 40% :::!: :.e 30% 0

20%

10%

0%

K·· .. ~

. . . .

""' . . .

"'-. . . .

""' ~ . . ~

. . . "''•, "'·. ~·. ~

0 10 20 30 40 50

Time, Hours

Figure 6. Bacterial degradation ofMBz in the absence (6, dashed line) and presence (o) of 0.15 mmolar equivalents of toluene. The initial MBz concentration was 50 mg/L.

33

Page 39: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

From inspection of Figure 6, the degradation of 5-MBz in the presence of toluene was slightly faster than in the absence of toluene. Repetition of this experiment confirms these results. The addition of amounts of toluene from 0.5 to 3.0 times the molar concentration of the 5-MBz induces slightly faster degradation of 5-MBz.

Conclusions and Future Work

From the two sets of kinetic experiments and recent duplicate experiments, it appears that the bacterial consortium can degrade either toluene or 5-MBz-and degrades preferentially the component in the higher concentration. This leads to the possible conclusion that 5-MBz may be consumed by the same mechanism as toluene, where toluene di-oxygenase is the enzyme responsible for toluene degradation. Measurements of the increasing C02 levels over time in the headspace of samples exposed to 5-MBz, toluene, or both, indicate that the compounds serve as substrates and are at least partially mineralized.

A continuation of the kinetic studies described here is planned. Growth studies are also underway to determine the doubling time of the bacterial consortium in the presence of 5-MBz and toluene.

Acknowledgments

This research was partially supported by the USEP A and the U.S. Air Force under assistance agreements R-819653, R-825549, and R-825550 to the Great Plains-Rocky Mountain Hazardous Substance Research Center for regions 7 and 8 under projects 94-27 and 98-3. It has not been submitted to the EPA for peer review and, therefore, may not necessarily reflect the views of the agency and no official endorsement should be inferred. The Center for Hazardous Substance Research also provided partial funding.

References

Castro, S., L.C. Davis, and L.E. Erickson, 2000. Experimental Study ofPhytodegradation Kinetics of Methyl Benzotriazole. Proceedings of the 30th Annual Biochemical Engineering Symposium, University of Colorado, Boulder, CO. p. 33-41.

-----. Plant-Enhanced Remediation of Glycol-Based Aircraft Deicing Fluids. 2001. Practice Periodical of Hazardous, Toxic, and Radioactive Waste Management, Vol. 5, No.3, pp. 141-152.

Rollinson, G. and A. G. Callely, 1986. No Evidence for the Biodegradation ofBenzotriazole by Elective Culture or Continuous Enrichment. Biotech. Lett. 8, p. 303-304.

USEP A (1977). Investigation of Selected Potential Environmental Contaminants: Benzotriazoles. USEPA 560/2-77-001

34

Page 40: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Understanding Protein Structure-Function Relationships in Family 47 a-1,2-Mannosidases through

Computational Docking of Ligands Chandrika Mulakala and Peter J. Reilly*

Department of Chemical Engineering, Iowa State University, Ames, Iowa

ABSTRACT Family 4 7 a-1 ,2-mannosidases are crucial enzymes involved inN-glycan maturation in the

endoplasmic reticula and Golgi apparati of eukaryotic cells. High-resolution crystal structures of the human and yeast endoplasmic reticulum a-1,2-mannosidases have been recently determined, the former complexed with the inhibitors 1-deoxymannojirimycin and kifunensine, both of which bind in its active site in the unusual 1 C4 conformation. However, unambiguous identification of the catalytic proton donor and nucleophile involved in glycoside bond hydrolysis was not poss­ible from this structural information. In this work, a-D-galactose, a-D-glucose, and a-D-mannose were computationally docked in the active site in the energetically stable 4C1 conformation as well as in the 1 C4 conformation to compare their interaction energetics. From these docked struc­tures, a model for substrate and conformer selectivity based on the dimensions of the active site was proposed. a-D-Galactopyranosyl-(1 ~2)-a-D-mannopyranose, a-D-glucopyranosyl-(1 ~2)­a-D-mannopyranose, and a-D-mannopyranosyl-(1 ~2)-a-D-mannopyranose were also docked into the active site with their nonreducing-end residues in the 1 C4 and £ 4 (representing the trans­ition state) conformations. Based on the docked structure of a-D-mannopyranosyl-£4-(1 ~2)-a­D-mannopyranose, the catalytic acid and base are Glu132 and Glu435, respectively.

INTRODUCTION Class I a-1,2-mannosidases (EC 3.2.1.113), which form glycosyl hydrolase Family 47,1 are

crucial enzymes inN-glycan synthesis in eukaryotic organisms. Glycoprotein synthesis through the N-glycan pathway begins with the transfer of Glc3Man9GlcNAc2, a preformed mannose-rich oligosaccharide, from dolichyl phosphate to freshly synthesized polypeptides in the endoplasmic reticulum (ER).2-4 a-Glucosidases and a-1,2-mannosidases of the ER trim Glc3Man9GlcNAc2 to form Man8GlcNAc2, which is transported to the Golgi apparatus for further processing. Subseq­uent trimming by the Golgi Class I a-1 ,2-mannosidases results in Man5GlcNAc2 formation, which is necessary for the maturation of the N-glycan to hybrid and complex oligosaccharides.

The crystal structure ofthe catalytic domain ofER Class I a-1,2-mannosidase of Saccharo­myces cerevisiae has been recently determined.5 It has an unusual (a,a)rbarrel structure, with an N-glycan (Fig. 1) from one molecule extending into the barrel ofthe adjacent symmetry-related molecule, interacting with the putative enzyme active site. A C-terminal ~-hairpin protrudes into the center of the barrel from one side, plugging it. The other side of the barrel has a -25 A-diam­eter funnel-shaped cavity that decreases to -5 A at the funnel tube, which is also plugged by a Ca2

+ ion. Site-directed mutagenesis of Arg273, located in the funnel neck in the yeast enzyme, to Leu273 (all residue numbering henceforth is based on the yeast enzyme) allowed the enzyme to cleave all four a-1,2-linked mannosyl residues, rather than just the single residue of the middle arm ofMan9GlcNAc2. 6

A crystal structure of the catalytic domain ofhuman ER Class I a-1,2-mannosidase, both with and without the potent inhibitors 1-deoxymannojirimycin (DMJ) and kifunensine (KIF) (Fig. 2), has also been recently determined.? Both inhibitors bind to the enzyme at the base of its

35

Page 41: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

active site, with the Ca2+ ion coordinating and stabilizing

02 and 03 hydroxyls of the six-membered rings of both inhibitors in their 1C4 conformations. The overall structures of the yeast and human enzymes are essentially the same. Although the amino acid sequences of the two enzymes are no more than 35% similar, the positions of the amino acid residues that make up the active site in the two crystal structures are also practically identical.

This structural data, when combined with the distances of the catalytic acid and base from the glycosidic bond needed for catalysis, 8 suggest that the only candidates as catalytic proton donor and nucleophile are Glu132, Asp275, and Glu435.5,7 Due to the absence of a terminal middle-arm mannosyl residue in the crystal structure, ASN96 however, the two catalytic residues could not be clearly Fig.1. Man9GtcNAc2 schematic. The M7-identified. The 1C4 conformation of the inhibitors in the -1 M10 bond is cleaved by Family 47 ERa-catalytic site in the human enzyme suggests that the term- 1,2-mannosidases. Residues in white do

1 not appear in the yeast a-1,2-mannosi-inal mannosyl residue would also adopt the C4 conformat- dase crvstal structure.

ion. Given this ring pucker, Glul32 would then have to be the catalytic acid for the enzyme to invert product conformation, and hence Asp275 or Glu435 would be the catalytic base.5,7 Since Glu132 is too far away from the glycosidic oxygen atom for direct attack, the water molecule Wl95 would have to mediate proton donation, suggesting an unusual catalytic mechanism for this enzyme. 7

We have used automated docking of ligands into the active sites of glycosyl hydrolases and the carbohydrate-binding site of surfactant protein D9-15 with Auto Dock 16-18 to predict different bound ligand conformations. Therefore we thought that it would be possible to use computation

H0~6

5N1 ,4 2

HO'' 3 OH

OH

(a)

(g)

HO::Q

6

5 N 1 4 2.

Ho'' 3 ''oH

OH

(b)

0 0

~ ~t" HO:Q( HO

4 2 .

H ''oH HO''' 3 OH

OH OH (c) (d)

(h)

»,,OH 6

HO HO .

H ''oH OH

OH OH

(e) (f)

6

HOXYr o: 3 5 2 1 0

,,0 ~

OH HO

OH

OH

(i)

Fig.2. Ligands selected for docking: (a) 1-deoxymannojirimycin (DMJ); (b) 1-deoxynojirimycin (DNJ); (c) kifunensine (KIF); (d) a-0-galactopyranose (Gal); (e) a-D-glucopyranose (Gic); (f) a-0-mannopyranose (Man); (g) a-0-galacto­pyranosyl-(1-+2)-a-D-mannopyranose (Gai-1,2-Man); (h) a-0-glucopyranosyl-(1-+2)-a-D-mannopyranose (Gic-1,2-Man); (i) a-D-mannopyranosyl-(1-+2)-a-D-mannopyranose (Man-1 ,2-Man).

36

Page 42: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

to supplement the available experimental knowledge of the active-site function and hydrolysis mechanism of Family 47 a-1,2-mannosidases. In the present study, DMJ and KIF in t.he 1C4

conformation were first docked to validate that AutoDock would yield the same bound structures as found by x-ray crystallography. Next DMJ, 1-deoxynojirimycin (DNJ), a-D-galactopyranose (Gal), a-D-glucopyranose (Glc), and a-D-mannopyranose (Man) (Fig. 2) were docked both in the 4C1 conformation, favored by these molecules in solution, as well as in the 1C4 conformation to compare the binding energetics of the two conformations with the active site. To understand the nature of transition-state stabilization in a-1 ,2-mannosidase, the disaccharides a-D-galactopyra­nosyl-( 1 ~2)-a-D-mannopyranose (Gal-l ,2-Man), a-D-glucopyranosyl-(1 ~2)-a-D-mannopyra­nose (Glc-1 ,2-Man), and a-D-mannopyranosyl-(1 ~2)-a-D-mannopyranose (Man-1 ,2·-Man) (Fig. 2) were docked into the active site. Each of these ligands was docked two ways, with the non­reducing-end residue either in the 4C1 or the £4 (representing the transition state) conformations.

COMPUTATIONAL METHODS Automated Docking

AutoDock employs a Lamarckian genetic algorithm (LGA) to perform a search of the con­formational space of the ligand. In the AutoDock implementation of the genetic algorithm,I9 the genes are a string of real values representing the three cartesian coordinates for the ligand trans­lation, four variables for the quatemion defining the ligand orientation, and one real value for each ligand torsion, in that order.

The LGA uses a local search algorithm from Solis and Wets (SW)20 after the global search performed by the genetic algorithm, and the local search results are inherited by the offspring. This search algorithm is adaptive; the step size is modified depending upon the recent energy history. User-defined numbers of consecutive increases or decreases in the energies cause the step size to be doubled or halved, respectively. A slightly modified version of the SW method that allows different step sizes for different genes has also been implemented in AutoDock.I9

The yeast a-1,2-mannosidase crystal structure (PDB IDL2) was chosen for this docking study. All hydrogen atoms in both protein and ligands were explicitly modeled, with polar hydro­gen atoms being assigned Lennard-Jones 12-10 hydrogen bonding parameters and nonpolar hydrogen atoms being assigned 12-6 parameters. Hydrogen atoms were added to structure 1DL2 followed by a small minimization run to optimize their positions using CHARMM.2I All water molecules were removed while docking. Partial charges were assigned to the protein atoms using all-atom charges of the AMBER force field.22 Atomic solvation parameters and fragmental atom volumes were added using the AddSol program provided in the AutoDock 3.06 suite.

In the testing phase of the study, DMJ and KIF structures were first isolated from the crystal coordinates of human a-1,2-mannosidase. Hydrogen atoms were added using Babel23 and partial charges for inhibitors and mannosyl substrates were generated using MOPAC.24 Rotatable ligand bonds were defined using the Auto Tors module of AutoDock.

The grid maps for van der Waals and electrostatic energies were prepared using AutoGrid version 3.0, 19 with 101 X 101 X 101 points spaced at 0.375-A distances. The grid was centered on the Ca2

+ ion at the base of the active site. AMBER force-field parameters were used for evaluat­ing non bonded interaction energies. 22 The Ca2

+ parameters were the same as those used by Allen et al. !5 Electrostatic interactions were evaluated using a distance-dependent dielectric constant to model solvent effects.

For the LGA, the size of the initial random population was 50 individuals, the maximal num­ber of energy evaluations was 1.5 X 106

, the maximal number of generations was 80, the number

37

Page 43: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

of top individuals that survived into the next generation, the elitism, was 1, the probability that a gene would undergo a random change was 0.02, the crossover probability was 0.80, and the average of the worst energy was calculated over a window often generations.

The pseudo-SW method was used for local searches. There were a maximum of 300 iterat­ions per local search, the probability of performing a local search on an individual was 1.0, the maximal number of consecutive successes or failures before doubling or halving the step size of the local search was 4, and the lower bound on the step size, 0.01, was the termination criteria for the local search. A total of 1 00 dockings were performed in each docking run. In analyzing the docked conformations, the clustering tolerance of the root mean square positional deviation was 1.0 A. The crystal coordinates ofDMJ and KIF were references for their docking. For disacchar­ide dockings, crystal coordinates ofDMJ and M7 (Fig. 1) served that purpose.

The involvement oftwo crystal-structure water molecules, W54 and W195, in the catalytic mechanism for nucleophilic attack and mediation of proton donation, respectively, has been sug­gested.7 It is likely that these waters would be displaced upon substrate entry, so they were opti­mized using the local search algorithm of AutoDock. The minimization parameters were the same as those used for the LGA local search.

The main aim of this docking study was to identify the catalytic residues by determining the bound conformation of a mannosyl residue in the tube of the active-site funnel, so a local search of the conformational space inside this tube was needed. However, a global search was expected to yield interesting results also. Therefore, to suit our specific docking needs we increased the local search character of the LGA by making the probability of local search 1.0, and reduced the global search character by keeping the maximal number of generations over which the genetic algorithm is looped to 80. Also, to reduce the search of meaningless conformational space, the initial conformation of the docked ligand was placed in the active-site funnel tube by superim­posing it on the DMJ or KIF crystal coordinates.

All docking jobs were run on an SGI Origin 2000 with a 300-MHz MIPS R12000 processor and 1GB of memory running IRIX 6.5.

RESULTS

Automated Docking Validation of the docking procedure with inhibitors

The first step was to validate our method by docking DMJ and KIF in the active site of the yeast crystal structure to check for agreement with the observed structures. Since the two enzyme active sites are practically the same, human a-1 ,2-mannosidase was superimposed on the yeast structure to obtain the corresponding positions of KIF and DMJ in the yeast enzyme, and these coordinates were used to compare the docked structures. The lowest-energy docked structure of KIF_IC4 (KIF in the 1C4 conformation) has a final docked energy of -107.0 kcal/mol; it docked with a root mean square deviation (RMSD) of0.72 A to the crystal structure (Table I). Similarly, docking ofDMJ_IC4 gave a final docked energy of -95.0 kcal/mol and an RMSD of0.62 A from the crystal structure (Table I). The agreement was sufficiently good to proceed with the docking of other molecules into the enzyme active site.

DNJ, a Glc analogue, does not inhibit a-1,2-mannosidaseJ As a negative control, DNJ-1C4 was docked into the enzyme active site to compare its docking energy with that of DMJ. The lowest-energy docked structure ofDNJ- 1C4 has a total docking energy of -88.5 kcal/mol with a 1.31-A RMSD with the crystal structure ofDMJ-1C4. This energy is higher than the docking

38

Page 44: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

energy of DMJ-1 C4 and apparently is not sufficient to compensate for the loss of entropy and sol­vation energy incurred by it upon binding, i.e. its free energy of binding is not negative.

Monosaccharide docking simulations The lowest-energy docked structure of Man-1 C4, with an energy of -93.0 kcal/mol, does not

dock in the active-site funnel tube. Instead, it docks just outside the funnel neck away from M7. This structure hydrogen-bonds Asp61, Trp82, Arg136, and Glu497. Ofthese, only Arg136 is conserved, implying that this binding site is not unique across the a-1,2-mannosidase family. However, the second lowest-energy cluster docks very close to the crystal-structure DMJ-1C4,

with a RSMD of 0.82 A and an energy of -84.3 kcal/mol (Table 1). The lowest-energy cluster of Man-4C1 docks close to M7 with an energy of -94.6 kcal/mol. The second lowest-energy cluster docks in the funnel tube with an energy of -88.5 kcallmol and a RMSD of 1.87. Glc and Gal, C2 and C4 epimers of Man, respectively, bind with higher energies than DMJ-1C4.

Disaccharide docking simulations The E4 conformation ofMan-1,2-Man docks with a lower energy than the 1C4 conformation;

this is not the case in the docking simulations ofGal-1,2-Man and Glc-1,2-Man (Table 1). Only Man-£4-1,2-Man can closely overlay the crystal structures of both DMJ and M7 in the active site, as is evident from their low RMSD values (Table 1).

TABLE I. Inhibitor, Monosaccharide, and Disaccharide Docking

Ligand Conform- No. of major Cluster Lowest energy RMSD of lowest

ation clusters rank* (kcal/mol) energl (A)

DMJ 1C4 2 1 -95.00 0.62

DMJ 4C1 9 6 -77.70 1.29

DNJ 1C4 4 1 -88.48 1.31

DNJ 4C1 6 1 -91.10 1.52

KIF 1C4 1 1 -107.00 0.72

Gal 1C4 5 1 -86.13 1.96

Gal 4C1 4 2 -91.71 1.82

Glc 1C4 6 1 -85.44 0.95

Glc 4C1 5 2 -80.41 1.91

Man 1C4 5 2 -84.25 0.82

Man 4C1 9 2 -88.51 1.87

Gal-1 ,2-Man 1C4 3 1 -141.00 1.92

Gal-1 ,2-Man E4 3 1 -137.40 1.01

Glc-1 ,2-Man 1C4 3 1 -146.79 1.74

Glc-1 ,2-Man E4 4 1 -137.24 1.37

Man-1 ,2-Man 1C4 4 1 -136.75 1.89

Man-1 ,2-Man E4 3 1 -153.03 0.80

*Lower-energy clusters that dock outside the active-site funnel tube are ranked but not reported.

t 1-A RMSD cluster tolerance, based on the crystallographic coordinates of DMJ (inhibitors and mono-

saccharides) or DMJ and M7 (disaccharides).

39

Page 45: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

DISCUSSION Automated Docking of Inhibitors and Monosaccharides

The very negative docking energies ofDMJ_IC4 and KIF-1 C4 explain their inhibitory action. The total docked energy in Auto Dock is the sum of the total non bonded intermolecular interaction energy between every ligand atom and the macro­molecule and the nonbonded intramolecular energy of the ligand. An interesting point to be noted in these docked struc­tures is the very high interaction energies of the C2 and C3 hydroxyl groups of the inhibitors with the enzyme (Fig. 3). In the lowest-energy docked structure ofDMJ, they contribute -25.0 kcal/mol, almost 27.5% ofthe total interaction energy Glu503

and 26.3% of the total docking energy ofDMJ with the a-1,2-

mannosidase. Equivalent values with KIF are -22.7 kcallmol, 22. 7%, and 21.2%. Both these hydroxyl groups are coordina­ted by the Ca2

+ ion present at the base of the active-site funnel tube. Thus, the Ca2

+ ion plays a very important role in stabili­zing the energetically unfavorable 1C4 conformations of these

Fig.3. Stabilizing interactions of DMJ (black) and KIF (white). Conserved residues interacting with DMJ and KIF

2+ (gray); Ca (gray sphere). Hydrogen bonds and coordination by Ca2

+ are shown as lines.

inhibitors. Another point to be noted is that the activation energy for the transformation of mannose from the 4C1 to the 1C4 conformation is 12-14 kcal/mol,26 so these interactions alone may contribute significantly to causing the observed ring pucker.

DMJ-4C1 has a much higher docking energy than DMJ-1C4, the difference being higher than the difference in energies of the 4 C1 and the I C4 COnformations ( -4 kcal/mol), 26 thus making the docked 1C4conformation more stable. DNJ-1C4 and DNJ-4C1 dock with higher energies than the docked energy ofDMJ-1C4, explaining why DNJ is not an inhibitor.

Enzymes function by stabilizing the transition state; however, for efficient function the active site should not favorably bind the product. The difference in the binding energies ofDMJ-1C4 and Man_IC4clearly demonstrates how enzymes achieve this kind of specificity. DMJ-1C4 binds the active site with very negative interaction energy, whereas Man-1C4 does not. The extra hydroxyl group at the mannosyl C 1 causes a steric hindrance that is responsible for the reduced interaction energy. The enzyme active site is optimized to bind the £4 conformation of -1Man. Man-£4 in solution has an apparent free energy of -10 kcal!moi26 greater than that of the Man-1 C4. Before bond breakage the interactions provided by the rest of the substrate stabilize the £4 conformation and, as suggested earlier, might even induce the strain necessary for its deformat­ion. After the bond breaks, the cleaved mannose probably leaves the funnel tube in an effort to relax back into an energetically more favorable conformation.

Man8GlcNAc2, the hydrolysis product of Family 47 ER a-1,2-mannosidases, is a necessary signal for degradation of misfolded protein. 27,28 a-1 ,2-Mannosidase inhibitors block degradation of misfolded proteins, and it has been suggested that this slow-acting ER a-1 ,2-mannosidase (KM = 0.5 mM, kcat = 12 s·1 for the S. cerevisiae enzyme25) may work as a timer for glycoprotein deg­radation. 29,30 This ER a-1 ,2-mannosidase therefore is the first enzyme in the N-glycan synthesis pathway that binds the newly synthesized protein in fully/nearly folded form in the cell under normal conditions.

The difference in apparent free energy between Man-1C4 and Man-4C1 in solution is 4.38 kcal/moi,26 which implies that the Man-1C4 solution concentration is <1 %. Since a-1,2-manno­sidase binds the 1C4 conformation, the corrected KM for this enzyme would therefore be >0.005

40

Page 46: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

-------------

mM and kcatiKM would be >2.4 X 106 M"1s-1, giving this enzyme very high affinity for its sub­strate. Also, the relaxation time for the conformational change between the two chair forms is of the order ofmicroseconds.31 Protein folding itself takes milliseconds to minutes. The rate-limit­ing step, therefore, seems to be the diffusion of the bulky substrate to the a-1 ,2-mannosidase act­ive site. Also, the lower concentration of the 1 C4 conformation might aid in retention. Misfolded proteins would have greater resistance to diffusion and would therefore be retained in the ER longer as the ER resident chaperonins aid the folding process. The compact folded form of the enzyme would be able to move more easily to the ER a-1,2-mannosidase active site where its middle-arm mannose is trimmed. The MansGlcNAcz formed by hydrolysis by this enzyme makes it amenable to be marked for degradation by the ER glucosyltransferases.28 However, if correctly folded, due to its lower diffusion resistance compared to the unfolded form, it can mig­rate quickly enough out to the ER to avoid being marked for degradation. In this scenario, the retention time of the MansGlcNAc2-protein seems to be the most important step in ER-associated protein degradation.

There is also the possibility that the transformation of the terminal mannosyl residue from the 4C1 to the 1C4 conformation is mediated by interactions with the enzyme, but this seems less like­ly considering the high activation energy for this transformation.

Transition State Ring flattening at the C1 position ofDMJ, leading to the E4 conformation, was suggested by

Vallee et a/. 7 so that it could form an a-1 ,2 bond with M7. This would also be consistent with the absolute stereochemical requirement of the coplanarity ofC1, C2, C5, and 05 atoms ofthe nonreducing-end residue necessary for forming the oxocarbenium-ion-like transition state.s The disaccharides Gal-1,2-Man, Glc-1,2-Man, and Man-1,2-Man were docked with the nonreducing­end residues in their 1 C4 and £4 conformations. In each case the reducing-end mannosyl residue was in the relaxed 4C1 conformation. The disaccharide with the nonreducing end in the E4 con­formation would therefore mimic the substrate transition state.

The apparent free energy of Man-£4 in solution is -10 kcallmol higher that that of Man-1 C4. 26

Therefore, for transition-state stabilization to occur, the decrease in energy upon binding should overcome the difference in energy between the two conformations. The lower energy of the docked Man-£4-1,2-Man compared to that ofMan-1C4-1,2-Man clearly establishes that docked Man-£4-1,2-Man well approximates the enzyme-transition state complex. Energies of the two other disaccharide pairs are higher for their E4 than for their I c4 conformations. This implies that even ifGal-1,2-Man and Glc-1,2-Man penetrate the active site, their 1C4 conformations will not deform, and hence there will be no substrate activation for hydrolysis.

An examination of the interaction energies of the various substrate hydroxyl groups indicates that three, 03 and 06 of -1 Man and 04 of+ 1 Man, form strong hydrogen bonds with the enzyme (interaction energies of 12.0, 12.2, and 13.8 kcallmol, respectively). This implies that the interactions provided by the hydroxyl groups further away from the scissile bond also contri­bute significantly to transition-state stabilization. These bonds might also help generate the torque nec­essary for -lMan deformation.

41

Asp275).._

Phe131

~.· .... ~

I Leu338

Fig.4. Stabilizing interactions of +1Man (white). Conserved residues interacting with +1 Man (gray) and M7 (black). Hydrogen bonds and hydrophobic interactions are shown as lines.

Page 47: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

The conserved residues of the active-site neck that bind+ 1Man are shown in Fig. 4. Con­served residue Asp275 hydrogen-bonds 03 and 04 of+ 1Man, thus securely anchoring the sub­strate for the reaction to occur. These hydrogen bonds cannot form if -1Man is linked by a-1,3, a-1,4, or a-1,6 bonds to +lMan, thus explaining its specificity for a-1,2-linked mannosyl resi­dues. This might explain why the D275N mutant ofthe S. cerevisiae enzyme has very low a-1,2-mannosidase activity compared to the wild-type enzyme.25

Hydrophobic packing interactions between conserved aromatic residue Phe 131 and the C4, C5, and C6 atoms of+ 1Man and between conserved Leu338 and the C2 and C3 atoms of+ 1Man also seem to have an important role in stabilizing+ !Man (Fig. 4). For a negative-binding free energy, the interaction energy between the ligand and enzyme should compensate for entropy loss and solvation energy loss due to binding. The active site, being rich in acidic amino acid residues, can replace the hydrogen bonding network supplied by water. Binding to the enzyme active site also supplies a hydrophobic environment for the nonpolar carbon atoms of the ligand, and this could be key in causing the negative binding free energy.

Catalytic Mechanism Identification of the catalytic acid/proton donor and base/

nucleophile from the human and yeast mannosidase crystal structures was somewhat ambiguous.5,7 Docking ofMan-£4-

1,2-Man and the optimal positions ofW54 and W195 after minimization clearly establish their identities: Glu132 is the catalytic proton donor and Glu435 is the catalytic base (Fig. 5). Upon optimization, W54 is 0.7 A and W195 is 0.8 A from their corresponding crystal coordinates. As suggested earlier,5 W195 appears to mediate proton donation, since Glu132 is not within hydrogen-bonding distance of the 02 atom of M7. Another possibility could be the movement ofGlul32 within hydrogen­bonding distance of the 02 atom. However, this does not seem likely because Glu132 is anchored in place by a salt bridge with Arg136 that restricts its motion. The water molecule activated by the nucleophile is W54, the only crystal-structure water that is located close enough to the Cl of -I Man for nucleophilic attack. Asp275 is not within hydrogen-bonding distance ofW54 and hence is ruled out as the catalytic base (Fig. 5). W54 also is coordinated by the Ca2

+ ion; hence Ca2+ has a more direct role in

catalysis than stabilizing the I c4 conformation of -1 Man.

CONCLUSIONS

Asp275

W541~ \3.5

/2.9 W19~

~:lu132 Fig.5. Catalytic mechanism. Putative catalytic residues (gray), docked Man-E4-1.2-Man (black), waters (white): crystal (only oxygen) and minimized (with hydrogens). Distances in A.

The detailed energetic information supplieq by AutoDock significantly adds to the structural information obtained by x-ray crystallography and NMR. This information, upon careful inter­pretation, can contribute to understanding biomolecular recognition mechanisms. It is evident that, in the post-genomic era, when protein structural information will be available at genomic scale, detailed study of interactions at the atomic level would be invaluable for developing meth­ods to determine a protein's structure from its function.

ACKNOWLEDGMENTS The authors are grateful to Pedro M. Coutinho for suggesting this project.

42

Page 48: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

REFERENCES 1. Coutinho PM, Henrissat B. CAZy: Carbohydrate-Active Enzymes Server; 1999. http://afmb.

cnrs-mrs.ft/-cazy/CAZY /index.html. 2. Herscovics A. Processing glycosidases of Saccharomyces cerevisiae. Biochim Biophys Acta

1999;1426:275-285. 3. Herscovics A. Importance of glycosidases in mammalian glycoprotein synthesis. Biochim

Biophys Acta 1999;1473:96-107. 4. Herscovics A. Structure and function of Class I a.1,2-mannosidases involved in glyco-protein

synthesis and endoplasmic reticulum quality control. Biochimie 2001;83:757-762. 5. Vallee F, Lipari F, Yip P, Sleno B, Herscovics A, Howell PL. Crystal structure of class I

a.1,2-mannosidase involved inN-glycan processing and endoplasmic reticulum quality control. EMBO J 2000;19:581-588.

6. Romero PA, Vallee F, Howell PL, Herscovics A. Mutation of Arg273 to Leu alters the specificity of the yeast N-glycan processing Class I a.1,2-mannosidase. J Bioi Chern 2000;275:11071-11074.

7. Vallee F, Kharaveg K, Herscovics A, Moremen KW, Howell PL. Structural basis for catal­ysis and inhibition ofN-glycan processing class I a.1,2-mannosidases. J Bioi Chern 2000;275:41287-41298.

8. Rye CS, Withers SG. Glycosidase mechanisms. Curr Opin Chern Biol2000;4:573-580. 9. Coutinho PM, Dowd MK, Reilly PJ. Automated docking of monosaccharide substrates and

analogues and methyl a.-acarviosinide in the glucoamylase active site. Proteins 1997;27: 235-248.

10. Coutinho PM, Dowd MK, Reilly PJ. Automated docking of isomaltose analogues in the glucoamylase active site. Carbohydr Res 1997;297:309-324.

11. Coutinho PM, Dowd MK, Reilly PJ. Automated docking of glucosyl disaccharides in the glucoamylase active site. Proteins 1997;28:162-173.

12. Coutinho PM, Dowd MK, Reilly PJ. Automated docking of a.-(1,4)- and a-(1,6)-linked glu­cosyl trisaccharides in the glucoamylase active site. lnd Eng Chern Res 1998;37:2148-2157.

13. Laederach A, Dowd MK, Coutinho PM, Reilly PJ. Automated docking of maltose, 2-deoxy­maltose, and maltotetraose into the soybean ~-amylase active site. Proteins 1999;37:166-175.

14. Rockey WM, Laederach A, Reilly P J. Automated docking of a.-1 ,4- and a-1 ,6-linked gluco­syl trisaccharides and maltopentaose into the soybean ~-amylase active Site. Proteins 2000;40:299-309.

15. Allen MJ, Laederach A, Reilly PJ, Mason RJ. Polysaccharide recognition by surfactant pro­teinD: Novel interactions of a C-type lectin with nonterminal glucosyl residues. Biochem­istry 2001;40:7789-7798.

16. Goodsell DS, Olson AJ. Automated docking of substrates to proteins by simulated annealing. Proteins 1990;8: 195-202.

17. Goodsell DS, Lauble H, Stout CD, Olson AJ. Automated docking in crystallography: Analy­sis of the substrates of aconitase. Proteins 1993; 17: 1-10.

18. Goodsell DS, Morris GM, Olson AJ. Automated docking of flexible ligands. Applications of AutoDock. J Mol Recog 1996;9:1-5.

19. Morris GM, Goodsell DS, Halliday RS, Huey R, Hart WE, Belew RK, Olson AJ. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J Comput Chern 1998;19:1639-1662.

43

Page 49: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

20. Solis FJ, Wets RJ-B. Minimization by random search techniques. Math Oper Res 1981;6: 19-30.

21. Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput

. Chern 1983;4:187-217. 22. Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KMJ, Ferguson DM, Spellmeyer DC, Fox

T, Caldwell JW, Kollman PA. A second generation force field for the simulation of proteins and nucleic acids. JAm Chern Soc 1995;117:5179-5197.

23. Walters P, Stahl M. Babel. 1.6. Tucson, AZ; 1992-1996. http://smog.com/chem/babel/ 24. Stewart JJP. MOPAC 6.0; 1990. 25. Lipari F, Herscovics A. Calcium binding to the Class I a-1,2-mannosidase from Saccharo­

myces cerevisiae occurs outside the EF hand motif. Biochemistry 1999;38:1111-1118. 26. Dowd MK, French AD, Reilly PJ. Modeling of aldopyranosyl ring puckering with MM3(92).

Carbohydr Res 1994;264:1-19. 27. Jakob CA, Burda P, Roth J, Aebi M. Degradation ofmisfolded endoplasmic reticulum

glycoproteins in Saccharomyces cerevisiae is determined by a specific oligosaccharide structure. J Cell Bioi 1998;142:1223-1233.

28. Ellgaard L, Molinari M, Helenius A. Setting the standards: Quality control in the secretory pathway. Science 1999;286:1882-1888.

29. Helenius A. How N-linked oligosaccharides affect glycoprotein folding in the endoplasmic reticulum. Molec Bioi Cell 1994;5:253-265.

30. Liu Y, Choudhury P, Cabral CM, Sifers RN. Oligosaccharide modification in the early secretory pathway directs the selection of a misfolded glycoprotein for degradation by the proteasome. J Bioi Chern 1999;274:5861-5867.

31. Stenger J, Cowman M, Eggers F, Eyring EM, Kaatze U, Petrucci S. Molecular dynamics and kinetics of monosaccharides in solution. A broadband ultrasonic relaxation study. J Phys Chern B 2000;104:4782-4790.

44

Page 50: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

--------------~-~~-----~~ --~-~--- ~---

A Mathematical Model for Carbon Bond Labeling Experiments: Analytical Solutions and Sensitivity Analysis for the Effect of Reaction Reversibilities on

Estimated Fluxes

Summary

Ganesh Sriram and Jacqueline V. Shanks1

Department of Chemical Engineering 2114, Sweeney Hall, Iowa State University, Ames, lA 50011

Carbon labeling experiments provide valuable information toward the quantification of intracellular metabolic fluxes. A class of 13C labeling experiments, known as carbon bond labeling experiments, provides information on the extent of coupling between adjacent carbon atoms (i.e. 'bond integrities' of carbon-carbon bonds) in metabolite molecules. From these, it is possible to calculate the abundances of bondmers (molecules of the same metabolite having different C-C bond integrities). The bondmer abundances can be processed and translated to valuable metabolic flux maps. In this paper, we have developed a mathematical model based on metabolite and bondmer balances, for describing bondmer abundances of metabolites in the glycolysis and pentose phosphate pathways as a function of the flux split ratio at the glucose-6-phosphate branchpoint, and the reversibilities of the reactions catalyzed by hexose isomerase and transketolases. We show that in this case, analytical solutions can be obtained for groups of bondmers. Also, we report a sensitivity analysis (based on these solutions), of the effect of the reversibilities on the bondmer distributions.

Introduction Metabolic flux analysis is the quantification of all steady state intracellular metabolic fluxes (i.e. bio­chemical reaction rates), in a metabolic pathway network, and has long been identified as a valuable computational tool in metabolic engineering (Stephanopoulos and Vallino 1991, Stephanopoulos 1994, 1999). Metabolic flux maps, resulting from these analyses, provide a measure of the degree of engage­ment of various pathways in cellular metabolism (Stephanopoulos et al. 1998) and thus provide a description of the physiological state of the cell (Schmidt et al. 1999). Therefore, metabolic flux anal­ysis of genetic variants can be useful in deriving information about the effect of the genetic variation on the biochemistry of the cell (Bailey 1991, Nielsen 1998).

Metabolic flux analysis is typically performed by writing balances for intracellular metabolites in ma­trix form, by measuring the rates of changes in concentration of certain metabolites (such as substrate, extracellular products), and then solving the matrix equation obtained (Vallino and Stephanopoulos 1993). However, in large and biochemical networks (such as those of many eukaryotes) that are topologically complex as well as possibly compartmented, there is an inherent lack of measurable metabolites related to the intracellular flux distribution (Schmidt et al. 1999). This problem can be overcome by using isotopically labeled substrates, and subsequent analysis by nuclear magnetic res­onance (NMR) spectroscopy or mass spectroscopy (MS). NMR and MS provide flux distributions at branchpoints in the network, which can be used as additional contraints in the solution of the aformen­tioned matrix equation.

1 Author for correspondence, E-mail j shanks@iastate. edu

45

Page 51: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

NMR (and MS) experiments involve the feeding of a isotopically labeled (13 C) substrate, and relating the fate of the label to the fluxes in the pathways. The 'fate' of the label could be (a) the extent of 13C enrichment of carbon atoms in a metabolite molecule (if a non-uniformly labeled 13C substrate is used) or (b) the extent of coupling between adjacent carbon atoms in a molecule (or 'bond integrities' of carbon-carbon bonds in the molecule), if a uniformly labeled 13C substrate is used (Schmidt et al. 1999).

In this paper, we concentrate on category (b), also known as carbon bond labeling experiments (Szyper­ksi 1995). Using bond labeling experiments to detect bond-integrities of the amino acids (from the cellular protein of an organism), is a particularly quick, efficient and powerful method of obtaining metabolic flux maps of primary and intermediary metabolism (Szyperksi 1995, Sauer et al. 1997, 1999). This is because the carbon skeletons of the amino acids reflect the carbon skeletons of key precursors spread out over primary and intermediary metabolism (Szyperski 1998), which in turn, are related to the metabolic flux ditsribution. Further, the experimental aspect of this methodology is rel­atively straightforward. Additionally, the amino acids appearing on a 2-D HSQC NMR spectrum are segregated enough to allow easy quantitation of carbon-carbon bond integrities.

We have developed a mathematical model that uses metabolite and bondmer balances, to describe the bondmer distributions (see 'Mathematical Model' section for definition) of the amino acid precursors, as a function of the fluxes and reaction reversibility extents in glycolysis and the pentose phosphate pathway. Prior to this work, Klapa et al. ( 1999) have applied the concept of metabolite and isotopomer balances to develop a model for fluxes in the TCA cycle, in a carbon atom enrichment experiment (category (a) of experiments, above). Since this work focuses on carbon bond labeling experiments (category (b), a completely different experiment type), our work is novel. Also, we have concentrated on the glycolysis and pentose phosphate pathway reactions (i.e. the section of primary metabolism before pyruvate and acetyl CoA), whereas Klapa et al. have focused on the TCA cycle, which is the section of primary metabolism after pyruvate and acetyl CoA.

Mathematical model for bondmer distributions Definitions Before we proceed to describe the development of the model, we formalize the definitions of bond integrity and bondmer, and elucidate the method we have used to assign numbers to bondmers.

Bond integrity. The bond integrity of a covalent bond between two carbon atoms in a metabo­lite molecule, is a property indicating if those two carbon atoms originated from the same substrate molecule, or not. For a single molecule, bond integrity takes the binary values 1 and 0 (1 if the two carbons originated from the same substrate molecule, and 0 if they did not). For a pool of molecules of the same metabolite, the bond integrity of a certain bond is statistical mean of the bond integrities of that bond in all the metabolite molecules of that pool.

In other words, bond integrity is the probability that the carbon atoms connected by that bond origi­nated from the same source molecule. Thus the bond integrity of a C-C bond in a metabolite pool can take fractional values in the range [0, 1].

Bondmer. Bondmers (bond-isomers) are molecules of the same metabolite which have different bond integrities for different carbon-carbon bonds. A bondmer pool is a collection of all metabolite

46

Page 52: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

----------------~-- --

Molecule Structure bl b2 Bondmer number

A 1 2-3 0 1 M3 B 1-2 3 1 0 M2 c 1-2-3 1 1 M4 D 1 2-3 0 1 M3 E 1 2-3 0 1 M3 F 1-2 3 1 0 M2 G 1 2 3 0 0 M1 H 1-2-3 1 1 M4

Mean 0.50 0.63

Bondmer Abundance M1 1/8 = 0.125 M2 2/8 = 0.250 M3 3/8 = 0.375 M4 2/8 = 0.250

Table 1: lllustration of bond integrity, bondmer and bondmer number

molecules of the same metabolite which have the same bond integrity for each bond. Thus bondmer pools are subsets of metabolite pools.

Bondmer numbering The number of bondmers for a given metabolite may sometimes be quite large, and it is therefore necessary to systematically assign numbers to them. We have used the following procedure: Consider a metabolite M having ann-carbon skeleton, 12 3 4 5 ... n. We number the C-C bonds in the molecule from left to right, and let bi be the integrity of the ith bond. Thus,

The bondmer number of this bondmer is then calculated as k = :E~:::-01 2i-l bi + 1. Since we are consid­ering a single molecule, its bond integrities will be either 0 or 1. Thus the number calculated above would be an integer. We then designate the abundance of this bondmer pool as Mk. (Note that the '1' is added for convenience, so that the lowest-numbered bondmer M 1).

Example

Consider an experiment where glucose is the sole carbon substrate, and a 3-carbon metabolite, M, is formed. Assume, for simplicity, that its metabolite pool consists of the 8 molecules shown in Table 1. Their bond integrities b1 and b2, and the bondmer pool that they belong to, are shown alongside. The overall bond integrities of the C 1-C2 and C2-C3 bonds for this metabolite are shown in the last row.

In order to translate NMR data to bondmer abundances, probablity expressions, such as those derived by Szyperksi ( 1995) can be used. Additional details on the same will be available in our forthcoming publication (Sriram and Shanks 2002).

47

Page 53: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

1-3z+r.J. f r

1-z .J.

2-z.J.

..... -......... -........ -- ... --- ... -- ... -- ..... -....... -------~ .... . . . . . . . . 1--- •. --- ··--- •. ------ ... -- ....... -- ... --- ...... --- ........ !

Figure 1: Glycolytic and pentose phosphate pathways, and flux distributions therein

Development of Model

The glycolytic and pentose phosphate pathways, and their flux distributions have been shown in Figure 1. All fluxes are relative to the input glucose flux, which has been taken to be 1. The flux into the pentose phosphate pathway has been designated as 3z. All other fluxes can be described as a function of z, as shown in the figure. This can be calculated using metabolite balances (Sriram and Shanks 2002). Note that the amino acids whose bond integritieslbondmer distributions can be determined using NMR, have been shown inside ovals, and are linked to their metabolite precursors.

Case 1: All reactions are completely irreversible

To exemplify the use of bond integrities in obtaining metabolic flux maps, we have considered the evaluation of the flux through the pentose phosphate pathway, using the bondmer distribution of glyceraldehyde-3-phosphate (GAP). (Note that this information can be experimentally obtained by conducting a bond labeling NMR experiment on the amino acids Ala and Ph e.)

Using information on the carbon skeleton rearrangement of the metabolites during the glycolytic/pentose phosphate pathway reaction, [such as that reported by Follstad and Stephanopoulos (1998), and shown in Figure 2], we enumerated all possible bondmers that would be formed in this reaction network (if the reactions were irreversible). As depicted in Figure 3, the only metabolites in this network that form multiple bondmers are fructose-6-phosphate (F6P or F)-and glyceraldehyde-3-phosphate (GAP or T), the bondmers being:

F1s: 1-2 3 4 5-6 F26: 1-2 3 4-5-6 F3o: 1-2 3-4-5-6 F32 : 1-2-3-4-5-6

Page 54: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

GLU-+ G6P (pts) G&P-+ F&P (u1)

G&P -+ P5P + C02 (pg/)

~ ~ ~ ~ * .. ~ COz~

P5P + P5P-+ T3P + S7P (tMA)

S7P + T3P -+ E4P + F&P (tal)

I r-;==-

~ ~ ~ i ~

~

Figure 2: Carbon atom transitions in pentose phosphate pathway reactions

T3: 1 2-3 T4: 1-2-3

To obtain these abundances as a function of z, input-output balances can be written for the F6P and G6P bondmers, based on the bondmer formation/depletion depicted in Figure 3:

F6P balance:

~F _ d [ ~~:] dt dt F3o

F32

GAP balance:

!GAP - ! [ ~: l [

2(1- z)F26 + (1- z)F1s + (1- z)F3o l + z [ 0 l + z [ 0 l 2(1- z)F32 + (1- z)F1s + (1- z)F3o 1 1

49

Page 55: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

1-2-3 3-2-1 3-2-1 3-2-1

I I

1-I2-3-·-r··<----------·

1-2-3

3-2-1

Figure 3: Enumeration of bondmers in the glycolysis and pentose phosphate pathways when all reac­tions are irreversible

Furthermore, metabolic and isotopic steady can be assumed, so that the rates of accumulation of the bondmers can be set to zero:

d -F=O dt

!!_T=O dt

Thus, from the F6P balance, we have

(1- z) [ ~~ ] = z [ ~:] + (1- 3z) [ ~] + z [ ~ ] Combining this with the GAP balance gives

2 [ T3 ] = [ 2(1- 3z) + zT4 + z + 2z l T4 2zT3 + zT4 + z

This gives 2 simultaneous equations in T3 and T4 , on solving which we obtain

2z T3=--

2-z

50

(1)

(2)

(3)

(4)

(5)

Page 56: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

and

2 -3z T4 = ---

2-z (6)

Eqs. 5 and 6 provide the bondmer distribution of GAP in terms of the relative flux through the pentose phosphate pathway (3z). This demonstrates how the bondmer abundances, obtained from NMR data, can be used to evaluate metabolic fluxes.

Case 2: pgi, tktA and tktB reactions are reversible

Effect of tktA reversibility

Reversible biochemical reactions can lead to a redistribution of the 13C label, and lead to label fates substantially different from the irreversible reactions scenario examined in the above section. We ex­amined the effects of reversibilities of individual reactions in the glycolysis/PPP pathways. The first of the reactions examined was transketolase (tktA) reaction. Szyperksi (1995) suggested the reversibility of this reaction to be the cause of formation of bondmers of pentose phosphate (P5P) with b2 = 0, a class of bondmers not encountered in the previous section, where the reversibility of tktA was not considered. Here, we present a systematic analysis of effect the reversibility on the abundance of such bondmers.

The reaction being examined is

P5P + P5P t~ S7P +GAP

Here, z is the net flux of the reaction (from Figure 1 ), and r is the flux of the backward reaction. The reaction reversibility extent, eA, is defined as the ratio of the backftux to the total flux,

r eA=-­

r+z (7)

From the carbon atom transitions shown in Figure 2, it can be infered that a pentose phosphate (P5P) molecule with unbroken C-C bonds will form the bondmers 862 and T4 , of S7P and GAP, respectively,

(8)

If the reaction is reversible, these bondmers will react to form a pentose phosphate molecule with a broken C2-C3 bond,

(9)

The P14 molecules in the pentose phosphate pool can now participate in the forward reaction, either by themselves or with the P16 molecules, giving 854 and 862 molecules,

P14 + P14 T:!;

P14 + P16 T:i:;

51

(10)

(11)

(12)

Page 57: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

The Ss4 molecules can undergo the reverse reaction to give only P14 molecules,

(13)

This completes the enumeration of the bondmers formed as a result of tktA reversibility. We can now write bondmer balances in vector form, thus,

Since P14 and P16 are the only bondmers of P5P in this system, we have P 14 + P 16 = 1, and therefore, at steady state,

0 = -3zPl4 + r P16 (15)

so that P14/ P 16 = 3zjr, and using the fact that P14 + P 16 = 1, the P5P bondmer abundances are

pl4 r eA

-3z+r 3- 2eA

pl6 3z 3- 3eA

-3z +r

-3- 2eA

Here, P14 represents the class of P5P bondmers which have b2 = 0. The above equations provide the relationship between their abundance and the extent of reversibility of the tktA reaction.

Effect of pgi reversibility

We analyzed the effect of pgi reversibility, by a procedure similar to that in the previous two sections: enumerating bondmers, writing and solving bondmer balances. The number of bondmers of in this case is substantially larger than in the foregoing cases, and is not elaborated here for lack of space. Details can be obtained in Sriram and Shanks (2002). Table 2 lists the abundances of selected bond­mers (occassionally groups of bondmers), when the pgi reaction is reversible. The reversibility extent of pgi is denoted as e. Two different sets of results have been presented: (1) pgi partially reversible (0 < e < 1) and (2) the special case of pgi being completely reversible (e = 1), which results in complete scrambling of the G6P and F6P bondmer pools.

Combined effect of pgi and tktA and reversibilities

We analyzed the combined effects of reversibilities of pgi and tktA reactions, in a manner similar to that above. The results for selected bondmers are presented in Table 3. Here, the definitions of e and eA are the same as in the above sections. Sensitivity Analysis

Based on the analytical solutions obtained for the abundances of various bondmers, we conducted sensitivity analyses for the effect of various reaction reversibilities on the evaluated ftuxes. Although the presentation of detailed analysis is not possible here, we report a summary of the analysis results: the bondmer distribution of GAP is insensitive to the reversibility of tktA (eA) and relatively much

52

Page 58: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Metabolite Bondmer Abundance pgi partially pgi completely

reversible reversible 2z(1 ez) 2z(1 z] GAP TI +T3 1 2 3 + 1 2-3 P = (2-z)(1+2ez) (2-zJ(1+2z)

q=1-p +z T4 1-2-3 (2-z)(1+2z) pez 2z~(1 z}

Pen PI +Pg 1 2 3 4 5 + 1 2 3 4-5 (1+ez) (2-z) ?:1 +~~(1+2z) qez 2+z z

PI3 1 2 3-4-5 (1+ez) (2-z)(1+z)(1+2z) ez z

Pis 1 2-3-4-5 (1+2ez{(1+ez) (1+2z{(1+z)

PI6 1-2-3-4-5 l+2ez 1+2z pez 2z~(1 z}

E4P EI +Es 1 2 3 4+1 2 3-4 (1+ez) (2-z) ?1 +~~(1+2z) qez 2+z z

E1 1 2-3-4 (1+ez) (2-z)(1+z)(1+2z) 1 1

Es 1-2-3-4 1+ez 1+z

Table 2: Effect of pgi reversibility on bondmer distributions

Metabolite Bondmer Abundance 2z(l ez)(2eAez+eA+ez)

GAP GI +G3 1 2 3 + 1 2-3 (1 +2ez) f(2-z) f (1 +ez )+2e A (2+ez)1-6eA] 2z(1 ez)(2eAez+eA+ez)

Pen PI +Pg 1 2 3 4 5 + 1 2 3 4-5 (1 +2ez) [ (2-z) [ (1 +ez )+2e A (2+ez)] -6e A] ez{1+e,d}

Pis 1 2-3-4-5 (1 +2ez)(1+ez+eA +2e Aez) 1

PI6 1-2-3-4-5 (1+eA)(1+2ez)

Table 3: Combined effect of pgi and tktA reversibilities

53

Page 59: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

sensitive to the reversibility of pgi (e). On the other hand, the bondmer distribution of the P5P pool is highly sensitive to the reversibility of tktA (eA), and relatively less sensitive to the reversibility of pgi (e). Also, in a carbon bond labeling experiment, it is crucial to accurately quantify the bondmer abundances of P5P (from histidine) and E4P (from the aromatic skeleton of phenylalanine and tyro­sine). Information from the modeling and sensitivity analysis is expected to be useful in obtaining how significantly each measurement in a carbon bond labeling experiment contributes to the estimated fluxes, and is therefore expected to facilitate better experimental design, as well as faster and more accurate computation of fluxes.

References Bailey, J. E. 1991. Science, 252: 1668-1675.

Christensen, B. and J. Nielsen. 2000 Biotechnol. Bioeng., 68: 652-659.

Follstad, B. D. and G. Stephanopoulos. 1998. Eur. J. Biochem., 252: 360-371.

Klapa, M. 1., S. M. Park, A. J. Sinskey and G. Stephanopoulos. 1999. Biotechnol. Bioeng., 62: 375-391.

Sauer, U., V. Hatzimanikatis, J. E. Bailey, M. Hochuli, T. Szyperski and K. Wuthrich. 1997. Nature Biotechnol., 15: 448-452.

Schmidt, K., L. C. Norregaard, B. Pedersen, A. Meissner, J. 0. Duus, J. 0. Nielsen and John Villadsen. 1999. Metabol. Eng., 1: 166-179.

Sriram, G. and J. V. Shanks. 2002. Manuscript in preparation for submission to Metabol. Eng.

Stephanopoulos, G. and J. J. Val1ino. 1991. Science, 252: 1675-1691.

Stephanopoulos, G. 1994. Curr. Opin. Biotechnol., 5: 169-200.

Stephanopoulos, G. 1999. Metabol. Eng., 1: 1-11.

Stephanopoulos, G., A. Aristidou and J. Nielsen. 1998. Metabolic Engineering. Principles and Methodologies. Academic Press, San Diego.

Szyperski, T. 1995. Eur. J. Biochem., 232: 433-448.

54

Page 60: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

.. .,

Platelet Derived Nitric Oxide (NO) Inhibits Thrombus Formation: The Role of Insulin

R. H. Williams and M. U. Nollert Department of Chemical Engineering and Materials Science

University of Oklahoma, Norman, OK 73019

ABSTRACT.

Platelet adhesion was simulated an in vitro parallel plate flow chamber using blood anticoagulated with 20 U/mL HEPARIN. Blood was perfused over collagen type III protein coated slides (.8 mg/mL) on the surface of the chamber for 5-10 minutes over a physiogical range of shear (100- 2000 s·1

). After 5 minutes the% platelet coverage decreased from 74 (+/-9)% at 100 s·1 to 12 (+/- 8)% at 2000 s-1

. The effect of insulin on platelet coverage was investigated by perfusing blood with different additives through a parallel plate flow chamber for 5 minutes at physiological shear (1 00, 1000, and 2000 s"1

). The following samples were tested: control (no additive), LNAME, which is an inhibitor ofNO production, insulin, a possible promoter ofNO synthesis, and LNAME/insulin. At 100 s-1

, there wasn't a statistical difference in% coverage. However, at 1000 s"1

, the% coverage between insulin and the other samples was statistically significant. After 2 minutes the% coverage for insulin was 26.43 (+/- 6.11) %. However,% coverage for all other samples ranged from 50.37 (+/- 8.47) to 60.33 (+/- 9.15). At 2000 s·1

, perfusion ofLNAME-containing samples resulted in statistically more% coverage than non-LNAME samples. After 5 minutes, samples without LNAME ranged from 10.36 (+/-2.41) to 11.00 (+/- 2.15) %, compared to samples containing LNAME that ranged from 15.52 ( +/- 1.88) to 21.98 ( +/- 2.49) %. This research presents the first evidence that platelet adhesion decreases strongly with shear rate in an in vitro model that closely simulates arterial shear flow. The minimal effect of insulin on platelet adhesion at low shear rates (-100 s"1

) implies that insulin plays a minor role in capillary and venal clotting processes. However, there is evidence to suggest that insulin inhibits thrombus formation through an insulin-dependent NO production pathway at medium shear rates (-1000 s"1

). At high shear rates (-2000 s"1) insulin has a small

effect on% coverage due to shear-induced, optimal NO production by platelets.

INTRODUCTION

Recent studies have suggested that diabetics may have a larger rate of macro vascular thrombus (clot) formation than normal individuals. 1

•2

•3

•4

•5

•6 Current evidence strongly suggests

that thrombus formation is an aggregating factor in CD.8•6•7

•8 Therefore, it is hypothesized that

the link between diabetes and CD may be through enhanced thrombus formation. 10•9 Thrombus

formation is initiated at a site of vascular injury, where the protective endothelial cell layer lining the blood vessels has been removed. 14

•10

'11

'12 This exposes extracellular matrix (ECM) proteins

lining the vessel wall, such as collagen types I and III (figure 1). The ECM proteins utilize arginine-glycine-aspargine (RGD) amino acid sequences as sites for firm attachment of platelets, which are the primary cellular types involved in thrombus formation. 17

•18

•13

•14

•15

•16

Platelets are anucleic and are directly derived from megakaryocytes. The approximate diameter is 3 ).liil, the mean lifetime in body (MLB) range is 8 to 14 days, and the typical adult platelet concentration is 310,000/J,.LL.17 Platelets normally remain in a "ground" or "resting" state

55

Page 61: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

as they move through the vasculature. However, during an injury platelets are very responsive to a1 . 1" fr . . . 19 20 18 Th" ak th . . extern stimu I sent om an InJury site. ' ' IS m es em essential m the primary immune

response (hemostasis) but also can be an aggravating factor in arterial vessel occlusion/ pathological thrombosis. There are adhesive glycoproteins on the surface of platelets, localized on microvilli extending from the platelet membrane.

These glycoproteins are generally termed integrins due to their ability to integrate entire cell grou~s into orfanized structures and to regulate metabolic processes (i.e. wound healing). 9

'20

'21

'22

'2

'24 Integrins are present on the surface of all cells and are responsible for cell

attachment and organization (i.e. recognition oflike cells) into tissues, organs, etc.24 The two platelet integrins important in wound healing are the GPib and arrbl33 integrins (figure 1). 19

,20

,22

,24GPib integrins are comprised of a single chain, while aubl33 integrins are

heterodimeric?2 Both types of integrin promote adhesion/thrombus formation to an injury site. However, each integrin responds differently to physiological shear and promotes adhesion through different mechanisms of action.

At low shear rates (y-100 s·\ the GPib integrin can bond directly to the RGD site. It can also bond to the plasma derived von willebrand factor (vWF). 19

'20

'22 Due to the rigidity of the

bond formed between the GPib integrin and the RGD site on the collagen, it is extremely difficult for a permanent bond to form at high shear. The force of blood flowing through the vessel exceeds the mechanical arm strength of the GPib/RGD bond. Therefore, at higher shear rates (> 1 00 s "1

) v WF is required for permanent adhesion to occur between platelets and collagen?0

'22 This is due to the multivalency ofvWF, which allows more bonding between RGD

sites and GPib integrins and promotes higher overall GPib/RGD bond strength. This results in a longer residence time of platelets on the collagen surface, allowing for activation of a11bj33

integrins and platelet recruitment.20

After the GPib-vWF-RGD complex is formed, the calcium (Ca2+) channel present in the plasma membrane opens, allowing Ca2+ to diffuse through the membrane. The increased Ca2+ concentration results in an aubl33 integrin conformational change (figure 1). During the conformational change the aub and !33 chains link to become "activated" (attain a bonding conformation)_2°,22 After the anbl33 integrins become active, platelets can crosslink to each other through fibrinogen and vWF proteins,.resulting in thrombus formation. There are several agonist that inhibit thrombus formation by blocking Ca2+-dependent anb~3 activation and crosslinking.25

There is evidence to suggest that physiological insulin acts as an agonist that affects thrombus formation through this mechanism.6

'7

A receptor site on platelets for insulin has been characterized. A recent study has also demonstrated that elevated levels of insulin have resulted in reduced thrombus formation to collagen type I. The most widely accepted pathway for insulin-dependent inhibition is the NO/cGMP pathway.25 When insulin bonds to the receptor, a signal is sent to endothelial nitric oxide synthase (eNOS) located within the platelets (figure 2). The eNOS, using NADPH as the cofactor, produces nitric oxide (NO) through the conversion ofL-arginine into L-citrilline. The NO is used by phosphodiesterese III (PDE III) to synthesize cyclic adenosine monophosphate (cAMP) and by cyclic guanosylate cyclase (cGase) to produce cyclic guonosylate monophosphate ( cGMP). Several studies have demonstrated that elevated cAMP and cGMP levels result in decreased thrombus formation.

In the vasculature, insulin also has an affect on endothelial cells.7 Further, endothelial cells possess an insulin receptor and produce NO through an almost identical mechanism as platelets. Endothelial cells also produce much greater levels of NO than platelets. It is still

56

Page 62: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

unclear what effect insulin has on the NO production of platelets in the presence of endothelial cell monolayers.

In this study, the extent of platelet coverage to collagen type III at different shear rates will be quantified with an in vitro system. This will be done to determine the effect of shear rate on imtial platelet adhesion through GPib integrin receptors and it will be shown that platelet adhesion decreases substantially at high shear rates (1500-2500 s-1

). These results disagree with several previous studies that have suggested increased thrombus formation at higher shear rates.22 ·

The relationship between platelet coverage and insulin-mediated NO production will also be quantified. It will be demonstrated that at medium shear rates (1000 s·1

) insulin has a substantial effect on thrombus formation through platelet derived NO production. However, at high shear rates (2000 s·1

) insulin has a reduced effect due to shear-induced NO froduction. This is the first study to indicate that shear may play a direct role in NO production. 2

MATERIALS AND METHODS

Materials: HEPARIN, HEPES, NaCl, collagen type III from calfskin, bovine serum albumin (BSA), L-Nitro Arginine Methyl Ester (L-NAME), insulin from porcine pancreas, and isoproterenol are obtained from Sigma. Mepacrine (quinacrine) is obtained from ICN Biomedicals. Glass cover slips (24x50 mm) are obtained from Fisher Scientific. BAECs are obtained from the labs of Roger E. McEver, Oklahoma Medical Research Foundation (OMRF), Oklahoma City, OK 73012. HUVECS are obtained from Norman Regional Hospital, Norman, OK 73070.

Preparation of Glass Coverslips: Glass coverslips are soaked in 69.6% HN03 for 12 hours. Each coverslip is then washed with 20 mL nanopure H20 and 10 mL 95% ethanol. After washing is complete, the coverslips are then placed in a 95% ethanol bath for at least 12 hours before use.

Protein Coating of Coverslips: Collagen type III solution is prepared at .8 mg/mL by dissolving 24 mg in. 30 mL of 17 mM acetic acid (pH 2.6). The .1% BSA solution is prepared at .1% by diluting .33 mL of33% BSA with 99.67 mL of 10 mM HEPES/115 mM NaCl buffer (pH 7.4). Coverslips are removed from a 95% ethanol bath and allowed to air dry (approximately 15 min). Half of each coverslip is coated with collagen and allowed to incubate for 4 hours in a humidified environment (80-90%) at room temperature (-24 °C). After incubation, each slide is rinsed with 10-15 mL ofHEPES buffer solution and soaked in .1% BSA for at least 2 hours.

Platelet Preparation: Blood collected from healthy donors (30-60 mL, depending on shear rate and length of experiment) is anticoagulated with HEPARIN to a final concentration of 20 U/mL. The fluorescent label mepacrine is then added to a final concentration of 1 0 ~M and allowed to incubate for 1 0 min.

Flow Experiments: The fluid mechanical environment of the vasculature is modeled with an in vitro parallel plate flow chamber as has been previously characterized. Due to the wide range of shear rates studied, different flow regime dimensions are utilized. These dimensions are determined by the width ofthe gasket opening. At 100 and 500 s·1

, .013 x 1.31 and .013 x .50 em are the dimensions used, respectively. At 1000-2500 s·1

, the dimensions .013 x .25 em are

57

Page 63: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

used. Also, the trials for each experiment are conducted in random order for each donor. This is done to average the effects of increasing time-dependent platelet activity on platelet coverage.

Shear Dependent Study: The effect of shear rate on platelet adhesion is determined at 6 shear rates (1 00, 500, 1000, 1500, 2000, and 2500 s"1

) with 5 donors. This range of shear rates simulates those found in a healthy individual. In each trial, the blood is perfused for 6 minutes.

Insulin Effect on NO Production Study: In some studies, 500 nM insulin is added to the anticoagulated blood 10 minutes before the start of an experiment. In other studies, L-NAME, an inhibitor of platelet NO synthesis is added at a concentration of200 J..I.M for 20 minutes before the start of an experiment. In still other studies, both L-NAME and insulin are added as described above, except in sequential order.

Image Analysis: For all studies, the rate ofthrombus formation is recorded on S-VHS analog media. The extent of mural thrombus formation on the surface is quantified by digital image analysis performed on an SGI Indy workstation running the ISEE® image analysis software by Inovision. A background image is acquired after blood begins flowing over the surface but before adhesion of any platelets. This image is subtracted from subsequent images. Platelets are identified based on their size and intensity using adjustable parameters. Images are acquired and analyzed every 10-30 seconds over the course of each experiment, which lasts from 5-10 minutes ..

RESULTS AND DISCUSSION

Preliminary results have been obtained for the shear rate and insulin dependence of thrombus formation. These results provide evidence that the effect of insulin on thrombus formation is shear rate dependent and that platelets produce enough NO to inhibit adhesion to a collagen type III surface.

Thrombus formation is a function of shear rate

Platelets, obtained from healthy donors, abundantly adhere and aggregate on collagen­coated surfaces, while only a small number of platelets attach to albumin-coated surfaces. Single platelets adhere to the collagen surface before formation of mural thrombi. The thrombi increase in size in chains parallel to the direction of flow. Platelets adhere at a slightly hifher rate on collagen at medium shear rates (1000 s"1

), compared to lower shear rates (1 00 s· ). The rate of adhesion is much smaller at higher shear rates (2000 s"1

). However, as time elapses (5-10 minutes), the numbers of adherent platelets at lower and higher shear rates are comparable. Figure 3 shows a sequence of images representing platelet accumalation at a shear rate of 1000 s·1

. There is no platelet adhesion onto the left hand side of each image since this portion of the coverslip is coated with .1% BSA to simulate the non-adhesive nature of intact endothelium. The right hand side of the coverslip is coated with type III collagen to simulate the exposed sub­endothelial matrix proteins at a site of vascular injury. The flow is from left to right. The extent of mural thrombus formation is quantified by digital image analysis as described in detail in the previous section. Figure 4 demonstrates the extent of mural thrombus formation expressed as the percentage of the collagen coated surface that is covered by platelets as a function of time for five levels of shear rate within the physiological range for arterial and venous vascular beds. The

58

Page 64: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

--------~ ---~--

results are the average of five experiments with five different donors. Some results are similar to those obtained by previous investigators at lower shear rates. However, at shear rates of 1500 and 2000 s·1 the rate of platelet accumulation on the surface is lower than several published results. 17·19'24•26 This demonstrates the important role of shear rate in determining the rate of mural thrombus formation. At low shear rates, thrombus formation rate is limited by the rate of transport of platelets to the surface. At sufficiently high shear rates, fluid mechanical drag on individual platelets and platelet aggregates is high. This prevents attachment to the surface or promotes detachment of the cells from the surface, which reduces the rate of platelet deposition on the surface.

Insulin modulation of thrombus formation from healthy donors

Previous studies have shown that insulin causes platelets to produce nitric oxide, which in turn causes elevated intracellular levels of cAMP, inhibiting platelet activation.34•41 Insulin is added, at a concentration of 500 nM, to blood collected from healthy donors and blood is then perfused through a parallel plate flow chamber to determine the extent of mural thrombus formation. This insulin concentration is much higher than that normally found in the blood, but is used to induce a maximal response. The results are shown in figures 5, 6, and 7 where the extent ofthrombus formation, expressed as %coverage ofthe injured area, is plotted as a function of time.

At the two lower shear rates (100 and 1000 s"1), the addition of insulin causes a statistically significant reduction in the formation of mural thrombus. The effect is much larger at 1000 s·1 than at venous levels of shear rate. These differences disappear in the presence ofL­NAME (an inhibitor of eNOS), suggesting that platelet NO production in response to the added insulin is responsible for the differences seen in mural thrombus formation. L-NAME alone has no effect on mural thrombus formation at 100 and 1000 s·1. These results are consistent with the current hypothesis that platelet adhesion to collagen under flow is. primarily mediated by GPib and anb~J integrins. As mentioned earlier, constitutively expressed GPib is responsible for most platelet adhesion to the surface at low shear rates. At higher levels of shear rate, GPib mediates only transient adhesion while anb~3 is required for stable adhesion?0·41 However, anb~3 exists on the platelet membrane in an inactive, or low affinity, form. Therefore, anb~3 integrins must be activated to become fully functional. 20·41 It is anticipated that increased nitric oxide production and subsequent inhibition of platelet activation only affects anb~3-mediated adhesion and this is precisely what is observed. At 1000 s·1, a large effect of insulin-mediated nitric oxide production is observed, while at 100 s·1 only a modest effect is observed.

At the highest shear rate studied (2000 s·\ added insulin does not have a significant effect on the formation of mural thrombi. However, a curious phenomenon is observed that includes L-NAME but not insulin (figure 7). These cases are anticipated to be identical to the controls that do not have any additives. Instead, it is found that adding L-NAME causes an increase in mural thrombus formation in the absence of insulin. This suggests that at these high levels of shear rate, platelets are producing NO and that NO is inhibiting platelet aggregation on the surface. These results are the first to suggest that shear stress induces nitric oxide production in platelets and that the NO is produced in a sufficient amount to affect platelet function, even in whole blood. Also, a number of studies have demonstrated that shear stress, by itself, can activate platelets and promote aggregation and activation?4·27•28 Perhaps, shear stress-induced signal transduction in platelets promotes activation of eNOS. However, it is uncertain whether

59

Page 65: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

any studies demonstrate that shear stress induces platelets to produce inhibitory substances, such as NO.

SUMMARY

• Diabetes is a metabolic disorder that affects the way that glucose is broken down in the body. In type I diabetes, insulin-producing islet~ cells are destroyed.Z.3 In type II diabetes, insulin is not produced in sufficient quantity to facilitate sufficient glucose uptake by the cells.2

'3 A major complication for both types of diabetes is cardiovascular disease

(CD).2 It is hypothesized that the fluctuation in physiological insulin concentration may be a major link between diabetes and CD onset.8

•9

• Platelets and endothelial cells possess an insulin receptor, which has been shown to upregulate eNOS production ofNO by a similar mechanism.33

•35

•38

•39 This modulates

conformational changes in heterodimeric integrins, such as anb~J types. • Insulin is hypothesized to have a greater effect on inhibition of thrombus formation as

arterial shear rate increases.Z7•48

'49 At low shear rates, GPib single-chain integrins are

responsible for adhesion to the RGD sites ofECM proteins, such as collagen type III.20,2

6

However, at higher shear rates (> 100 s·1) the platelet anb~J integrins are required for

thrombus formation.Z0•26 Thus, insulin effect increases as anb~3 integrins play more of a

role in thrombus formation.41

• Insulin has a neglible effect on thrombus formation at high shear rates (2000 s"1). This

may be due to near-optimum NO synthesis induced by high physiological levels of shear.

Cell Adhes-ion

Ca 2+

chan\1

. 0

lf . ~Receptors

"n,r;~.7 [33 Ib

Resting Platelet

Endothelial Cell Layer

FIGURES

Recruited Platelet---"""""':':

Exposed Collagen

Figure 1: Model of platelet adhesion to collagen type III surface. At high shear rates, platelets require vWF to mediate adhesion. Platelet-platelet binding through amPJ integrins is mediated through multivalent fibrinogen and vWF proteins.

60

Page 66: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Figure 2: Model for insulin inhibition of platelet aggregation. The insulin binds to a receptor on the surface of platelets, which signals the eNOS to produce more NO, thus resulting in inhibition of Ca2+ influx. ·

Lt') , __ .. <

1 (d :~;=::-':~ ·~· _:__j

1 0 sec 3 0 sec 6 0 sec .. - ..._ ... ::!'-,...-:.

-- .-;.;: _"'!'!:.

- -- -· - --

·~:~

9 0 sec 120 sec 180 sec

Figure 3: Images showing development of platelet thrombi (1000 s-1). These images are

representative of 5 separate experiments.

61

Page 67: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

80

70 r:::

.Q 60 "' 4)

.r:: 50 "'C

-+-100 s-1 (1:1 -40 4)

4) 30 -..!!! -11-500 s-1 -..t.-1 000 s-1 -ll-1500s-1~----~~~~-----------~-----=~~~~~

c. 20 ~

10

0

0 1 2 3 4 5

time (min)

Figure 4: Shear dependence of platelet coverage. At higher shear rates, there are decreasing rates of coverage due to more mechanical stress on GPib-substrate bonds or less contact time for permanent bonds to form.

80 --control

70 --200 uM LNAME CD 60 C) -.-sao nM Insulin ns ... CD 50 > .....,.200 uM LNAME + 0 (.) 500 nM Insulin - 40 CD

Ci) - 30 ns Q. ~ 20

10 -

0

0 1 2 3 4 5

time (min)

Figure 5: Insulin mediated NO production at 100 s-1• At low shear, insulin has a small

effect on platelet adhesion since GPib integrins can support bonding directly to the substrate without recruitment of other platelets through inside-out signaling mechanisms.

62

Page 68: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

100 .....,.control

90 .....,.200 uM LNAME

80 C) -.!r-500 nM Insulin C'G 70 a.. Q,) ""*'" 200 uM LNAME + > 60 0 500 nM Insulin u - 50 ~ Q,)

40 -C'G a. 30 ~ c

20

10

0 I

0 1 2 3 4 5

time (min)

Figure 6: Insulin mediated NO production at 1000 s-1• At medium shear, insulin promotes

inhibition of platelet coverage through eNOS production ofNO. This is indicated by a higher rate of platelet coverage from samples preincubated with eNOS-inhibitor L-NAME.

60 --control

---200 uM LNAME

Ill 50 Cl -soo nM Insulin ns ...

-200 uM LNAME+ Ill > 40 - 500 nM lnsu lin 0 (,) -Ill Qi 30 -ns Q. ~ 0

20

10

0

0 2 4 6 8 10

time (min)

Figure 7: Insulin mediated NO production at 2000 s-1• Insulin has a minimal effect on

eNOS production of NO due to near-maximum platelet NO production at high shear. This is indicated by the high rates of coverage for LNAME samples, as compared to control samples.

63

Page 69: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

REFERENCES

1. National Diabetes Information Clearinghouse. Diabetes Statistics. 2000.

2. National Diabetes Information Clearinghouse. Diabetes Overview [Web Page]. 2000.

3. National Diabetes Information Clearinghouse. Hypoglycemia [Web Page]. 2000.

4. Harris, S. B. Diabetes Care. 2000 Jul; 23(7):879-81.

5. Staehr, P. Diabetes. 2001 Jun; 50(6):1363-70.

6. Wingard, D. L. Diabetes Care. 1995 Sep; 18(9):1299-04.

7. Busse, R. Curr Hypertens Rep. 1999 Feb; 1(1):88-95.

8. Colwell, J. A. Diabetes Care. 1981 Feb 28; 4(1):121-33.

9. Colwell, J. A. Am J Med. 1983 Nov 30; 75(5B):67-80.

10. Bern, M. M. Platelet functions in diabetes mellitus. Diabetes. 1978 Mar; 27(3):342-50.

11. Anonymous. N Eng! J Med. 1993 Sep 30; 329(14):977-86.

12. Ross, R. N Engl J Med. 1999 Jan 14; 340(2):115-26.

13. Meyer, B. J. Circulation. 1994 Nov; 90(5):2432-8.

14. Nguyen, E. Y. Norman, OK 73019: UniversityofOklahoma; 1999: pp 78.

15. Shechter, M. JAm Coli Cardiol. 2000 Feb; 35(2):300-7.

16. Tsuji, S. Blood. 1999 Aug I; 94(3):968-75.

17. Barstad, R. M. Thromb Haemost. 1996 Apr; 75(4):685-92.

18. Alberio, L. Eur J Clin Invest. 1999 Dec; 29(12):1066-76.

19. Polanowska-Grabowska, R. Thromb Haemost. 1999 Jan; 81(1):118-23.

20. Konstantopoulos, K. Adv Drug Deliv Rev. 1998 Aug 3; 33(1-2):141-64.

21. Kehrel, B. Blood. 1993 Dec 1; 82(11):3364-70.

22. Wu, Y. P. Arterioscler Thromb Vase Bioi. 2000 Jun; 20( 6): 1661-7.

23. Geigy Scientific Tables. Ashley, NY: Geigy Pharmaceuticals; 1970; 7th ed.

24. Goto, S. J Clin Invest. 1998 Jan 15; 101(2):479-86.

25. Houdijk, W. P. J Clin Invest. 1985 Feb; 75(2):531-40.

64

Page 70: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Molecular Mechanics Calculations to Quantify Segmental Interactions in Bioerodible Polyanhydrides: Consequences for Drug Delivery

Introduction

Matt Kipper and Balaji Narasimhan •

Department of Chemical Engineering Iowa State University Ames, lA 50011-2230

*To whom all correspondence should be addressed

Predicting polymer-polymer miscibility and polymer-solute miscibility (where the solute is a low molecular weight molecule, such as a drug) is the goal of a large body of research [1, 2]. An application of such research is the design of polymeric controlled release devices for drug delivery. Here, the phase behavior of the polymer-drug system is important for two reasons. First, for multicomponent polymeric systems (i.e. blends and copolymers) a single homogenous phase or multiple phases may be desirable. It is important to know under what conditions the system will phase separate. Second, in microphase-separated polymer systems, dissolved drug thermodynamically partitions into a preferred phase. Drug partitioning can be used to increase drug solubility in the polymer, tailor drug release profiles, and stabilize macromolecular drugs. Thus, predicting how dissolved drugs will partition into a phase-separated system and how the drug may alter the phase behavior of the system are also important.

Phase separation is a thermodynamically driven process that occurs (when the temperature and pressure and composition of a system are fixed) when the Gibbs' free energy can be lowered by separating into two equilibrium phases. Thus, many models for phase behavior are based on predictions of the Gibbs' free energy of mixing. The Flory-Huggins lattice model is one of the most widely used because of its simplicity [3-5]. More recent models are extensions or modifications to the Flory-Huggins model. This model assumes that the segments of polymer chains are packed in a lattice, and that the two components of the system have segments of approximately equal volume. The Flory-Huggins model equation for the Gibbs' free energy of mixing for binary polymer systems is

(1)

The Gibbs' free energy here has units of energy per mole of volume segments. R is the ideal gas constant and T is the system temperature. cpA and cpa represent the volume fractions of components A and B respectively. NA and N8 are the number of moles of volume segments of A and B. The first two terms on the right-hand side of Equation 1 represent the combinatorial entropy associated with packing the given number of each type of volume segment in the lattice. The third term on the right-hand side of Equation 1 represents the enthalpic part of the Gibbs'

65

Page 71: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

free energy of mixing. X is the Flory-Huggins interaction parameter. Since the entropic part of the Gibbs' free energy is relatively small (the number of attainable conformations being greatly restricted by chain connectivity) and always favors mixing, the enthalpic part, dictated by x. is very important. If x is known for a system, the phase behavior can be predicted.

If volume changes on mixing are neglected, then x accounts only for the energy of mixing and is given by the equation

1 L\Emix x=z RT

(2)

Here, the energy of mixing is scaled as the Gibbs free energy is in Equation 1. The z' in Equation 2 is the effective coordination number of the lattice, the number of nearest neighbor sites for each lattice site, excluding sites occupied by immediate members of the reference segment's own chain. The interaction energy can be obtained from [2]

The energy terms on the right-hand side of Equation 3 are the pairwise interaction energies associated with pairs of volume segments indicated by the subscripts.

(3)

Pairwise interaction energies of the type used in Equation 3 can be obtained from molecular mechanics calculations on pairs of volume segments in vacuo. Monte Carlo simulations can be used to generate a large number of initial structures, whose local structures are allowed to relax until convergence criteria are reached. Likely conformations are selected based on the energy of the final conformations [2].

The simplicity of the Flory-Huggins model is obtained at the expense of making some significant assumptions about the microscopic structure of the polymer. These assumptions include

• no non-combinatorial entropic effects contribute to the Gibbs' free energy of mixing • there is no volume change on mixing • the volume segments are approximately equal in size • the volume segments are packed in an on-lattice configuration

These assumptions lead to the conclusion that the effective coordination number for polymers is always 4 when packed in a cubic lattice. Each volume segment has six nearest neighbor segments in a cubic lattice, two of which are immediate members of the same chain and therefore don't contribute to the energy of mixing.

We extend the applicability of the Flory-Huggins model to a polymer system of interest for controlled drug delivery applications wherein the polymer segments have unequal volumes. The polymer system of interest is blends of copolymers of sebacic acid, SA, and 1 ,6-bis-p­(carboxyphenoxy)hexane, CPH, which has been shown to microphase separate at temperatures of interest for certain copolymer compositions. The merits of this system for controlled release of drugs and the microphase separation are discussed elsewhere [6].

66

Page 72: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

The repeat units of this system are shown in Figure 1. In this system the ratio of the radii of the volume segments is about 1.6 to 1 (ratio of volumes is about 4.1 to 1 ), so the assumption that the volume segments are equal is not valid. Also, on-lattice packing for an amorphous system becomes difficult to rationalize if the volume segments are not approximately equal in size. Furthermore, it is conceivable that significant volume changes on mixing can occur as small segments may occupy the spaces between large segments that would be otherwise inaccessible in a phase-separated system. Finally, it is noted that if the volume of the system in not constant, the simple combinatorial entropy term must be modified to account for the change in volume. Thus, in order to account for volume segments of unequal size, all of the assumptions aforementioned inherent to the Flory-Huggins model must be relaxed. This, in turn leads to a composition dependence for z' and thus for X·

Figure 1: Structures for repeat units ofpolyCPH (top) and polySA (bottom).

Several approaches are available in the literature to model thermodynamics in polymer systems. For instance, Case et al. use atomistic simulations on polymer fragments [1]. This technique is accurate for the interaction energies, but the coordination number is left as an adjustable parameter, to which the model is quite sensitive. The work by Fan and coworkers uses a Monte Carlo packing algorithm to find z' [2]. This algorithm randomly packs monomer units around a reference monomer. Chain connectivity is accounted for by putting a 'dummy' atom at the two ends of each monomer to exclude volume that would otherwise be occupied by the immediate members of the monomer's own chain. The potential for a composition dependence ofz' is not included since the model is restricted to polymers and solvents with approximately equal sizes for the volume fragments. In this model, no objective convergence criterion is established to determine when a monomer is 'jammed', i.e. the value of z' when no more monomers can be packed around the reference monomer.

This work presents a Monte Carlo packing algorithm that can be used to calculate z' for a binary blend of copolymers of unequally sized volume segments when the ratio of the segment radii and the composition is known. The purpose of the model is to investigate the dependence ofz' on the composition variables (both copolymer and blend composition) and the ratio of the segment radii. The volume segments are modeled as mutually attracting hard spheres and are packed in trimer units around a reference trimer. Since the spheres are mutually attracting, periodic

67

Page 73: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

boundary conditions are introduced to prevent the system from gravitating towards the center. Results are presented for homopolymer blends of polyCPH and polySA.

Model algorithm The packing algorithm first constructs the reference trimer by randomly packing two monomer units on the surface of the reference monomer. These first two monomer units are selected based on the composition of the copolymer to which the reference monomer belongs and are members of the reference monomer's chain. They will not contribute to the coordination number, but they will exclude volume and participate in attractive interactions.

To determine the orientation of the first monomer in the trimer, two random numbers are generated and used to find a vector in space that defines the location of the first monomer on the surface of the reference monomer according to

e = 21t(r1) cp = 1-2(r2 )

(4)

Here r1 and r2 are random numbers between 0 and 1, and e and cp are angles defining the direction of the vector in space. A similar technique is used to pack the second monomer. Trimer units are then randomly packed around the reference monomer by first packing the center unit, in contact with the reference monomer, then packing the adjacent two monomers in the newly packed monomer's chain on its surface. The monomers constituting each trimer unit are randomly selected based on the composition variables of the system.

Each time a monomer is packed the distance to all previously packed monomers is checked. If the newly packed monomer is within the interaction distance to another monomer, it is attracted and translates toward the attracting monomer until their surfaces just touch. If the new monomer overlaps with a previously packed one, then a new random vector is generated and it is re­packed. When a new monomer fails to pack after a specified number of attempts, 't, the reference monomer is assumed to be 'jammed' and z' for this reference monomer is computed as the number of monomers within the interaction distance of the reference monomer (excluding the first two monomers packed). This process is iterated over a specified number of reference monomers and the mean z' is calculated for the system.

This algorithm introduces three additional parameters to the model: N, the total number of reference monomers packed around to obtain z'; L, the characteristic length of a unit cell; and 't the number of attempts to pack each monomer. The sensitivity to N can be dealt with by choosing N sufficiently high. If doubling N results in a change in the result by more than a few percent, N is too small. L must be sufficiently small that the density of the system is approximately uniform, but sufficiently large that monomers are not attracted by their images in the next cell. The most important parameter is 't. To find z' when the system is truly jammed, 't must be very large. To obviate the need to perform the packing algorithm with very large 't, data

68

Page 74: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

were collected for t varying over three orders of magnitude and fit to a power law given in Equation 5.

z' -z' = Kt-k 2t t

(S)

Here z'2t represents the value ofz' obtained when tis twice as large as the value used to find z' t· The values ofK and k are obtained from the fit and z' at very large tis computed such that the solution is converged to a sufficient tolerance (0.0001). This value is taken to be the 'jammed' z'.

Results and Discussion An example set of data for z' is shown in Figure 2 along with the fit to Equation 5. (The example shown is for a blend with a polyCPH mole fraction of 0.8). This procedure was performed for several blend compositions to obtain the composition dependence ofz'.

1.00

N' I 0.10 ~ N

0.01 1

' ~ Y = 1.2904x.o.3941 ~

R2 =0.9885

10 100 1000 10000

't

Figure 2. Example of procedure used to compute the value of the constants K and kin Equation 5. The data shown is for a blend ofpolyCPH and polySA with a mole fraction ofpolyCPH of0.8.

Figure 3 shows the composition dependence of z' for a homopolymer blend of CPR and SA. F is the mole fraction of CPR in the system. The values of z' for the homopolymers are approximately equal, as is expected. When F is close to 0, most of the monomer units are SA. When packing around an SA monomer, the occasional randomly selected CPR excludes volume and reduces the coordination number for the SA reference monomer. When the reference monomer is CPR, the higher surface area, coupled with the fact that most of the monomers selected to be packed are SA results in an increase in the coordination number by maybe one or two. However when a CPR is chosen to be packed, it may not find room, though there is space for an additional SA. The CPR reference monomer would be jammed. Furthermore, when

69

Page 75: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

computing the overall value of z', the value is weighted with respect to the overall blend composition, so the z' for the SA units dominates. Thus, the value for z' is slightly lower than for the homopolymers.

5.5

5.4 t------y = 1.5004x2- 1.4957x + 5.3642 ______ -l

R2=0.8335

5.3

~ 5.2 ;;: .e II .!:. 5.1 "N !:

5

4.9

4.8 0 0.2 0.4 0.6 0.8

MJie fraction of CFH in the polymer blend, F CPH

Figure 3. Results for the dependence of z' at infmite 't on the mole fraction of polyCPH in a blend of polyCPH and polySA.

When F is close to 1, for the CPH reference monomers, the occasional SA monomer excludes a smaller volume than the more probable CPH monomer but does not occupy so small a space that an additional CPH monomer unit can be packed. When the reference monomer is SA, the available surface area is small and most of the monomers packed are large CPH molecules which greatly reduces the value of z'. The overall z' for the system is again reduced from that of the homopolymers. There appears to be a minimum in the value ofz' at a blend composition near 50:50. The variation in the predicted z' is about ten percent. This corresponds to a ten percent variation in the prediction for the critical temperature from the Flory-Huggins model, which is significant. Additionally, variations in the value of x. with composition are observed experimentally for many polymer systems. This model postulates a possible origin of such a compositional dependence.

Preliminary molecular mechanics simulations have also been performed using the Insightii software package from Accelrys Inc. to obtain the interaction energies. A small number (2500) of each type of pair-wise interactions (A-A, A-B, and B-B) have been simulated. The energies obtained were weighted with a Boltzman distribution of the form [2]

(6)

70

Page 76: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

-------~--------

This distribution results in a temperature dependence for the interaction energies, where K is the Boltzman constant and pis the probability that the energy state ~+I exists, based on the value of the previous existing energy state Ei. A Monte Carlo algorithm selects the existing energy states for each temperature and computes an average interaction energy for that temperature. Linear temperature dependencies for the pair-wise interaction energies were obtained over the temperature range of interest. When combined using Equation 3, a linear function of temperature for the energy of mixing is obtained. Substituting this energy of mixing into Equation 2 results in a function for x. with temperature and composition dependence and two constants, A and B, of the form

(7)

In the form shown in Equation 7, x. decreases as temperature increases and predicts an upper critical solubility temperature (UCST) for the phase diagram. Thus, as temperature is increased, the phase-separated mixtures become miscible. The numerical values of the constants A and B are not quantitatively accurate. When performing this type of atomistic level simulation, quantitative results are not expected. Rather, as previously stated, the purpose of the model is to provide insight into the mechanisms that dominate the phase behavior of the polymer system of interest. The prediction of the shape of the phase diagram will be used to design experiments that can yield quantitative values for the constants.

Conclusions The Flory-Huggins model for binary polymer systems has been modified to extend its applicability to a polymer system wherein the sizes of the polymer volume segments are unequal. The assumptions of no volume change on mixing and on-lattice packing are also relaxed. These modifications result in a composition dependence for the coordination number, z', and ultimately a composition dependence for the Flory-Huggins interaction parameter X.· Atomistic level molecular mechanics simulations were perforrmed to predict the temperature dependence of the energy of mixing so that a functional form for x. could be derived. The results provide insight into the phase behavior of a polymer system that will ultimately aid in the design of controlled drug delivery devices.

References 1. Case, F. H., and J.D. Honeycutt. Will My Polymers Mix? Methods for Studying Polymer

Miscibility. Trends in Polymer Science. 1994; 2(8): 259-266,.

2. Fan, C. F., Olafson, B. D., Blanco, M., and S. L. Hsu. Application of Molecular Simulation to Derive Phase Diagrams of Binary Mixtures. Macromolecules. 1992; 25: 3667-3676.

3. Flory, P. J. Principles of Polymer Chemistry. Cornell University Press, 1953.

71

Page 77: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

4. Fried, J. R. Polymer Science and Technology. Prentice Hall, 1995.

5. Strobl, G. R. Physics of Polymers. Springer, 1996.

6. Kipper, M. K., Shen, E., Sywassink, A., and B. Narasimhan. Design of an Injectable System based on Bioerodible Polyanhydride Microspheres for Sustained Drug Delivery. Biomaterials (submitted), 2001.

72

Page 78: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Three-Dimensional Hydrophobic Cluster Analysis: The Use of a Virtual Environment for Protein Sequence Analysis: HELIX v0.3

Abstract

Anthony D. Hill, Alain Laederach, and Peter J. Reilly Department of Chemical Engineering

Iowa State University Ames, IA 50011-2230

Hydrophobic cluster analysis (HCA) allows simple, structurally important sequence infor­mation to be viewed and analyzed across species. Traditionally, HCA is performed by comput­ationally wrapping an amino acid sequence around an a-helix, then cutting and flattening the sequence. The resulting pattern of hydrophobic residues produces a fingerprint for the protein structure. These hydrophobic fingerprints can be compared across species to find likely structural patterns. This is a study in the use of a virtual environment to display HCA data. The amino acid sequence is wrapped around an a-helix and displayed in three dimensions. The helix appears before the users and can be placed alongside other three-dimensional helices. The use of an additional dimension in the presentation of the hydrophobic fingerprint allows for an additional human sense to be used in recognizing and matching patterns.

Introduction and Methods Many methods exist to display hydrophobic cluster analysis (HCA) in a virtual environment.

The methods range from using an HCA preprocessor to generate a three-dimensional metafile to be displayed by a dedicated virtual environment rendering program to writing a custom graphics processor to display HCA data. For the purposes of this project, we decided that the best option was to write a program to generate spatial coordinates needed to display HCA data on the fly and to use a library to interact with the virtual environment.

The Iowa State University (ISU) Virtual Reality Applications Center (VRAC) has developed a library of C++ classes, VRJuggler, to interface with virtual environments (Bierbaum, 2000). VRJuggler allows the programmer to use abstracted input and to display objects in a program's code. At run-time, VRJuggler attaches these abstracted inputs and displays to real input and dis­play devices. VRJuggler then handles all the complications of managing each input and display device. This approach has two major benefits: It allows the programmer to focus on the program code unique to that program's particular task, and it allows for a program to be written once (and for one set of input and display devices) and then run using a variety of devices (from conven­tional monitors and mice to three-dimensional tracking gloves and headsets). It is for these ben­efits that VRJuggler was used in the development of HELIX.

Although VRJuggler handles management of input and display devices, it does not actually render three-dimensional coordinates into displayed objects. To accomplish this task, another library, OpenGL, was used. OpenGL is a cross-platform three-dimensional graphics display lib­rary developed by Silicon Graphics (Segal and Akeley, 2001). Three-dimensional graphics hardware accelerators are commonplace on Windows, Macintosh, and a wide variety of UNIX computers. This three-dimensional graphics hardware allows for complex three-dimensional scenes to be rendered from simple three-dimensional coordinates and primitives quickly. It is because of the wide availability of OpenGL accelerators and its cross-platform nature that OpenGL was used to display the three-dimensional graphics.

73

Page 79: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

In its current implementation, HELIX v0.3 reads a text file containing the amino acid sequen­ces of interest in FAST A format. Each sequence is read and its amino acids are displayed as spheres in a right-handed spiral, with 3.6 amino acids per turn, or one amino acid residue each 100°. Each sequence's spiral is oriented vertically, with each subsequent sequence displayed to the right of the previous one. This display geometry was chosen because HCA aligns amino acid residues in this fashion into an a.-helix.

Amino acid residues are displayed as either large yellow spheres (hydrophobic residues) or small grey spheres (all other residues). In considering a display color scheme, several options were examined. The first option was to color each residue according to a large number of prop­erties, with each range of the color spectrum representing different properties, such as acidity, hydrophobicity, and ability to bond with other amino acids (Taylor, 1997). This scheme proved too complex to actually pick out patterns visually, as all the patterns were covered up in the noise of a rainbow column of spheres several stories tall. Therefore we decided to reduce the informat­ion shown to only hydrophobicity. Using smaller darker spheres for the non-hydrophobic resi­dues virtually made these residues disappear from notice, leaving only the hydrophobic patterns visible. So, we determined that the yellow-grey, large-small sphere color scheme enabled the user to find the same hydrophobic pattern across several species easily.

A wireless mouse with a motion tracker is used as an input wand. This wand is used to select amino acid sequences, rotate their representative columns, and move the columns vertically to align homologous regions of amino acids.

Results At this point, HELIX v0.3 has been tested by only a small number of people. All this testing

has taken place in the C6 virtual environment in the ISU VRAC. Sample input files of six to ten amino acid sequences from a family of j3-glycosidases have been used to test the qualitative abil­ity of researchers to find homologous hydrophobic amino acid patterns. At this point, the results are promising; the researchers were able to quickly find homologous hydrophobic patterns within the enzymes' primary sequences. Figures 1-2 show two screen captures of HELIX v0.3.

t\ ~ •• e:. (;, . • • • •• . - ·-a •o

? r . •• • • • •• • • •• • //~)1 •• • • ,/[ • •I ~- n •• t t

Figure 1. Screenshot of HELIX v0.3 displaying two aligned enzyme sequences.

74

Figure 2. Screenshot of HELIX v0.3 display­ing twelve nonaligned enzyme sequences.

Page 80: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

-------- ---------- ---

Future Work The goal of any computational research tool is to allow researchers to discover new informat­

ion. This project is no different; it seeks to allow researchers to better use their abilities of spatial pattern recognition and thought to analyze protein sequence and structure. HELIX is far from actually implementing all of the techniques that can be used to aid researchers. Currently com­puters can do quite a bit of homology detection through multiple sequence alignment, domain recognition, and motif recognition. Additionally, more senses can be used to detect homology: colors, shapes, and sound. As development of HELIX continues, these features will be added.

If HELIX is to be of more value than simply a novel way to view sequence analysis, it must enable the researcher to use this new information. A very likely way of using this information is to query the online protein databases SwissProt and GenBank in a BLAST -like manner, ranking similarity based on three-dimensional HCA patterns, rather than primary sequence. This could allow similarly folding domains to be found between sequences which are not as closely related over the whole protein.

Additionally, some further user interface refinements must be made. Users must be able to mark a hydrophobic pattern and preserve its relationship to similar patterns in other sequences if any sense is to be made of more than one hydrophobic pattern in a set of sequences. Along with this ability, a convenient method to save and restore a homology searching session is needed.

Conclusions Although HELIX v0.3 lacks many features that could make it a serious tool for research, it

shows a great deal of promise. HELIX uses a virtual environment to display protein sequence information in a manner that allows a researcher to use more of his or her senses to analyze in­formation. It already is able to help a researcher find homologous patterns without guidance. With the abilities to automate and guide the homology search and BLAST protein databases for similar patterns, HELIX should be able to explore territories in sequence analysis that other sequence analysis programs miss.

Acknowledgments The authors gratefully acknowledge the support of a Carver Trust Grant for this work, as well

as Greg R. Luecke for access to the Virtual Reality Applications Center.

References Bierbaum, A. VR Juggler: A Virtual Platform for Virtual Reality Application Development.

M.S. Thesis, Iowa State University, 2000. http://www.vtjuggler.org/pub//Bierbaum­VRJuggler-MastersThesis-final.pdf.

Segal, M. and Akeley, K. The OpenGL Graphics System: A Specification (Version 1.3). 2001. http://www .opengl.orgldevelopers/documentation/version 1 3/ glspec 13.pdf.

Taylor, W. R. Residual Colours: A Proposal for Aminochromatography. Protein Eng., 10:743-746. 1997.

75

Page 81: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

MECHANICAL PERFORMANCE OF ELASTIN-MIMETIC HYDROGELS

Eder D. Oliveira1, Sharon A. Hagan2 and Stevin H.Gehrke2

1Department of Chemical Engineering, Universidade Federal de Minas Gerais, Belo Horizonte, MG- 30 160-030 BRASIL 2Department of Chemical Engineering, 105 Durland Hall, Kansas State University, Manhattan, KS 66506 USA

Abstract Elastin-mimetic and polyamine hydrogels were used to investigate the effect of polypeptide

secondary structure on shear modulus as a function of polymer volume fraction. Although the elastin­mimetic protein undergoes changes in secondary structure as a function of temperature, the crosslinked material behaves as an ideal rubber under all conditions, as do most synthetic gels (e.g. poly(allylamine)). This stands in contrast with results from poly(a,L-lysine) hydrogels, where the change in secondary structure with pH altered the dependence of the modulus on polymer volume fraction.

Introduction Protein-based polymers, polymers of amino acids, pose several advantages over synthetic

polymers. The diversity of monomers spans twenty natural occurring amino acids as well as amino acids with modified side chains. Amino acid polymers can be synthesized through two pathways, chemically and with recombinant DNA, both having specific advantages and disadvantages. Using either synthetic pathway, precise control of polymer properties such as sequence, stereochemistry and chain length is easily achieved [1]. One of the most promising advantages of protein-based polymers is their ability to mimic the properties of naturally occurring proteins such as structure or function (e.g. biological recognition sites). Potential high biocompatibility, low cost bioproduction, and renewable raw materials are all advantages that are environmentally friendly [1]. One class of highly studied protein-based polymers is modeled after a recurring repeat sequence of native elastin [ 1-6]. Elastin­mimetic proteins have the potential to exhibit enhanced elastic properties and durability.

Hydrogels, water-swellable crosslinked polymer networks, made from elastin-mimetic proteins hold great promise as a biomaterial for a wide variety of biomedical applications, including responsive drug delivery devices, catheter coatings, tissue engineering matrices, and regeneration, replacement, or repair of vascular tissue [7 -1 0]. Such materials possess excellent biocompatibility, as shown in at least eleven different tests [7]. However, in general, poor mechanical properties are frequently the biggest limitation of hydro gels. Polypeptide gels can possess a variety of secondary structures (both rigid and flexible) not commonly seen in synthetic polymers; therefore, it has been hypothesized that the mechanical properties of polypeptide gels may be different from (and possibly superior to) gels of synthetic polymers [8, I 0]. Polypeptide gels also have the advantage of easily varying the secondary structure through manipulation of the amino acid sequence [7].

Responsive hydrogels shrink and swell with environmental stimuli such as pH, temperature and ionic strength [11]. These same stimuli can also induce secondary structure transitions in polypeptides. Figure I depicts the pH-dependent swelling of a poly(a,L-lysine) gel in aqueous solution. The uncrosslinked polymer exhibits a secondary structure transition (random coil to ~-sheet) for pH between 10 and 11 for this polymer concentration [11]. This secondary structure transition corresponds to the pH transition that the poly(a,L-lysine) gel exhibits in the swelling curve. Poly(allylamine) gels exhibit the same pH dependence with a transition between pH 6 and pH 8, but its free polymer does not have a structural transition.

76

Page 82: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

35

~30 < CT Q)

>~ 25 0) c

(1) 20 3: rn -0 15 (1)

~ g' 10

"'0 (1)

E 5 ::J 0 >

0

0

Iii

2

I

" . •

• measured while increasing pH

c measured while decreasing pH

4 6 pH

8

0

10

Ill 0

• ~

12

Figure 1: pH-dependent swelling ofpoly(a,L-lysine) gels in sodium chloride solution (pH< 5.0, pH> 11.4; !=0.05) or sodium p-phenolsulfonate buffer solution (5.0 :::;; pH :::;; 11.4; !=0.05). The gels are pH­responsive and show no hysteresis upon swelling and shrinking.

Theoretically, it has been predicted that the shear modulus of gels with flexible chains and flexible crosslinks (as in almost all synthetic polymer gels), varies with polymer volume fraction ~2 as G - ~2 113 • In contrast, gels of rigid rods connected by flexible cross links should have a much steeper dependence of modulus on the polymer volume fraction: G - ~l12 [12]. In fact, the log-log plot in Figure 2 shows that for a crosslinked gel of poly(a,L-lysine) under conditions of a random coil configuration G - ~2°·33:!:0·06, while under conditions of probable ~-sheet configuration G - ~2 1.50:!:0.2, matching theoretical predictions within experimental uncertainty [10]. In other words, poly(a,L-lysine) gels behave as flexible networks below their transition pH and behave as rigid networks with flexible junctions above their transition pH [10]. The poly(a,L-lysine) gels with the organized secondary structure also had moduli several times greater than the poly(a,L-lysine) gels with random coil configuration of the same polymer volume fraction. Poly( allylamine) gels, a similar polymer gel, behave as flexible networks, G - ~2 °·31

:!:0·01

, for all pH values (results not shown). Therefore, the modulus changes of poly(a,L-lysine) gels cannot be attributed solely to the effect of pH or polymer volume fraction. Elastin-mimetic polymers have been shown to undergo a transition from a random coil configuration to an organized secondary structure (~-spiral) upon an increase in temperature [7]. The hypothesis for this work is that for elastin-mimetic gels, this transition might be accompanied by a modulus shift analogous to poly(a,L-lysine) gels.

77

Page 83: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

10

1

• pH< 11.0 (swollen at variable pH)

c pH> 11.0 (swollen at variable pH)

• pH= 6.5 {partially dehydrated)

0.04 0.07 0.1 0.4

Figure 2: Shear modulus dependence ofpoly(a,L-lysine) gels on volume fraction of polymer. The gels behave as flexible networks below their transition at pH 11, and the gels behave as rigid networks with flexible junction above their transition pH.

Experimental Methods The elastin-mimetic protein used in these experiments was synthesized by genetic engineering of

E. coli, microbial protein expression, and purification to yield an artificial protein with a molecular weight near 90 kDa [9]. The protein sequence [(VPGVG)4(VPGKG)]4o, where Vis valine, Pis proline, G is glycine, and K is lysine, was modeled after a recurring sequence in native elastin (VPGVG) with the crosslinkable group lysine substituted in to every 25th position. After the expression of the protein in E. coli, the cells were disrupted using sanification and the protein was released into solution. The DNA was removed by precipitation with polyethyleneimine. Making use of the inverse temperature transition, the protein was salted out of the remaining solution using 1M sodium chloride at 37°C. The protein was recovered by redissolving the precipitate at 4°C in phosphate buffered saline. The temperature cycle was repeated to further purify the protein. After the second temperature cycle, the resulting solution was desalted using a centrifugal filter with a molecular weight cut-off of 300 kDa. The desalted solution was then freeze dried to yield a white powder form of the protein.

Elastin-mimetic hydrogels have been synthesized in several ways in the literature [7-9]. The goal of this synthetic procedure is to produce gels using a well-defined precursor polymer and a well­controlled crosslinking scheme. For this study, elastin-mimetic gels were synthesized by crosslinking the lysine groups of [(VPGVG)4(VPGKG)]4o by a condensation reaction with disuccinimidyl suberate in anhydrous dimethylacetamide. Gel disks were prepared by combining appropriate amounts of 0.1 g/mL polymer solution and 0.038 g/mL crosslinker solution (molar ratio of polymer amino groups to crosslinker was 2). Gelation occurred at 3°C in -10 minutes. After 24 hours, the gel disks were

78

Page 84: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

removed from the mold and placed in a I 0 v/v% solution of ethylenediamine in anhydrous dimethylsulfoxide for 24 hours at room temperature to terminate any unreacted N-hydroxysuccinimidyl groups remaining inside the gel with an amino group.

Poly(a,L-lysine) and poly(allylamine) hydrogels were synthesized with a similar procedure, but they were crosslinked by a base-catalyzed addition reaction with N,N-methylene-bis-acrylamide and without the subsequent termination reaction. Swelling curves were obtained from mass measurements after equilibration in aqueous solution for 24 hours at each pH or temperature. Shear moduli were obtained from uniaxial compression experiments.

Results and Discussion The linearity of Figure 3 for a plot of stress vs. strain indicates ideal elastic behavior [13]. Ideal

behavior is further verified through the calculation of the Mooney-Rivlin constants C2, which is a direct measure of the departure of a network from ideality [13]. The Mooney-Rivlin constants are virtually zero for elastin-mimetic gels at all conditions of temperature, pH, polymer volume fraction, and crosslink density tested.

60

50

40 ro a.. 30 ~

"'5" 20

10

0 0 0.2 0.4 0.6 0.8 1

la-a-21

Figure 3: Stress-strain isotherm of elastin-mimetic gel obtained at pH=ll.O, T=l5°C, and llleq/Illdry= 1.96, where G = shear modulus, cr = nominal stress, and a = ratio of the deformed length, ~ to the undeformed length, £0 • Linearity of the plot indicates ideal elastic behavior.

Similar to the pH-dependence of poly(a,L-lysine) gels, elastin-mimetic gels exhibit a temperature-dependence, contracting with increasing temperature, as demonstrated in Figure 4. The slight increase in swelling with reduction in pH seen at low temperatures indicates the presence of some ionic species, either unreacted lysine groups or singly-reacted crosslinker groups terminated with ethylenediamine. The uncrosslinked polymer is proposed to undergo a secondary structure transition (random coil toP-spiral) around 27°C.

79

Page 85: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

--------- -- ------

14

~ 12 E -0"

·..,.. .pH= 6.0 1- •• -• e pH= 11.0 -Q)

10 E ~- -.. -C) c:

Ci> 8 3: en - 6 0 Q)

~ 4 C) Q)

"'0 en 2 en <tl

.. ... - ..... • -•

~- • -

~~~ -

1- -

1- ~ • • •• ::: 0

I

0 10 20 30 40 50 Temperature, oc

Figure 4: Temperature- and pH-dependent swelling of elastin-mimetic gels in unbuffered sodium hydroxide or sodium chloride solution (open symbols represent a second trial at pH=ll). The gels are temperature responsive; the slight pH-dependence shows that some free amine groups exist.

After equilibration at 20°C or 40°C, the polymer volume fraction of the elastin mimetic gels was reduced to varying levels by partial dehydration in order to test the theoretical predictions of modulus dependence on polymer volume fraction [12]. This is tested in Figure 5 for elastin-mimetic gels above and below the transition temperature, where the reduced shear modulus is used to eliminate the dependence of modulus on temperature [13-14]. The reduced modulus is defined as the ratio of the measured modulus to the limiting modulus of a perfect elastic dry network prepared in bulk conditions, PtRT, where Pt is the theoretical crosslink density of the network [14]. The log-log plot shows that the elastin-mimetic gels behave as flexible networks both above and below the transition temperature, despite the proposed secondary structure change. Thermoelasticity experiments also indicate that elastin-mimetic gels behave as typical rubbers at all temperatures [14].

Summary and Conclusions Elastin-mimetic gels behave as flexible networks and display ideal rubber elastic behavior at all

pH, temperature, and polymer volume fractions tested. This is similar behavior to poly( allylamine) gels, but in contrast to the poly(a,L-lysine) gels which exhibit a change in network behavior upon crossing the structural transition pH of the polymer. It was shown that the rigid secondary structure of the poly( a,L­lysine) polymer can improve the mechanical properties of protein-based gels, but the f3-spiral structure proposed for the high temperature configuration of the elastin-mimetic polymer appears to lack the rigidity necessary to alter the mechanical performance of its gel. Despite the differences in protein­based gel performance, it is evident that protein-based hydrogels display elastic behavior distinct from that observed in conventional vinyl polymer gels.

80

Page 86: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

----------~-------

0.1 0.09 0.08 0.07

~ 0.06 a: -(!) 0.05

0.04

0.03

0.02

• T = 20°C c T =40°C

0.4

·------ Theoretical __ Experimental

0.7 cll2

1

Figure 5: Dependence of reduced shear modulus on polymer volume fraction for elastin-mimetic gels above and below the transition temperature (at pH=ll). A single least squares line (solid line) fits all the data points with an exponent that closely matches the theoretical scaling law (dashed line) for flexible networks [12].

References I. D Urry. Journal of Physical Chemistry B. 101 (1997) 11007-11028. 2. D Gowda eta/. International Journal of Peptide and Protein Research. 46 (1995) 453-463. 3. C Guda eta/. Biotechnology Letters. 17 (1995) 745-750. 4. B Li eta/. Journal of Molecular Biology. 305 (2001) 581-592. 5. H Reiersen eta/. Journal of Molecular Biology. 283 (1998) 255-264. 6. J Rodriguez-Cabello et al. Biopolymers. 54 (2000) 282-288. 7. D Urry et a/. in Biotechnological Polymers, C Gebelin, Ed., Lancaster, PA: Technomic Publishing,

1993, 82-103. 8. E Welsh, D Tirrell. Biomacromolecules. 1 (2000) 23-30. 9. R McMillan, V Conticello. Macromolecules. 33 (2000) 4809-4821.

10. E Oliveira, S Gehrke. Polymer ?reprints. 41 (2000) 745-746. 11. B Davidson, G Fasman. Biochemistry. 6 (1967) 1616-1629. 12. J Jones, C Marques. Journal of Physics France. 51 (1990) 1113-1127. 13. BErman, J Mark. Structures and Properties of Rubberlike Networks, New York: Oxford University

Press, 1997. 14. E Oliveira, PhD Dissertation, University of Cincinnati, 2000.

Acknowledgements This work was supported by NASA Microgravity Biotechnology Program Grant 97-HEDS-02-082, and a fellowship to EDO from Conse1ho Naciona1 Desenvolvimento Cientifico e Tecnol6gico - CNPql BRAZIL. Dr. Vincent P. Conti cello, Emory University, Atlanta GA, provided the biosynthetic elastin, bacteria clones, and helpful advice.

81

Page 87: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Multiple Sequence ~~nment and Phylogenetic Analysis of Fa y 1 (3-Giycosidases

Abstract

Anthony D. Hill, Alain Laederach, and Peter J. Reilly Department of Chemical Engineering

Iowa State University Ames, IA 50011-2230

~-Glycosidases are important enzymes in the digestion of ~-linked glycosidic polymers, pri­marily in the breakdown of plant cell-wall tissue. A rough multiple sequence analysis of glyco­side hydrolases divides the known sequences of ~-glycosidases into two families, Family 1 and Family 3. These families are differentiated by sequence and structure, with Family 3 consisting of ~-glycosidases primarily from fungi and prokaryotic organisms. Family 1 contains enzymes of many substrate specificities: glucosidases, mannosidases, galactosidases, and even a myrosinase. This study further analyzes the ~-glycosidases in Family 1 using multiple sequence and phylo­genetic analysis. Accurate subgrouping of the enzymes within Family 1 allows representative three-dimensional structures to be chosen so that automated docking can model the interaction of each subfamily with multiple ~-linked substrates.

Methods A multiple sequence alignment of functionally related enzymes is useful for two reasons.

First, it would be expected that the functionally important amino acid motifs would be strongly conserved across multiple species. Finding regions of high homology helps to highlight poten­tially important amino acids. Second, the multiple sequence alignment is used to guide the con­struction of a phylogenetic tree, which shows how the each amino acid sequence is related to each other aligned sequence. These relationships can be used to group enzymes into sequence­related subfamilies.

As it is published, the amino acid sequence of an enzyme is added to online sequence data­bases like Swiss-Prot (Gasteiger et al., 2001) and GenBank (Benson et al., 2000). These seq­uences can be downloaded to a personal computer or workstation for use. The sequences used in this analysis are a subset of the sequences categorized as Family 1 ~-glycosidases (Henrissat, 1991). CAZy is an online database of carbohydrate enzymes categorized by sequence similarity (Coutinho and Henrissat, 1999). From CAZy, the Family 1 ~-glycosidases were screened so that fragments, open reading frames, and large precursors were not included. Using only the sequen­ces of known functional enzymes eliminated unrelated variation in the multiple sequence align­ment. The remaining sequences were downloaded from both Swiss-Prot and GenBank as linked to by CAZy (Coutinho and Henrissat, 2001).

The method for determining an optimal alignment of two sequences according to a scoring function is described completely by a computational algorithm (Smith and Waterman, 1981). This method is well known and can be computed in time proportional to the product of the num­ber of letters in each sequence. Extrapolating Smith and Waterman's algorithm to a multiple sequence alignment would again require time proportional to the product of the number of letters in each sequence. It is easily seen that this results in an exponential growth in the time required to compute an alignment of multiple sequences. Such a global solution is unreasonably comput­ationally demanding, so heuristics to solve multiple sequence alignments have been developed

82

Page 88: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

and are available in several released programs. One such program is Clustal W and its graphical intetface Clustal X (Thompson et al., 1994). These programs were used to automate the process of finding an optimal alignment.

It is important to note that any method of solving an optimal sequence alignment requires a scoring function. This defines the metric by which the quality of alignment is judged. To align protein sequences, this metric is most easily defined as the probability of mutation from one seq­uence to another. The total probability of mutation is obviously the product of the probability of mutation at each amino acid. For simply comparing optimal metrics and when desiring smaller amounts of memQry to be used to store the metric, the sum of the probability of mutation at each amino acid can be used. A substitution matrix simply defines the probability of mutation from, one amino acid to another, as well as the probability of insertion or deletion of amino acids. Such a matrix has been determined theoretically (Kimura, 1980), but for practical purposes a matrix determined by a broad sample of nature is more accurate (Jones et al., 1992).

In conjunction with manual alignment of the known catalytic residues using the program SeaView (Galtier et al., 1996), Clustal W was used along with the Gonnet PAM 250 substitution matrix (Gonnet et al., 1992) and the following programs (Table 1) to find the multiple sequence alignment used for sequence and phylogenetic analysis.

Table 1. Program descriptions. Program Purpose Proml Computes a maximum likelihood phylogenetic tree for each of the

bootstrapped multiple sequence alignments. Protdist Computes the alignment score between each pair sequences in an

alignment. Seq boot Creates many sets of the multiple sequence alignment, each with

random permutations in the amino acid sequence, simulating data samples from the population. This process is known as bootstrapping.

Drawtree Outputs the graphical representation of the tree. Con sense Finds a consensus tree from each of the trees created by passing the

bootstrapped multiple sequence alignment through Proml.

Phylogenetic trees are computed as the inost likely evolutionary path. This is accomplished by guessing the most likely parent sequences that could mutate into a pair of the child sequences that are observed today. Each pair of parent sequences in tum has its parent sequence guessed, and so on until only one parent sequence remains. The sequences of these parent sequences and their relation to the individual child sequences are optimized so that the probability of the series of mutations occurring is maximized.

One such method for computing this tree is called maximum likelihood. This attempts to find a phylogenetic tree that displays the property its name implies: maximum likelihood of occur­ring. The algorithm does so by using tabulated amino acid transition probabilities and calculating branch occurrence probabilities to find the probability of the phylogenetic tree in question occur­ring. This overall probability is then maximized to find the maximum likelihood tree (Felsenstein and Churchill, 1996).

PHYLIP is a collection of programs that implement the computer algorithms necessary to construct a phylogenetic tree from a multiple sequence alignment (Felsenstein, 1993). These programs are listed in Table 2.

83

Page 89: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

The multiple sequence alignment produced by Clustal X was used as input to Seq boot. The relevant parameters used are also listed in Table 1:

Table 2. Seq boot parameters. Option Value Explanation Sequence, Morph, Molecular The input sequences are molecular sequences. Rest., Gene Freq? Sequence Bootstrap, Jack- Bootstrap The purpose of running seq boot is to bootstrap the knife, or Permute ~eguences

Block size 1 The frequency of site change appears independent. How many 100 A data pool of 100 samples is beginning to get replicates? statistically significant, without being

computationally impossible

Seq boot simulates having taken more samples from the population of sequences. It achieves this result by creating new data sets from an input data set by the method of bootstrapping. Boot­strapping samples N characters at random with replacement from the data set, so that the new data set has the same number of characters as the original, only some characters have been re­moved and others have been duplicated. Bootstrapping is statistically typical of sampling new data sets (Kiinsch, 1989). The parameters used are listed in Table 2.

The multiple sequence alignment was bootstrapped to 100 data sets. For each data set, the maximum likelihood tree was found using Proml (Table 3).

These trees were then used as input to Consense (Table 4), which found the majority consen­sus tree representing all data sets. By using a consensus tree, only inner nodes that are statistic­ally distinct are differentiated. Two nodes that are statistically close enough to be the same are replaced by one node with several branches, more closely modeling the behavior found in nature.

In addition to finding a phylogenetic tree, the bootstrapped data set was analyzed by Protdist (Table 5) to find the pairwise distance between each species. The distances over the species set were averaged using Excel and the standard deviations are reported as well. The resulting matrix lists the distance between each sequence as an expected percent of amino acids changed from sequence to sequence.

The majority consensus tree file was used as input to Drawtree. The resulting PostScript file was edited in Adobe Dlustrator to make the labels of the sequen­

ces more complete and meaningful and to add color behind the tree to denote the different sub­families. Division into subfamilies was a qualitative judgment to find the natural breaks between them. This process is easily aided by finding a common trait among each member of a subfamily.

Results The multiple sequence alignment results in the homologous residues being aligned. The cat­

alytic portions of the alignment are pictured in Figure 1. It can be seen there that the catalytic acid, a glutamic acid, and is fully conserved throughout the species, except for the myrosinases, where a glutamine is aligned in place of a glutamic acid. From the literature previously pub­lished, it is not apparent which residue is the proton donor in myrosinases.

As shown in Figure 1, the catalytic base is fully conserved, with the exception of the third A thaliana f3-glucosidase. It appears that the DNA sequence that was translated has an error resul-

84

Page 90: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Table 3. Promll!_arameters. Option Value Ex_planation Distance Matrix ITT The Jones-Thompson-Taylor distance matrix is a

matrix based on the observation of a very large set of amino acid sequences. The XX distance matrix is based on a smaller set of amino acid sequences, while the XX distance matrix is based purely on theoretical _probabilities.

Speedier, but No, not This option uses more CPU cycles, but produces rougher analysis rough a more accurate tree. Global Yes After the tree is created by building it one seq-rearrangements uence at a time, this option tells the program to

remove each sequence and re-add it optimally, refining the tree even more.

Randomize input Yes, 4n+ This adds the sequences in a random order. The order of 1, 1 4n + 1 random number seed starts the random sequences number generator. The seed should be different

for each data set, so that the same random pro-gression is not always followed, and should be of the form 4n + 1, because other numbers quickly collapse to the same random number progres-sion. One jumble is sufficient, given the rearr-an_g_ements.

Write out trees Yes The tree file is used as input to consense and onto a tree file eventually to produce the phylogenetic tree of

interest.

Table 4. Consense parameters. Option Value Ex_planation Consensus Type Majority This will create a tree containing each species,

rule yet will be consistent with the majority of the (extended) trees in the bootstrapped data set.

Table 5. Protdist ~arameters. Option Value Explanation Distance Matrix ITT The Jones-Thompson-Taylor distance matrix is a

matrix based on the observation of a very large set of amino acid sequences. The XX distance matrix is based on a smaller set of amino acid sequences, while the XX distance matrix is based Q_urely on theoretical probabilities.

85

Page 91: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

ting in a translation to glutamine rather than glutamic acid. Overall, the sequence shows very high homology; none of the sequences differing by more

than 3.6901 PAMs. The distance between each amino acid sequence can be seen in Figures 2-5. However, in spite of the high homology between sequences, the actual enzymes demonstrate a wide variety of substrate specificities.

Figure 6, the phylogenetic tree of Family 1, demonstrates that sequentially the enzymes are more related by taxonomic family than by substrate specificity. However, within each taxonomic subfamily, smaller subfamilies based on substrate specificity can be seen. Figure 6 has been pattern-coded to show the subfamilies apparent within Family 1. These subfamilies were identified qualitatively, noting distinct breaks within the tree. Accordingly, subfamilies from four sources are observed: one from plants and fungi, a second mostly from thermophilic bacteria, a third from bacteria primarily specific to {3-glucosyl polymers, and a fourth from bacteria that are primarily specific to {3-phosphoglycose chains. Within each subfamily the variations in specific­ity are also color-coded, showing the subfamilies within subfamilies.

Within the plant subfamily (white dots), a small grouping ofmyrosinases (dark grey) is seen. It is interesting to note that while this subgrouping has a different specificity, and is accordingly grouped together, it resides in the midst of the entire plant subfamily. This phylogenetic place­ment indicates that much stronger sequence similarity exists between the myrosinases and other plant enzymes than between {3-glucosidases from different species subfamilies.

It can be seen that there exists a fair amount of sequence difference between enzymes spec­ific to {3-phosphoglycoses and those specific to other {3-glycoses. The {3-phosphoglycose-specific bacterial subfamily (pyramids) lies opposite the {3-glycose-specific bacterial subfamily (light grey) on the phylogenetic tree. The separation in the phylogenetic tree indicates a strong sequence divergence to enable specificity towards {3-phosphoglycosyl polymers. Smaller groupings can be seen within the {3-phosphoglycose-specific bacterial subfamily. The enzymes specific to P-{3-galactose (white) and P-{3-glucose (pyramids) have distinct sequence differences, as indicated by their separation into subgroups.

Another important feature of the phylogenetic tree is the separation of enzymes from thermo­philic bacteria from other bacteria. An entire subfamily of thermophilic bacteria is indicated (dark grey) and a thermophilic subgroup is seen in the {3-glycose-specific bacterial subfamily (swirled dots). The distinction between thermophilic and non-thermophilic bacteria indicates that significant sequence and therefore structural differences exist that enable the enzymes to be more thermostable. Although it can be seen that the enzymes are sequentially more related by taxonomy than by specificity, it is not a far stretch to represent each subfamily with one of the available three-dimensional structures found by crystallization. The subfamily clustering by substrate specificity indicates there is also a need to use several crystal structures to represent not just taxonomic differences, but also substrate specificity differences between enzymes.

Future Work Obviously, more thorough methods must be used to determine the causes of substrate

specificity among the enzymes. One such method is computational docking. For computational docking, three-dimensional crystal structures must be used. As the phylogenetic analysis shows, the available crystal structures represent Family 1 well.

The available crystal structures will be used to dock {3-glycoside di-mers into the active site. The resulting conformations should allow the determination of residues and motifs important to substrate specificity.

86

Page 92: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

~­~ ~ ~IBIICJSidase ~ ~

SP~ ~mase ~

SP~ ICXlEiidase SP~ ICXlEiidase

SP~ SP~:m:tase SP~ ICXlEiidase SP~ ICXlEiidase

34

203 179 209 213 206 208 196 210 191 192 208 211 190 210 209 265 243 250 171 170 170 169 171 171 170 168 169 169 169 249 201 173 183 176 183 172 172 170 171 171 214 213 215 212 212 216 216 213 159 167 165 165 165 165 165 181 177 176 176 179 178 184 185 202 182 180 184

417 393 423 426 422 424 412 419 424 507 421 423 411 413 426 474 458 464 381 371 377 353 355 359 367 365 342 342 342 456 382 366 387 363 382 360 359 358 356 357 390 376 393 391 391 418 418 402 324 316 379 379 379 379 379 377 366 365 365 372 370 376 382 393 383 381 381

Figure 1. Multiple sequence alignment around the catalytic acid (left) and catalytic base (right). The residue numbers correspond to positions within the full multiple sequence alignment, displayed in Appendix A. Unless otherwise noted, enzymes are f3-glucosidases.

87

Page 93: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

r-=-'l.Aio--~ '2~-

':!~-·--··-~-·--._,...,. ·---·opw.-·~-...

,,.,..,;..; 1=.:' 1::::.-

gg ru--·Qootw-•A..,.IIIIJ\e

·z...,. ._,.. __ ·r-.-"GIMI ,_.

~~1~IL53tii1-ILmi••I1..,.11-ILmiLmlwti1-IL~ILml1-l1-l1-l1-l1•l~mln~

ual1.•lt.014I1 .... 11.t11I1.4Jtl1.al '"'711.11171 , ... 11.5131 L7:1211-~l1.eool1.ml Laai1.141I1.79SI1.eool ~~1

t.CWOI1-I1.aai1.771111-I1.IIQOI1a.IU0811.781I1.718I1.N211.eool 1.84111 .... 11.77111 L1131~713I~387IO.~

2.0C4It..,lt•lt.•l t.ml t.•l t.mlt.•l tNOit.l7t I 1.8.141 t.82el 1.1571 t.-1 •-•I ual 0.7471 o.ooo

2.0041 Ll2211 ... l1.181l U0711.408I1.eetl Lt10(1.1180I1.""'1 L ... l 1.83011.83411 ...... 11.1241 L0171 0.~

t.mi1~1Leoo(LWIL17811-I1-IL11171LMI1-I1-ILGIL~IW71Lmld~

W3ILD(Lbi1-I~IL~IL~IL~IL79SIL~IL•IL~I~IWII~

L79SI~I•miL~I••I•387ILM41L~ILaiL~ILmi•~ILmln~

L&I~I~I1-I~IU301~1~1~1~1~1Lml~~

t.ICMI UI05lt.7S7l t.OU1lt.734l t.-4011 U13lt.787l1.ee&lt.OIS21t.50tl O.OO'J

LMI1.178I Ut1IU10IU81I1.231I1.304I1.877I L""'l1.817l d~

2.11811.12411.11171 L7881 L78711.408l1 .... 1 a1021 aeool o.~

2.1.12.0111112011111 LDI1.183I1 . ...,11 .... I0.3UIO.~

2.228l1.180l2.d841 L79SI Ll8411.408l1.""'1 d~

1.oe111.11oe11 .... 1 L04811 . ..,1 u•ln~

t.740it.al•.•lt.31Mit.t7TI o.cm

•.• l1.7mlt.7fl71o.•tl0.000

2.08111.1241 1.8701 0.~

L048 I 0.8401 o.~

UG!OI 0.000

0.~

"lhormcllosll._,_,

"lhormcllosllnMIInw

•Cloobltfun--

·~~

·~ ... j2.71riilrul~

- ........ ,_.,.,..oqiJIJiicus ~ ·-... '•Giolllomnlo .....

1.38311.1i3)11.340IU11171 L730I2.410I1.584I U2711.1117(1.aa(1.183(1.584( 1.872f1.831(1.337(1.302f1.211(0.171,0.~1

L&ILCIL387(L~(Lm(t.302IL~(L~(~(~(L~ILe(~ILm(Lmi1-IL~Io.~l

U17lt.Sf71t.212lt.t12) UW0)2.2118)t.8B6)t.fil41 UDO) Uil41 1.8QO)t.Mit.711ltDI1.321)1.DI 0.000

1 ..... 1 L!ll511.53tll L3441 L731l2.413(1.781l1 .... (1.101l1.al1 ... 1 L818IU1112I L""'l 0.11121 0.~

~l~l~l~l~lt.CI~I~I~I~I~I~I~IU~Io.~

~~·~l~l~lwol~l~l~l~l~lmelo.cwol~lo.~

1.781l1.llll211 .... 1 L4Jtl L88tl2.408( L-11.3481 1.8211 1412(1.8141 0.1181 0.~

L03011.102l1.!03l 14271 U101I2430I1.~11 .... IU14I1.321I1.788I o.~

Ul51(2.031(1.8153(1.111(2.041(2.137(2.131(1.4tllt.853lt.s:tzl 0.000

Figure 2. Plant subfamily distance matrix. . .,.,.._~ ·-- 1.~11.7:12lteool1.mi1.~I2.32Sit7841 1.3271 Uti I o.~

1.eetl1.~11.-l1.434l1.733l2-l1.788l1.185l 0.~

·~ ... •-•11 ...... 11.811lt•l1.eoel2.323l1.1117l o.~

·~ ... 1.72211.85411 .• 11.mlt7t1l u111~~ ·-- 2.:11115l2.441l2318lt.:llllll23711~~

i'2.-pd)nJoot 1.584(1.815(1.12211.1411 d~ "Jiaii .. __ _

U4711.820I1.•Io.~ ·-- t.otol 0.78010.0110

"1.-pd)nJoot 1.0&11 0.000 ·-... ..~

Figure 3. P-f3-galactose and P-f3-glucose bacteria subfamily distance matrix.

Page 94: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

00 \0

\~~~,_,,~:~~~·~<\~,~~

\\\~\~\\\~\,,\\ nWn

2.420 22111 2.314 2324 2.270 2.293 2.323 2.312 22111 .a L.acloooa:us lactis

2.432 2248 2.325 2335 2292 2.321 2284 2.250 2.248 - 2.332 2198 2.275 2.362 2309 2339 2.218 2200 2ZT7 ;a

L.aclobaciHus acK10pf1I/UIJ 2.495 2308 2420 2312 24SO 2410 2.348 2.321 2308

iUICtobacillus ca. 2.551 2.438 2.538 2.481 2.475 2.381 2.478 2.388 2.398 .a

lf:111'6t1Xl0Ga.S (8801111 1.490 1.511 1.827 1.843 1.844 1.1151 1.531 1.525 1.818 .a

~ .,. 1.318 1.334 1.287 1.224 1.227 1.155 0.1163 0.858 0.597

~o.)'IOCB 1.198 1.333 1.281 1.322 1.227 1.118 0.888 0.828 0.520

1.Esch8rlcNa OJii 1.320 1.388 1.310 1.371 1.227 1.178 0.983 0.854 0.000

• Bad/Ius IUllillis 1.089 1.051 1.098 1.200 0.974 1.(124 0.819 0.000

uosurotm /o{jspot1lfl 1.255 1.245 1.239 1.148 1.108 1.157 0.000

2.~eoli 1.255 1.253 1.380 1.324 1.318 0.000

• Lactobacillus {Jatt811171 1.223 1.281 1.303 1.255 0.000

2.l.tlclobaQHus gasserl 1.358 1.334 1.2811 0.000 oA

1:~gasserl 1.024 0.997 0.000

1:!801/usstbins 0.775 0.000 '"A .

3. ESciiiJifCIWa COli 0.000

A.P.I\.nh"""""""'

Figure 4. ~-Glucosidase bacterial subfamily distance matrix.

2.219 2333 2418 1.240 0.708 0.339 0.191

2198 2.254 2439 1.197 0.875 0.348 0.000

2243 2.287 2485 1.198 0.833 0.000

2392 2.408 2560 0.1125 0.000

2.375 2.392 2.891 0.000

1.714 1.899 0.000

0.505 0.000

0.000

o.oool

....-

----~~ IJ..<Mgo!dpo P}toooor:us fiM1osus

ilmiWaaaxaidi

~

P}toooor:us

P}toooor:us--

P)IOOOh)ll

'7hllrllllOOt>:UUp.

·~trJCIIBI

2.555 2.117

2.1123 2.373

2.812 2.510

2.793 2.550

2.539 2.109

2.545 2.113

2.7118 2.198

2.214 (1000

0.000

1.784 1.858 1.854 1.255 1.283

1.782 1.717 1.714 O.ecl2 0.588

1.801 1.878 1.875 0.128 0.000

1.807 1.870 1.887 0.000

1.043 0.004 0.000

1.043 (1000

0.000

Figure S. Thermophilic bacteria subfamily distance matrix.

1.2401 0.000

0.0001

Page 95: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Figure 6. Phylogenetic tree of Family 1 f3-glycosidases. Asterisks denote f3-glucosidases. Crystal structures exist for Sulfolobus solfactarius f3-galactosidase, Lactococcus lactis 6-P-f3-galacto­sidase, Sinapis alba myrosinase, and Trifolium repens, Zea mays, Streptomyces sp., Bacillus circulans, 1. Bacillus polymyxa, Thermospheara aggregans, and Pyrococcus furiousus ~-glucosidases.

90

Page 96: Proceedings of the Thirty-first Annual Biochemical ...Proceedings of the Thirty-First Annual Biochemical Engineering Symposium The Thirty-First Annual Biochemical Engineering Symposium

Conclusions Family 1 f3-glycosidases exhibit a variety of specificities. In order to understand the elements

causing specificity, more research than simple sequence and phylogenetic analyses must be per­formed. However, the current analyses support the use of the available three-dimensional crystal structures to computationally dock the various substrates into each enzymes active site. Each of the four subfamilies, and more importantly, five of the seven subfamily clusters are represented by crystal structures.

References

Benson, D.A., Karsch-Mizrachi, 1., Lipman, D.J., Ostell, J., Rapp, B.A., and Wheeler, D.L. GenBank. Nucleic Acids Res. 28:15-18. 2000.

Coutinho, P.M., and Henrissat, B. Carbohydrate-active enzymes: An integrated database approach. In Recent Advances in Carbohydrate Bioengineering, H.J. Gilbert, G. Davies, B. Henrissat, and B. Svensson, eds., The Royal Society of Chemistry, Cambridge, pp. 3-12. 1999.

Coutinho, P.M. and Henrissat, B. Carbohydrate-Active Enzymes. URL: http://afmb.cnrs-mrs.fr /-cazy/CAZY /index.html. 2001.

Felsenstein, J. PHYLIP (Phylogeny Inference Package) version 3.6a2. Distributed by the author. Department of Genetics, University of Washington, Seattle. 1993.

Felsenstein, J., and Churchill, G.A. A Hidden Markov Model approach to variation among sites in rate of evolution. Mol. Biol. Evol. 13:93-104. 1996.

Galtier, N., Gouy, M., and Gautier, C. SeaView and Phylo_win, two graphic tools for sequence · alignment and molecular phylogeny. Comput. Applic. Biosci., 12:543-548. 1996.

Gasteiger, E., Jung, E., and Bairoch, A. SWISS-PROT: Connecting biological knowledge via a protein database. Curr. Issues Mol. Biol. 3:47-55. 2001.

Gonnet, G.H., Cohen, M.A., and Benner, S.A. Exhaustive matching of the entire protein seq­uence database. Science 256:1443-1445. 1992.

Henrissat, B. A classification of glycosyl hydrolases based on amino-acid sequence similarities. Biochem. J. 280:309-316. 1991.

Jones, D.T., Taylor, W.R., and Thornton, J.M. The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 8:275-282. 1992.

Kimura, M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16: 111-120. 1980.

KUnsch, H.R. The jackknife and the bootstrap for general stationary observations. Ann. Statist. 17: 1217-1241. 1989.

Smith, T.F. and Waterman, M.S. Identification of common molecular subsequences. J. Mol Bioi. 147:195-197. 1981.

Thompson, J.D., Higgins, D.G., and Gibson, T.J. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. June 1994.

91