Information Technology in Support of the Simulation of ...nom/Papers/ndeer2002_slides.pdf ·...
Transcript of Information Technology in Support of the Simulation of ...nom/Papers/ndeer2002_slides.pdf ·...
Information Technology inSupport of the Simulation of
Natural Organic Matter(NOM) in the Soil
Yingping Huang Xiaorong XiangEric Chanowich Gregory Madey
Computer Science & EngineeringUniversity of Notre Dame
AbstractWe describe an application of infrastructure, queryOptimization, data warehousing and data miningtechnologies in support of the simulation of NaturalOrganic Matter (NOM) in the soil.
We present an agent-based stochastic simulation ofNOM transformations. It employs advancedinformation technologies such as J2EE, Oracle,data warehousing and data mining to improve thereliability and scalability of the stochasticsimulations and to facilitate analysis of the resultinglarge datasets.
Natural Organic Matter• NOM is ubiquitous in terrestrial, aquatic and marine
ecosystems– Results from breakdown of animal & plant material in the
environment• Important role in processes such as
– compositional evolution and fertility of soil– mobility and transport of pollutants– availability of nutrients for microorganisms and plant
communities– growth and dissolution of minerals
• Important to drinking water systems– Impacts drinking water treatment– Impacts quality of well water
Focus of the Modeling
Soil
Ground Water
Surface WaterNOMs — Microbes
InternetInternet
Remote Clients/Servers
gemini.cse.nd.edu
Intel Dual 400Win2K Server
OC4J/Orion ServerReports Server
joy.cse.nd.edu
Intel Dual 800Redhat 7.2
OC4J/Orion Server
tenor.cse.nd.edu
Intel Dual 800Redhat 7.2
OC4J/Orion Server
simu2.worldfoyt.cse.nd.eduIntel Dual 400
Redhat 7.2Oracle9i 2
Data Mining
etech.worldsymphony.cselab
Intel Dual 733Win2K Server
Oracle9i 2Data Miningmynom.world
bigband.cselabIntel Dual 733
Solaris 8Oracle8i
HTTP HTTP
JDBC
JDBC
Web Interface/Reports
Database Servers/Data Mining Servers
Application Servers/Simulation Running
The Simulation Infrastructure
NOM 1.0• Distributed systems
– 2 Application servers (Orion Servers)– 3 Database servers (Oracle)– Reports server (OC4J Server/Reports Server)
• Load balancing (round robin based on computational needs)– application servers & database servers
• Fail over– application servers & database servers
• Why fail-over (Assume down probability p for each machine)– No fail-over
• Simulation system down probability: 1-(1-p)2 = 2p-p2
– With fail-over• Simulation system down probability: 1-(1-p2)(1-p3) = p2 + p3 – p5
– Improvement:• 2/p = 200 if p=0.01 (the smaller p, the larger improvement)
The Simulation Model• Agent-based stochastic simulation• GUI version - stand alone
– Animation of molecules• Web-based version
– Using OC4J server & Oracle Reports– Oracle database servers
• Load-balancing & fail-over– Goal: efficiency, availability & reliability
• Data warehousing & Data Mining– Goal: data/pattern analysis
GUI AnimationBlack - No AdsorptionGray – Adsorption SitesRed - LigninBlue - ProteinGreen - CellulousYellow - ReactedOrange - Adsorbed
Insertion and Query PerformanceComparison
5 seconds0.0107Withaggregations
>0.5 hour0.0122Withindexes
>1 hour0.0106No indexes
No aggregation
Query TimeInsertion
(sec/row)
Scenario(>16million)
Data Mining & MicelleFormation
Maurice, 1999
NOMRings
Conclusion &Acknowledgement
• Contributions are– New model which treats NOM as a heterogeneous
mixture– Simulation system with advanced web & database
tools– System aspects of implementation of load-
balancing and fail-over– Basic data mining features
• Acknowledgement– Steve Cabaniss, Chemistry, UNM– Patricia Maurice, CE/GEOS, UND