Major repositories using the Arctos database:
description
Transcript of Major repositories using the Arctos database:
ArctosA multi-institution, multi-
collection museum databasehttp://arctos.database.museum
http://code.google.com/p/arctos/
Major repositories using the Arctos database:(34 collections of specimens or observations, 1.3M records)
in partnership with
which is a member of
TeraGrid – A nationwide network of 11 supercomputing facilities
U. S. National Science Foundation’s
Office of Cyberinfrastructure
which is sponsored by
Arctos: A 15 year history MVZ: 1995 - 1995 - Hired Stan Blum to develop relational data model (following
modeling by Assoc. Systematic Collections).
MVZ: 1997 - Hired John Wieczorek to implement model (desktop application) using Sybase and Versata. Partial implementation (e.g., no loans).
UAM: 1998-2000 - John W. migrated mammal data to Oracle, set up Versata.
UAM: 2002 - Dusty McDonald replaced Versata with ColdFusion, implemented full model (first web-based instance, aka Arctos).
MSB: 2003 – Joined Arctos at UAM (first multi-hosting instance).
MVZ and MCZ: 2005-2007 - Implemented separate instances of Arctos at Berkeley and Harvard (MVZ: first Postgres, then Oracle).
MVZ: 2009 - Moved hosting of data to Alaska (Virtual Private Database version).
Arctos is an ongoing effort to integrate access to museum data, collection management tools, and external resources on the Web.
Specimen Catalog
label data (and more)
Projectscontribute and/or
use specimens
Accessions Loans,usage
Publicationscite specimens
GenBank
Federated portals
Arctos
BerkeleyMapper
“Media” in TeraGrid
The rest ofCyberspace
Citations
Arctos Philosphy Maximize access to high quality information about Maximize access to high quality information about collections on the web.collections on the web.
Integrate different kinds of information (specimen data, Integrate different kinds of information (specimen data, DNA sequences, media, etc.) into easily and visually DNA sequences, media, etc.) into easily and visually retrievable records.retrievable records.
Facilitate management of data by curators and collection Facilitate management of data by curators and collection managers. managers. Collaboration (development, curatorial) is key Collaboration (development, curatorial) is key to success.to success.
Show value of specimens for scientific research to the Show value of specimens for scientific research to the public through web resources.public through web resources.
Breadth of Data in Arctos Fish, amphibians, reptiles, mammals, birds and bird Fish, amphibians, reptiles, mammals, birds and bird eggs/nests, plants, arthropods, fossils, molluscseggs/nests, plants, arthropods, fossils, molluscs Specimens and observationsSpecimens and observations Media (images, audio)Media (images, audio) Publications, fieldnotesPublications, fieldnotes
Arctos constantly evolving to incorporate new kinds of Arctos constantly evolving to incorporate new kinds of data, e.g.,:data, e.g.,: Better representation of non-publication documents Better representation of non-publication documents (fieldnotes, correspondence)(fieldnotes, correspondence) Cultural collections (art, anthropology...)Cultural collections (art, anthropology...)
Nearly all that is known about an object (or observation) can be included in Arctos.
Linking specimen records to archival documentation…
Access and Security Virtual Private Database (Row-Level Security) allows Virtual Private Database (Row-Level Security) allows
collections to maintain control over data while sharing collections to maintain control over data while sharing certain nodes (taxonomy, agents, geography).certain nodes (taxonomy, agents, geography).
Access to data is controlled by user privileges (collections, Access to data is controlled by user privileges (collections, tables).tables).
Public can query without user account. More options if Public can query without user account. More options if logged in.logged in.
Business rules enforced at database (not application) Business rules enforced at database (not application) level.level.
Arctos is…A versatile online collections management system
Cataloged Items (ID, attributes, parts, etc.; batch uploading, downloading, editing; encumbrances)
Localities & Collecting Events (mapping, media, history)
Transactions (loans, accessions, borrows, permits; email reminders)
Usage (publications, projects, sponsors, GenBank)
Curatorial (object tracking, parts, condition, relations, etc.)
Determination history (identification, georef, attributes)
Arctos is…Modular and Pluggable
Projects and Publications
Media (local, external URI)
External Services (GenBank, TACC, MorphBank, CalPhotos, Ubio, etc.; ) – Collaborative
DiGIR (ORNIS, HerpNET, MaNIS, VertNET, etc.)
Arctos is…Customizable
Specific features, e.g., Attributes (A cataloged item can A cataloged item can have any number of attributes, customized to collections. A have any number of attributes, customized to collections. A generic solution.)generic solution.)
Users (search, results, identifiers, download; save and email searches)
Managers (users, headers, CSS, reports)
Arctos is…Open
Open source code
Built upon community needs
Compartmentalized (choose what you want to share)
Community input valued (user group, code issue list)
Responsive development model (bugs generally fixed within hours, requests within days to weeks unless major new feature) – community discussion drives development
Arctos is…Economical
$15,300/year hosting (currently split by 3 institutions):• secure environment• dedicated web server and database server• server and database administrators• Oracle 11G• offsite backups in Alaska, TACC, San Diego
1 full-time developer
1 part-time DBA
Lots of input from users
Arctos Multihosting Shared CodeShared Code Shared CostShared Cost Shared ExpertiseShared Expertise
CuratorialCuratorial TechnicalTechnical
Location not important
(electrons are fast!)
MCZ
?
GoalRun Arctos as a single system from one or more mirrored Run Arctos as a single system from one or more mirrored locations, locations,
from which each participating institution can be from which each participating institution can be represented to its users and operators as an exclusive represented to its users and operators as an exclusive instance through row-level security, instance through row-level security,
but in which vocabulary, programming, and physical but in which vocabulary, programming, and physical resources are shared. resources are shared.
(No technical issues to adding collections. Social issues (No technical issues to adding collections. Social issues in multihosting.) in multihosting.)
Governance and Operating Support Steering Committee and Advisory Group with
representatives of each institution contributing to operational budget. Steering Committee (institutional Directors or designees):
budget, proposals for funding, proposals to connect to external resources, addition of collections, etc.
Advisory Group (curators): priorities for development - Chair of Advisory Group oversees activity of developer(s), reports to Steering Committee
Operating costs shared among institutions, funded by outside grants - identify developments that require outside funding vs. in-house support
Summary Cost-effective access to a deeply relational, highly
expandable database capable of recording all aspects of specimen curation, running on Enterprise-caliber hardware and software, managed by professional IT staff.
Community of experienced users provide input into data structure and new development. Burden of development and operation shared among institutions.
All Arctos data are online. The public can query it, privileged users manage it through web browsers.
Demonstrates and documents the scientific context of specimens (projects, documents, media, etc.)…more than just label data.