BHW EMC Data Archive POC Notes

download BHW EMC Data Archive POC Notes

of 4

Transcript of BHW EMC Data Archive POC Notes

  • 8/2/2019 BHW EMC Data Archive POC Notes

    1/4

    EMC Data Archive POC Notes

    Architecture:The initial architecture was to use Rainfinity to write data to a Celerra and then Kazeon would

    read the target Celerra for search capability.

    Legal Data

    Drag and drop

    Archive DataDrag and drop

    Rainfinity

    EMC SM POC Target

    Celerra

    Kazeon

    Production SM Source

    Celerra

    During the POC the architecture changed due to the limitation of Kazeon and Rainfinitys abilityto archive data from a non-shared system. So the new architecture was to use the Celerra to

    present a public share which uses could access to drag and drop the files for archive to. After the

    files has been moved to the Celerra share Kazeon would then index the data for search capability

    and lastly Rainfinity would archive the data to another filesystem for long term retention.

    Legal Data

    Archive Data

    Rainfinity

    EMC SM POC TargetCelerra

    Kazeon

  • 8/2/2019 BHW EMC Data Archive POC Notes

    2/4

    Installation:

    KazeonThe installation of Kazeon was very simple and quick after the system was racked we

    were active for testing within a few hours.

    CelerraThe NAS was setup and ready by the end of the day for configuration. There were no

    major problems with this setup.RainfinityThere was not SE that was available for the installation. Once we did get someone

    to install the RF appliance we still needed another person to configure the appliance. The

    appliance took 1.5 days to get configured this seems a lot more complex then I would havefigured. The one thing to keep in mind though is that this was a one-time setup, after the setup

    the policy to archive the data was easy to configure. Also to note was that the architecture was

    modified during the setup of this process of how we would archive.

    POC Test:Objective testingThis test is simply a test of the objectives outlined in the RFI.

    Objective 1 - We need a solution that must provide the means to automate the archiving ofstructured and unstructured data to a centralized storage location based on IT corporate policies.

    Objective Achieved and TestedWe could move unstructured data to a NAS share where itwas then archived for long term retention. I was unable to test structured data due to lack of time

    and equipment availability.

    Objective 2 - We need a solution that must provide the ability to collect and track all

    incremental changes of structured and unstructured active data. The collected data must then be

    stored on a centralized secondary storage location while preserving the file date information and

    preventing the destruction of data for means of long term legal retention.

    Objective Achieved and TestedUsing Kazeon we were able to scan file shares for data

    requested for long term retention. After which a copy of the found data could be copied and

    retention applied for long term retention on an incremental basis.

    Objective 3 - Provide the capability to search through collected data as identified in the 2nd

    bullet with a future e-discovery process.

    Objective Achieved and TestedUsing Kazeon to index data we were able to search data atboth metadata level and a document contents level searching.

    Objective 4 - Capability to assign ownership of data when archived.

    Metrics:

    Copy of data from PC to archive file system - 21GB2 hours

    Kazeon basic scan of files on archive file system21GB2 min

    Kazeon deep scan of files on archive file system21GB

    Rainfinity archive to legal file system21GB30 min

  • 8/2/2019 BHW EMC Data Archive POC Notes

    3/4

    RFI Testing Criteria:

    1. Can your solution de-duplicate against all the data that it stores? The Celerra filesystemcan de-duplicate. This a post process

    2. Does your solution support Oracle, MS SQL and/or Unix (Redhat Linux and HP-UX)?The data written can be either NFS or CIFS data so as long as the data to be archived can

    write the data to this location it can be recovered at a later date. But it should be notedthat this whole process would be manual it is unable to do this via tools or automation.

    3. Can the network be throttled during ingestion of archive data? No, this is not done withany tool set with this EMC solution. You would have to utilize Qos rules to make anysort of bandwidth limitations.

    4. Can replication solution support a remote site greater than 500 miles away? Yes theCelerra can replicate to another Celerra.

    5. Can the data in the repository be searched? Yes, Kazeon can index both source anddestination data. There is a basic index which pulls only meta data and there is also a

    deep index which will search inside documents. The searching with Kazeon if very good

    and provides a lot of information very easily.

    6. Are servers required for your solution? No, the equipment needed at the production siteare 1 Celerra (NAS), Kazeon appliance and Rainfinity appliance, all of which are selfcontained in a rack.

    7. Can your solution achieve all the project objectives? Yes all project objectives can bemet.

    8. Does your solution require an agent? No agents are required data is accessed by beingshared to the network.

    9. Can you set quotas on storage? Yes you can set quotas on the Celerra for storagerequirements.

    10.Can identified data for recovery be written to removable medium? Yes identified data

    can be copied down to the desktop or exported into a legal extraction csv file.

    Support Matrix

    Description EMC

    solution that must provide the means to automate the

    archiving of structured and unstructured data to a centralized

    storage location based on IT corporate policies.

    x

    solution that must provide the ability to collect and track allincremental changes of structured and unstructured active data.The collected data must then be stored on a centralizedsecondary storage location while preserving the file dateinformation and preventing the destruction of data for means of

    long term legal retention.

    x

    Provide the capability to search through collected data as

    identified in the 2nd

    bullet with a future e-discovery processx

    Capability to assign ownership of data when archived x

    Can your solution de-duplicate against all the data that it

    stores?x

  • 8/2/2019 BHW EMC Data Archive POC Notes

    4/4

    Does your solution support Oracle, MS SQL and/or Unix

    (Redhat Linux and HP-UX)?x

    Can the network be throttled during ingestion of archive data?

    Can replication solution support a remote site greater than 500

    miles away? x

    Can the data in the repository be searched? x

    Are servers required for your solution?

    Can your solution achieve all the project objectives? x

    Does your solution require an agent?

    Can you set quotas on storage? x

    Can identified data for recovery be written to removable

    medium?x

    Thoughts & Perceptions: