BHW EMC Data Archive POC Notes
-
Upload
barry-wright -
Category
Documents
-
view
217 -
download
0
Transcript of BHW EMC Data Archive POC Notes
-
8/2/2019 BHW EMC Data Archive POC Notes
1/4
EMC Data Archive POC Notes
Architecture:The initial architecture was to use Rainfinity to write data to a Celerra and then Kazeon would
read the target Celerra for search capability.
Legal Data
Drag and drop
Archive DataDrag and drop
Rainfinity
EMC SM POC Target
Celerra
Kazeon
Production SM Source
Celerra
During the POC the architecture changed due to the limitation of Kazeon and Rainfinitys abilityto archive data from a non-shared system. So the new architecture was to use the Celerra to
present a public share which uses could access to drag and drop the files for archive to. After the
files has been moved to the Celerra share Kazeon would then index the data for search capability
and lastly Rainfinity would archive the data to another filesystem for long term retention.
Legal Data
Archive Data
Rainfinity
EMC SM POC TargetCelerra
Kazeon
-
8/2/2019 BHW EMC Data Archive POC Notes
2/4
Installation:
KazeonThe installation of Kazeon was very simple and quick after the system was racked we
were active for testing within a few hours.
CelerraThe NAS was setup and ready by the end of the day for configuration. There were no
major problems with this setup.RainfinityThere was not SE that was available for the installation. Once we did get someone
to install the RF appliance we still needed another person to configure the appliance. The
appliance took 1.5 days to get configured this seems a lot more complex then I would havefigured. The one thing to keep in mind though is that this was a one-time setup, after the setup
the policy to archive the data was easy to configure. Also to note was that the architecture was
modified during the setup of this process of how we would archive.
POC Test:Objective testingThis test is simply a test of the objectives outlined in the RFI.
Objective 1 - We need a solution that must provide the means to automate the archiving ofstructured and unstructured data to a centralized storage location based on IT corporate policies.
Objective Achieved and TestedWe could move unstructured data to a NAS share where itwas then archived for long term retention. I was unable to test structured data due to lack of time
and equipment availability.
Objective 2 - We need a solution that must provide the ability to collect and track all
incremental changes of structured and unstructured active data. The collected data must then be
stored on a centralized secondary storage location while preserving the file date information and
preventing the destruction of data for means of long term legal retention.
Objective Achieved and TestedUsing Kazeon we were able to scan file shares for data
requested for long term retention. After which a copy of the found data could be copied and
retention applied for long term retention on an incremental basis.
Objective 3 - Provide the capability to search through collected data as identified in the 2nd
bullet with a future e-discovery process.
Objective Achieved and TestedUsing Kazeon to index data we were able to search data atboth metadata level and a document contents level searching.
Objective 4 - Capability to assign ownership of data when archived.
Metrics:
Copy of data from PC to archive file system - 21GB2 hours
Kazeon basic scan of files on archive file system21GB2 min
Kazeon deep scan of files on archive file system21GB
Rainfinity archive to legal file system21GB30 min
-
8/2/2019 BHW EMC Data Archive POC Notes
3/4
RFI Testing Criteria:
1. Can your solution de-duplicate against all the data that it stores? The Celerra filesystemcan de-duplicate. This a post process
2. Does your solution support Oracle, MS SQL and/or Unix (Redhat Linux and HP-UX)?The data written can be either NFS or CIFS data so as long as the data to be archived can
write the data to this location it can be recovered at a later date. But it should be notedthat this whole process would be manual it is unable to do this via tools or automation.
3. Can the network be throttled during ingestion of archive data? No, this is not done withany tool set with this EMC solution. You would have to utilize Qos rules to make anysort of bandwidth limitations.
4. Can replication solution support a remote site greater than 500 miles away? Yes theCelerra can replicate to another Celerra.
5. Can the data in the repository be searched? Yes, Kazeon can index both source anddestination data. There is a basic index which pulls only meta data and there is also a
deep index which will search inside documents. The searching with Kazeon if very good
and provides a lot of information very easily.
6. Are servers required for your solution? No, the equipment needed at the production siteare 1 Celerra (NAS), Kazeon appliance and Rainfinity appliance, all of which are selfcontained in a rack.
7. Can your solution achieve all the project objectives? Yes all project objectives can bemet.
8. Does your solution require an agent? No agents are required data is accessed by beingshared to the network.
9. Can you set quotas on storage? Yes you can set quotas on the Celerra for storagerequirements.
10.Can identified data for recovery be written to removable medium? Yes identified data
can be copied down to the desktop or exported into a legal extraction csv file.
Support Matrix
Description EMC
solution that must provide the means to automate the
archiving of structured and unstructured data to a centralized
storage location based on IT corporate policies.
x
solution that must provide the ability to collect and track allincremental changes of structured and unstructured active data.The collected data must then be stored on a centralizedsecondary storage location while preserving the file dateinformation and preventing the destruction of data for means of
long term legal retention.
x
Provide the capability to search through collected data as
identified in the 2nd
bullet with a future e-discovery processx
Capability to assign ownership of data when archived x
Can your solution de-duplicate against all the data that it
stores?x
-
8/2/2019 BHW EMC Data Archive POC Notes
4/4
Does your solution support Oracle, MS SQL and/or Unix
(Redhat Linux and HP-UX)?x
Can the network be throttled during ingestion of archive data?
Can replication solution support a remote site greater than 500
miles away? x
Can the data in the repository be searched? x
Are servers required for your solution?
Can your solution achieve all the project objectives? x
Does your solution require an agent?
Can you set quotas on storage? x
Can identified data for recovery be written to removable
medium?x
Thoughts & Perceptions: