Difference between revisions of "Procedure Disaster Recovery Plan"

From D4Science Wiki
Jump to: navigation, search
(Created page with " == Disaster Recovery Plan == Disasters can be classified into two broad categories. The first is natural disasters such as floods, hurricanes, tornadoes or earthquakes. Whi...")
 
 
(One intermediate revision by the same user not shown)
Line 8: Line 8:
 
=== Strategies ===
 
=== Strategies ===
 
* use of high availability systems which keep both the data and system replicated on-site and off-site, enabling continuous access to systems and data, even after a disaster.  
 
* use of high availability systems which keep both the data and system replicated on-site and off-site, enabling continuous access to systems and data, even after a disaster.  
* use of Hybrid Cloud solutions that replicate both on-site and to off-site data centers. These solutions provide the ability to instantly fail-over to local on-site hardware most of the D4Science services, but in the event of a physical disaster at the main D4Science data center, servers can be brought up in two additional data centers that are federated to D4Science.
+
* use of Hybrid Cloud solutions that replicate both on-site and to off-site the main D4Science data center. This solution provides the ability to instantly fail-over to local on-site hardware most of the D4Science services, but in the event of a physical disaster at the main D4Science data center, servers can be brought up in two additional data centers that are federated to D4Science.
 
* backups made at regular intervals of all storage devices (the maximum interval for two consecutive backups is one day).
 
* backups made at regular intervals of all storage devices (the maximum interval for two consecutive backups is one day).
 
* replication of service to an off-site location, which overcomes the need to restore the service (only the data need to be restored or synchronized).
 
* replication of service to an off-site location, which overcomes the need to restore the service (only the data need to be restored or synchronized).
 +
 +
In addition to preparing for the need to recover systems, D4Science also implements precautionary measures with the objective of preventing a disaster in the first place. These include:
 +
 +
* local mirrors of systems and/or data and use of disk protection technology such as RAID;
 +
* surge protectors — to minimize the effect of power surges on delicate electronic equipment;
 +
* use of an uninterruptible power supply (UPS) and backup generator to keep systems going in the event of a power failure;
 +
* fire prevention/mitigation systems equipped with alarms and fire extinguishers;
 +
* firewall and network frameworks to avoid intrusion and attacks.

Latest revision as of 18:56, 27 March 2018

Disaster Recovery Plan

Disasters can be classified into two broad categories. The first is natural disasters such as floods, hurricanes, tornadoes or earthquakes. While preventing a natural disaster is impossible, risk management measures have been applied to avoid disaster-prone situations. The second category is man-made disasters, such as hazardous material spills, infrastructure failure, bio-terrorism, and disastrous IT bugs or failed change implementations. In these instances, monitoring, testing, and mitigation planning have been also defined.

Strategies

  • use of high availability systems which keep both the data and system replicated on-site and off-site, enabling continuous access to systems and data, even after a disaster.
  • use of Hybrid Cloud solutions that replicate both on-site and to off-site the main D4ScienceAn e-Infrastructure operated by the D4Science.org initiative. data center. This solution provides the ability to instantly fail-over to local on-site hardware most of the D4ScienceAn e-Infrastructure operated by the D4Science.org initiative. services, but in the event of a physical disaster at the main D4ScienceAn e-Infrastructure operated by the D4Science.org initiative. data center, servers can be brought up in two additional data centers that are federated to D4ScienceAn e-Infrastructure operated by the D4Science.org initiative..
  • backups made at regular intervals of all storage devices (the maximum interval for two consecutive backups is one day).
  • replication of service to an off-site location, which overcomes the need to restore the service (only the data need to be restored or synchronized).

In addition to preparing for the need to recover systems, D4ScienceAn e-Infrastructure operated by the D4Science.org initiative. also implements precautionary measures with the objective of preventing a disaster in the first place. These include:

  • local mirrors of systems and/or data and use of disk protection technology such as RAID;
  • surge protectors — to minimize the effect of power surges on delicate electronic equipment;
  • use of an uninterruptible power supply (UPS) and backup generator to keep systems going in the event of a power failure;
  • fire prevention/mitigation systems equipped with alarms and fire extinguishers;
  • firewall and network frameworks to avoid intrusion and attacks.