Register | Log in


Subscribe Now>>
Home News Tech2Tech Features Viewpoints Facts & Fun Teradata.com
Applied Solutions
Download PDF|Send to Colleague

Raise the BAR

New backup, archive and restore capabilities enable reliable data security, storage and retrieval.

by Richard Leatherman and Jim Dietz

As the real-time operational nature of active data warehousing evolves and the volume of data increases, data availability becomes critical to organizations making real-time business intelligence (BI) decisions. The increasing movement for these businesses to capture detailed data from more areas of the operation and for saving data for extended periods exemplifies the need to safely store and maintain that data.

Figure 1: Disk backup products
enlarge
Disk backup products offer the reliability of disk systems to provide predictable backup and restore time windows.

Regulatory compliance requirements mandating the long-term retention of data also make archiving increasingly imperative. Sarbanes-Oxley, the Health Insurance Portability and Accountability Act and the European Union Data Protection Directive, among other compliance regulations, are extending the need for long-term data retention—sometimes for as much as 10 years.

This all means that precautionary safeguards against unexpected data loss must be in place along with the means for storing and maintaining data to adhere to regulations. To ensure an organization's data is safe and readily accessible, it must be routinely backed up without causing unnecessary or excessive downtime. This adds up to a greater need for high-performing backup, archive and restore (BAR) solutions, especially for organizations with business-critical and large data repositories. BAR solutions must offer greater database availability, recoverability and protection in today's data warehousing environments.

Backing up to go forward
Backing up data is no longer just a minor housekeeping chore but a critical strategy. With the amount of data growing exponentially and with the increasing use of the data warehouse for real-time operations, business continuity is critical, and the recovery of data during loss incidents and data security is more important than ever. Without current or accessible data, companies cannot continue to function well.

Besides, data unavailability is costly—the longer a company is without its data, the more detrimental the effect. If data is lost, re-creating it and rebuilding data repositories from multiple sources is resource- and time-consuming and reconstructing BI data is difficult. An effective BAR solution, therefore, must be assessed as a business-critical function.

With an effective BAR solution, business continuity is ensured with these capabilities:
Data protection. Preservation of data off-line
Recoverability. Rapid restoration of data to its full integrity in the event of user errors, disasters, hardware failures, etc.

Teradata BAR partners and provisions
> BakBone's NetVault is highly integrated with the Teradata platform.
> Symantec's NetBackup is an integrated solution that is a standard industry platform.
> Sun Microsystems is the industry's leading tape backup vendor.
> LSI provides disk storage products.
> EMC provides the virtual tape library appliance that uses disk but emulates most standard tape libraries and drives.
> Protegrity provides a robust encryption solution for backup, archive and restore processing.
 
—R.L.


Figure: Components of the Teradata BAR solutions
enlarge
Teradata partners with six technology suppliers to produce an industry-leading BAR solution.

BAR must also conform to system recovery objectives, which are predetermined measurements of the amount of time and age of the data the company can afford to be without during a system failure. They are defined as:
Recovery time objective (RTO). Time and service level limits to which a data warehouse and its data must be restored to avoid the unacceptable consequences associated with extended breaks in business continuity
Recovery point objective (RPO). Data's age, or the point in time to which that data must be restored, before its loss or the need to re-enter it is considered intolerable to users

To clarify, a company might establish an RTO of 10 hours (the amount of acceptable system downtime) and an RPO of 24 hours (the oldest acceptable age of the data).With an RPO of 24 hours, the system must be backed up every 24 hours so if it loses data and has to be restored from the backup, users will have access to data that was created no longer than one day prior. With an RTO of 10 hours, the data restoration must be accomplished within the 10-hour time window of the objective. Both measurements help determine the optimum configuration of the backup and recovery solution for a company. These objectives then guide the selection of the actual BAR solution components:
Hardware. Tape and disk storage devices
Software. Storage management and encryption
Services. Planning, implementation and support of a system

Storage devices
Organizations have several choices in backup storage devices: Data can be backed up to tape, conventional disks, virtual tape libraries (VTLs) or, preferably, using a combination of tape and disk technologies. The decision is based on a number of factors, including:
The amount of data and frequency of backup
The time window in which data must be backed up or restored
Accessibility of archived data
Off-site or on-site storage capacity

For instance, a company with long business hours and an actively used data warehouse will have a smaller window in which to back up the data. If this company has a 24-hour RPO, the appropriate BAR solution must be one that enables a daily backup with smallest impact on database access.

The following describes the different types of storage devices available to users:

TAPE, THE MAINSTAY
Tape backup devices are the traditional and most widely deployed solution for data protection. As the device with the lowest cost per data block, tapes are typically used for long-term archiving of data. Their mobility makes them easy to store in a remote location, which offers a safe, cost-effective data refuge in case of a disaster.

Still, relying solely on tapes is an increasingly risky proposition. The dependability of tape-based backup and restore time windows poses a real challenge and is the driving force behind the adoption of disk backup.

DISK EMERGES
While tapes used to be the only option for backing up data, disk products evolved and became an important part of the solution. Their predictable backup and recovery times make them more adaptable to businesses with longer operational hours, thus shorter time windows for data backup. Disk systems are also considered more reliable than tapes because they do not wear out from steady use, nor do they suffer from media errors or mechanical problems.

This immunity of disk systems against media failures is gained through the use of redundant-array-of-independent-disks technology. This technique backs up data in several stripes either on the same disk or on multiple disks. With this redundant method, backup data is protected from any data mishaps on the disks that would have caused a backup or restore delay.

Leveraging BAR with dual systems

Typically, the restore capability of a backup, archive and restore (BAR) solution is rarely utilized—it needs to be invoked only when lost data must be restored to a Teradata system. An innovative use of the restore function is to apply the BAR infrastructure to the implementation of a Teradata dual system. In a dual system, the backup operation normally stores the data from the primary system (A) to the secondary system (B).

Loading data to system B (locally or remotely) can be done numerous ways, including tape transportation and electronic transfer. The frequency of this "back up A, then restore B" operation would determine the recovery time and recovery point objectives for the dual system.

This simple first step to a Teradata dual active environment will enhance the BAR investment and improve the availability of overall Teradata solutions.

—J.D.

Two industry approaches used to back up data to disk-based storage are:
Direct to disk. Standard disk files that directly store data
VTL. Disk-based backup and restore appliance that emulates open systems tape libraries and easily fits into existing tape backup environment without changing current operations or infrastructure

The preferred BAR system configuration creates a copy of the data on a backup-to-disk device, then archives that data to tape with encryption for security (see figure 1 above). Typically, two copies of the data are kept on the disk backup devices, and two others are on tape for archival purposes. This approach achieves the best level of protection for users and can reliably meet aggressive performance goals such as a five- to six-hour backup and a 10- to 12-hour restore.

BAR for Teradata data warehousing
Teradata's massively parallel processing (MPP) architecture puts unique demands on a BAR solution. With MPP, multiple workloads are run in parallel on multiple nodes. This high-performance characteristic and high-speed data flow enable Teradata to quickly query large data tables. Because Teradata tables are hashed across all nodes and storage devices in the system, the parallel architecture puts each element of the BAR solution in the critical path to full performance.

Teradata offers a BAR solution strategy that makes BAR tasks easier, faster and safer. Teradata's BAR solutions are an effective way to leverage the performance and reliability of the Teradata data warehouse for protection of key data to ensure its safety and availability, and to meet backup and restore window needs. The scalable, packaged solution is optimized for high performance by leveraging the MPP approach and therefore minimizes backup and restore windows. Additional security is enabled with the safeguarding of data encryption added to disk and tape backup capabilities.

Putting BAR together
Management of the BAR solution leverages industry-standard backup application software that provides ease of use, operational performance and compatibility with data center operations.

Each of the three BAR management software alternatives (see figure 2, above) provides scheduling and management of backup and restore operations, control and monitoring of disk and tape, and movement of data between the Teradata system and BAR solution. The selected backup application for management links through a software access module called the Teradata Extension (see figure 3) to the Teradata Database nodes.

Figure 2: Teradata BAR integrated solution
enlarge
The nodes in the Teradata data warehouse are networked to the BAR framework servers running the BAR software (extension and application) to support disk and tape backup devices.

Teradata has forged partnerships with six technology suppliers to provide a fully integrated set of industry-leading, matched and certified components. These Teradata BAR elements represent best practices for backup and recovery configurations and deliver the best return on investment (ROI) for companies. Teradata integrates and certifies these components into a complete, end-to-end solution that includes one-stop implementation and support services.

Because tapes often contain sensitive data and are inherently portable (and therefore prone to misplacement or loss when moved off-site), backups that are stored on tapes are especially vulnerable to security breaches. Teradata Defiance BAR Encryption software protects this tape-based data by effectively providing a strong encryption capability. Through a selection of industry-standard encryption algorithms and the secure management of the location where encryption keys are stored and maintained, the backup tapes are well protected. Companies can be assured that their data, so crucial to the success of their business, is safely backed up and stored.

Taking control through BAR
Backing up and restoring data are vital functions for organizations that understand the importance of real-time data to help drive business success. Active data warehouses are a key element in business operations as leaders use this data for near real-time decision making.

For this reason, and to comply with numerous regulatory requirements, backed-up data must be kept available, safe and readily accessible. With Teradata BAR solutions, data can be quickly backed up and safely restored, thus enabling the business to continue to rely on its data for innovative decision making. T

Success is not fleeting

A large financial institution in the United Kingdom is a long-time Teradata user. The organization started with a modest-sized Teradata system that provided decision-support workloads for back-office business workers without any real availability needs. These workers met their initial backup needs with a mainframe solution in which all backup and restore operations were performed through channel connections to mainframe tape devices.

Over the years, the company's Teradata system became more mission-critical and hosted an increasing amount of business data. Yet the backup, archive and restore (BAR) strategy remained the same and the risk around data protection silently grew. A number of challenges demanding change were identified, such as enhanced recovery-time-objective and recovery-point-objective parameters and channel capacity shortfalls.

Situation
The mainframe approach required 39 hours to back up all important data. Yet the company had no more than 12 hours available, a time span that was predicted to shrink during the next 12 months.

Resolution
The financial institution implemented the Teradata BAR solution and achieved its goals, which included:
> Dramatically improving backup and restore performance
> Reducing costs by eliminating use of the mainframe infrastructure
> Putting management of the data protection directly in the hands of the data warehouse staff
> Building a scalable BAR approach aligned with Teradata for growth

The new BAR solution provided backup of all the company's data, including 14TB of user data space, in a three-hour window. Now the company can perform all backups in an acceptable window of time. Just as important, it has removed the risks associated with not protecting all of its important business and customer data.

—R.L.

Richard Leatherman is a former Teradata employee who now works as a freelance and technical writer in San Diego.

Jim Dietz, Teradata platform marketing manager, has more than 12 years of experience with Teradata in developing and marketing data warehouse solutions.

Teradata Magazine-December 2007

More Applied Solutions

Related Links

Reference Library

Get complete access to Teradata articles and white papers specific to your area of interest by selecting a category below. Reference Library
Search our library:


Protegrity

Teradata.com | About Us | Contact Us | Media Kit | Subscribe | Privacy/Legal | RSS
Copyright © 2008 Teradata Corporation. All rights reserved.