June 02, 2014 | SAN DIEGO, California
Newest Teradata Portfolio for Hadoop Delivers Fastest Path to Data Lake Deployment
Portfolio simplifies Hadoop deployment and management with specialized software, appliance, consulting services, training, and customer support
Teradata (NYSE: TDC), the analytic data platforms, marketing applications, and services company, today introduced Teradata Portfolio for Hadoop 2 to reduce the risk, cost, and complexity of Hadoop deployment and management. The comprehensive portfolio helps organizations address the technical and business challenges of leveraging diverse data stored in Apache™ Hadoop®. Now customers can realize the business value of a data lake with value-added software, an appliance, consulting services, training, and customer support all from a single vendor, Teradata.
The challenges facing today’s modern organizations seeking insights from existing and big data sources are varied – they lack a comprehensive data strategy, have multiple disconnected technologies, are forced to hire scarce specialists with expensive skillsets, or lack expertise with big data or analytics. Without a trusted advisor to offer guidance, enterprises can experience high costs, risks, and the unnecessary consumption of time and effort before any insight is gained from big data.
“The Teradata Portfolio for Hadoop 2 supports the fastest path to business value by leveraging the ‘store-everything approach’ of the data lake,” said Scott Gnau, president, Teradata Labs. “We have taken the complexity and risk out of the technology deployment, allowing organizations to focus on high-value activities.”
Teradata Portfolio for Hadoop, complete with four major components, is a flexible and comprehensive solution.
Teradata Open Distribution for Hadoop (TDH) 2.1 – An enhanced software platform, Teradata Open Distribution for Hadoop is built upon Hortonworks Data Platform 2.1 and offers a comprehensive set of vital Teradata software components to make Hadoop technology more enterprise ready, including enhancements for:
- High Availability and Disaster Recovery
- Performance and Scalability
- Data Transformation and Integration
- Data Security
- Setup and Installation
- Monitoring and Manageability
These components radically simplify the operation and accelerate time-to-production, while enhancing Hadoop’s reliability, manageability, connectivity, and ease of use. Teradata Open Distribution for Hadoop leverages core Apache Hadoop 2 components built by Hortonworks, including Apache™ Hadoop® YARN, a next-generation framework for Hadoop data processing.
Teradata Appliance for Hadoop - The enhanced Teradata Appliance for Hadoop, with Teradata Open Distribution for Hadoop, is the first to run on the Hortonworks Data Platform 2.1. The appliance is delivered ready-to-run and optimized for enterprise-class data storage and management. The appliance can scale from 144 terabytes to over 98 petabytes of data to meet the customers’ growth needs. The Teradata Appliance for Hadoop offers fast performance with the latest generation of Intel technology, and the combination of InfiniBand fabric-based hardware and Teradata BYNET®V5 software with scaling and failover capability.
Teradata Consulting Services – The enhanced Teradata Consulting Services now offers expertise on hardening of security and safeguarding privacy, an assessment to determine the best-fit data platform for deployment, and the most effective way to integrate data from various sources. Teradata Consulting Services takes a comprehensive approach to big data by offering help to “identify and advise” and “architect and implement.” In addition, they provide managed services for ongoing operations, and rigorous, in-depth training on deployment and management.
Teradata Customer Services – Unique in the industry, Teradata Customer Services has expanded its services and now supports and maintains big data and Hadoop environments. Teradata helps customers realize the benefits of big data analytics within a Teradata® Unified Data Architecture™. Teradata is dedicated to developing and supporting large-scale production deployments and will provide a single point of contact for customers for all hardware and software, with the backing of Hortonworks. The Teradata Customer Services group also supports Hadoop offered on commodity servers, and software-only implementations of the Hortonworks Data Platform.
“With the explosion of new data types, enterprises are turning to Apache Hadoop to create analytic applications that derive actionable insights from previously uncaptured data,” said Rob Bearden, chief executive officer, Hortonworks. “With the support of Hortonworks Data Platform 2.1, Teradata is helping to usher in the next-generation enterprise architecture with a portfolio that incorporates the latest Apache Hadoop innovations, including YARN and Tez, and effectively manages multi-structured data.”
A large, diversified U.S. insurance provider wanted to change their business model and start analyzing all driving behaviors of the vehicle fleet driver that it insures. This change would enable the provider to understand the appropriate premium to charge riskier drivers. With the Teradata Portfolio for Hadoop, the insurance provider captured multi-structured telematics data from vehicle sensors monitoring customer-driving habits. The large volumes of data were ingested in an Apache Hadoop data lake in real-time. The data came from multiple data sources and was ingested at very high rates, because of the flexibility of the underlying Hadoop Distributed File System. The data was then refined and converted to a standard format, combined with GPS data, and sessionized to create the trip record and a risk rating. The analysis was done by multi-level calculations and aggregations done with MapReduce, Hive queries and user defined functions. Because of the scalability of the Hadoop platform, there was never a need to discard the data, which is useful for future ad-hoc analytics.
Organizations from many industries can benefit from Teradata’s technology. “Cardinal Health is driven to improved patient care through analytic innovation,” said Neeraj Kumar, vice president, Information Management and Analytics, Cardinal Health. “Teradata has helped us gain analytic insights and as a result we have improved our supply chain and enhanced patient care, while realizing a significant business value.”
Teradata continues to partner with innovative software vendors that offer additional productivity and automation tools, which support Hadoop data lake environments with data integration, governance, and security.
- Informatica’s data integration solutions provide increased developer productivity using a flexible data management architecture, an agile development methodology, and existing skill sets to optimize data integration processes. Specifically, Informatica PowerCenter Big Data Edition provides developers the ability to leverage their expertise with Informatica’s flagship product, PowerCenter, and improve developer productivity by five times.
- Protegrity’s Big Data Protector provides comprehensive data protection, allowing users to have file, and field-level data security for sensitive data ranging from privacy to PCI.
- Revelytix Loom provides dynamic dataset management automatically calculated data lineage for all transformations, and Activescan entity resolution, which automatically detects, parses, and profiles any new HDFS files.
The Teradata Portfolio for Hadoop 2 will be available in the third quarter of 2014 with partner support.
- Teradata Portfolio for Hadoop
- Cardinal Health case study
About TeradataTeradata (NYSE: TDC) helps companies get more value from data than any other company. Teradata’s leading portfolio of big data analytic solutions, integrated marketing applications, and services can help organizations gain a sustainable competitive advantage with data. Visit teradata.com.
Teradata and the Teradata logo are trademarks or registered trademarks of Teradata Corporation and/or its affiliates in the U.S. and worldwide.