File Size: 26,245kb Last Update: 6/16/2008 Release Date: 6/16/2008 Introduction Developing a global data warehouse has become a strategic business direction for success in the international market place. Among many technologies available today for globalization, Unicode is a core technology to develop and implement a universal language solution. However, many Teradata customers may not implement Unicode on the existing systems, as the customers have already implemented the Teradata Latin server character set even for non-Latin1 languages including Chinese and Korean. Those customers start experiencing gaps between the legacy data and Unicode data as well as gaps between the existing ANSI applications and Unicode applications. Those gaps will not allow the customer to access the leading edge Teradata Unicode applications such as TRM 6.x and DCM 3.2. As of today with V2R6.x/TTU 8.x, migration from the Teradata Latin to Unicode may not be an easy task. Here are some limitations in the current Teradata system: - ALTER TABLE does not support changing the server character set for character data types
- The TRANSLATE() function only works with Japanese
The Unicode tool kit has been developed for those customers who migrate the Latin server character set to Unicode and build a global data warehouse based on a universal character set Unicode. The Unicode tool kit consists of the following components: 1) User Defined Functions (UDF) for the migration without import/export 2) cConv for the migration with import/export 3) CMigration and cScript for the charset migration on the same system 4) Site-defined session character sets compatible to Windows and IBM code pages 5) Access Modules for translation and validation to load code page data via the UTF8 session in Fastload/Multiload 6) Unicode Test Data
|
|
|
|