Blog: Dan E. Linstedt« Step closer to nanotech hardware | Main | Master Data Management - Just Another Mart? » Active and Right-Time Data Warehousing DefinedRecently I've been asked about Active Data Warehousing, and (Real-time) Right Time Data Warehousing, what do these mean to the enterprise? In this short blog entry, I offer my opinion on the definition of each. In future entries I will define the basics of building one, the questions to ask, and potential value to the enterprise. I lead an effort in Active Data Warehousing, and Right Time Data Warehousing for Myers-Holum, Inc. We have best practices surrounding these efforts, and will soon offer tips and tricks for free on our site. Too often, we are confused by marketing literature and vendor hype. I'm going to set the line and offer my opinion in DEFINING just what an Active Data Warehouse is, and just what a Right Time Data Warehouse is. Are they different? Yes, why? Well, we'll get to that in a minute. For now, this is the way I define each: Active Data Warehousing (ADW) Right-Time Data Warehouse (RTDW) (Not REAL-TIME) RTDW: If the business needs to answer a question at the end of the day, every day - then a RTDW would refresh on a daily basis. If the business needs to answer a question every 30 minutes, then an RTDW would refresh every 30 minutes - assuming the data is available. Is an RTDW an Active Data Warehouse? Is an ADW also an RTDW? So then, what is a Real-Time Data Warehouse? Is the business definition of Real-Time different than Right-Time? Please don't mince words when going forward. Develop best-practices, metadata definitions, and terminology standards (business metadata). Too often our businesses are confused by all the vendor hype and marketing material that "throws a term in" just because it sounds cool. I'd love to hear your thoughts on this topic, even if you disagree. How do you define Real-Time, Right-Time, and Active DW? Thanks, |
Comments
Another aspect of Active Data Warehousing, in my opinion, is the ability of the warehouse environment to create activity in the overall transaction environment. Rather than just receiving and integrating transactions from other systems as they happen, the result of that incoming transaction needs to be the possibility of creating a new outgoing transaction destined to initiate some activity elsewhere in the enterprise. In this respect, I'd argue that active warehousing is more fringe than just transaction speed (versus batch) updates. That being said, I'm still wrapping the pragmatic part of my brain around some of the ideas.
Posted by: Paul Boal | February 7, 2006 12:53 AM
Hi Paul,
I completely agree, I forgot to mention that indeed: Active Warehousing feeds back to transactional systems, and in doing so CHANGES the definition of what a warehouse is... my next entry will discuss the nature of "just what does Data WHAREHOUSE mean to you?"
Great comment! Thanks,
Dan L
Posted by: Dan Linstedt | February 7, 2006 8:08 AM
I would be interested in the implementation challenges surrounding Righ-time data warehouseing. For example the typical batch approach to ETL often places a burden on the warehouse which may prevent extracts from running. How can this be managed if the refresh cycle is 10 minutes?
Posted by: Darrin D | September 30, 2006 6:26 PM