Unified Information Access Blog

Welcome to Attivio's Unified Information Access Blog. Join us for discussions on topics ranging from enterprise search solutions, information access insights, Agile software development methodology to programming with Java. We hope you'll find the articles informative and participate in the discussions by leaving a comment.

Why are all the mega technology vendors suddenly so focused on next generation data warehousing and analytics?

Primarily, it's because Oracle wants to move down the information technology stack into what has traditionally been the realm of server and storage players. Today, more than a third of compute and storage demand is generated from traditional data warehousing and data mart implementations. HP, EMC, IBM and Sun/Oracle have been carving up this large and profitable market for the past 15 years, with each player generating billions in revenue.

In the last three months, however, Oracle has totally changed the game. Using IBM's age old bundling strategy, they have created a Data warehousing package (Exadata - dbase/Sun servers/Sun storage) that challenges the price and performance metrics that drive this market.

Despite outward appearances, this move is not offensive! It is, in fact, a defensive play against the success of Netezza Corporation, which has completely redefined the price/performance leadership benchmark for this space. The defensive strategy of bundling allows vendors to maximize performance on hardware blocks of compute and storage. This approach allows them to increase common parts and reduce bill of material (BOM) costs, thus producing less expensive offerings.

Too often getting access to all the relevant business information we need has forced us to undertake a journey across multiple sources, using different technologies to discover what we need - and often actually settling for less than the complete picture. Until recently there hasn't been a unified source or a single access method for finding everything.

In his recent white paper, "Beyond the Data Warehouse: A Unified Store for Data and Content," data warehousing expert Dr. Barry Devlin points out that legacy technologies forced information into content and data silos. According to Dr. Devlin, "The use of two distinct words, ‘data' and ‘content' is unfortunate, since both are the same concept-information. Content is softer information, while data is harder; two terms at opposite ends of a continuum. At the softer end, information exists as commonly used and interpreted by humans-documents, images, etc. Hard information is the structured records and fields suitable for logical and numerical computer processing, for example, in operational systems.... placed rigorously in defined fields, with only certain values allowed."

The following is an excerpt from a new whitepaper “Beyond the Data Warehouse: A Unified Store for Data and Content” by Dr. Barry Devlin, one of the foremost authorities on business insight and data warehousing.

Content, or soft information, has always been of interest to the business in a wide range of processes, from marketing to executive decision-making. The explosion in volume and variety of soft information driven, in particular, by the Internet has sharpened that interest. However, with years of experience in business intelligence and data warehousing behind them, many users are clear that what they really need is an integrated view of soft information with the harder data already available in the warehouse. While soft information on its own does have value, the real business advantage will come from exploring the entire set of hard and soft information free from the limitations of the pervasive, predefined data structures of hard information.

Content and data are closely related. Data is what IT has made of content in order to control and process it in the structured world of computers. Content as simple as “I’ll buy that red car” is transformed into a purchase transaction, with defined fields, allowed value ranges and keys normalized in a database. The use of two distinct words, “data” and “content” is unfortunate, since both are the same concept — information. Content is softer information, while data is harder; two terms at opposite ends of a continuum. At the softer end, information exists as commonly used and interpreted by humans — documents, images, etc. Hard information is the structured records and fields suitable for logical and numerical computer processing, for example, in operational systems.

More Articles...

Page 1 of 6

Start
Prev
1

SUBSCRIBE

blue-rss-icon.png

Enter your email address:

 

Connect with Attivio

ico_attivio_spiral.jpg facebook-Icon.png twitter-icon.png

Contact Us:
+1.857.226.5040

Recent Posts

UIA: Boosting Customer Loyalty (Or How NOT to Annoy Your Customers)

I am in Tel Aviv enjoying a beautiful summer afternoon with a new customer when my mood is altered by a credit card snafu. Back in the states, a wary...
Read More...

Clash of the Titans: Price and Performance for Data Warehousing and Analytics

Why are all the mega technology vendors suddenly so focused on next generation data warehousing and analytics? Primarily, it's because Oracle wants to...
Read More...

Enriching Unified Information Stores with Text Analytics

Too often getting access to all the relevant business information we need has forced us to undertake a journey across multiple sources, using different...
Read More...
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8