Knowledge and Data Management White Papers
Incorporating Site-Level Knowledge to Extract Structured Data From Web Forums
Overview Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to both complex page layout designs and unrestricted user created posts. This paper, studies the problem of structured data extraction from various web forum sites. The target is to find a solution as general as possible to extract structured data, such as post title, post author, post time, and post content from any forum site. In contrast to most existing information extraction methods, which only lever-age the knowledge inside an individual page, the paper incorporates both page-level and site-level knowledge and employ Markov Logic Networks (MLNs) to effectively integrate all useful evidence by learning their importance automatically.
| Publisher | Microsoft | File Format | |
|---|---|---|---|
| Date Published | April 2009 | ||
| Format | White Papers | ||
| Topics | |||
Accelerating Enterprise Data Governance Part 1
In the first of this series of three white papers, Mike Ferguson of Intelligent Business Strategies defines what data governance is and then looks at the requirements that need to...
Data Governance for Master Data Management and Beyond
There is growing interest on behalf of both data management professionals and senior business managers to understand the motivations, mechanics, and benefits of instituting data governance within an organization. This...
Getting Started with Master Data Management
Master data management forms part of an overall enterprise governance program that aims to establish trusted data throughout the enterprise. This white paper from Mike Ferguson of Intelligent Business Strategies...
Five Steps to More Valuable Enterprise Data
Companies worldwide struggle with inconsistent, inaccurate or unreliable data - and often don't know how to build more useful corporate information. This white paper examines a five-step method for...
The Journey Along an Information-Led Transformation
A shift is underway from simple automation to business optimization, and information is at the center of it. Information, when aligned with your business strategy, holds the key to driving profitable...



