| Publisher | Microsoft | ||
|---|---|---|---|
| Format | 831.8KB PDF | Date added | 24 Apr 2009 |
| Topics | Knowledge and Data Management, Data Acquisition - ETL | ||
| Downloads | 0 | ||
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to both complex page layout designs and unrestricted user created posts. This paper, studies the problem of structured data extraction from various web forum sites. The target is to find a solution as general as possible to extract structured data, such as post title, post author, post time, and post content from any forum site. In contrast to most existing information extraction methods, which only lever-age the knowledge inside an individual page, the paper incorporates both page-level and site-level knowledge and employ Markov Logic Networks (MLNs) to effectively integrate all useful evidence by learning their importance automatically.
Related white papers
The Journey Along an Information-Led Transformation
A shift is underway from simple automation to business optimization, and information is at the center of it. Information, when aligned with your business strategy, holds the key to driving profitable...
The new information agenda:Do you have one?
The lack of trusted information — information that is accurate, timely and relevant— is on the minds of CEOs and senior executives around the world. a paradigm shift from siloed...
Best Practices for Translating Customer Satisfaction into Revenue
Today's support organisations are focused on two top-level metrics: financial results and customer satisfaction. For most, it's easy to track financial performance, but customer satisfaction is akin to speaking a...
Support Strategies: Customer Experience Management
Customer experience is the most powerful tool available today for distinguishing your company from competitors ? each contact with the customer offers an opportunity for strengthening your relationships by delivering...
3 Strategies for Reducing IT Support Costs
As companies brace for more bumps in the economic downturn, many organisations are indiscriminately cutting costs. To ensure a seamless transition into the post-recession market, however, slashing and burning is...
Forrester Strategies for Assessing IT Business Satisfaction
If you aren't assessing customer satisfaction you are overlooking a potential goldmine. This valuable data is crucial to creating a successful IT strategy. But where do you start? This new...
Realising the benefits of going green
Cross over to greener communications with improved data accuracy. Many organisations have processes in place to improve the quality of their contact data to address business drivers, such as cost reduction...



