• Skip Navigation |
  • Accessibility 
Virtual Worlds Forum, 6th - 8th October 2008 @ London
IT-Director.com Logo
  • Is Anticipation Management a Game Changer?
  • Office 2.0 unconference
  • Non-accessible websites will be costly
 

Main navigation - go to a section of this website:

  • ARCHIVE
  • PAPERS
  • RESEARCH
  • EVENTS
  • NEWSWIRE
  • BLOGS
  • POLLS

  

Member Login | Become a Member

 
DOMAINS
  • Enterprise
  • SME
  • Business Issues
  • Technology
  • Services
  • Channels
FEATURED EVENTS
  • Virtual Worlds Forum Europe 2008
    6th October - 8th October
    London, United Kingdom
POPULAR PAPERS
  • The New Europe by Quocirca
TRANSLATE PAGE



USEFUL LINKS
  • Last 7 Days
  • Archives
  • Market Place
  • Top Articles
  • Hall of Flame
INTERACT
  • Advertising
  • About IT-Director.com
  • Site Feedback
  • Newsletters
  • Contact Us
  • Registration
CONTENT FEED

Sitewide
RSS Feed:

RSS Icon

What is RSS?

RANDOM QUOTE
Famous Slights - "His face is livid gaunt; his whole body his breath is green with gall; his tongue drips poison." - John Quincy Adams

ADVERTISEMENT
Blogs > Fern Halper

Syndicating Text Analytics

Fern Halper By: Dr Fern Halper, Partner, Hurwitz & Associates
Published: 9th July 2008
Copyright Hurwitz & Associates © 2008
Logo for Hurwitz & Associates
Page Tools

Request Reprints
Tell A Friend
Contact Author

Recent Blog Posts
  • My two cents on the 2008 Text Analytics Summit
  • Text Analytics and the Predictive Enterprise
  • Four Questions about Innovations in Analysis
  • Four questions about BI Innovation
  • Customer Experience Intelligence and Text Analytics
  • What's Next For Text Analytics?
Blog Archive
  • June, 2008
  • May, 2008
  • April, 2008
  • March, 2008
  • February, 2008
  • January, 2008
  • December, 2007
  • November, 2007
Syndication
  • Delicious Icon Delicious
  • Digg Icon Digg
  • reddit Icon reddit
  • Facebook Icon Facebook
  • StumbleUpon Icon StumbleUpon

Over the past several weeks, I've been briefed by a number of text analytics vendors and companies in partnership with text analytics vendors about syndicated services that make use of text analytics. Of course, syndicated services such as brand monitoring and news services that make use of this technology to some degree have been around for a while. But, how about some of the newer services?

An interesting example of this is illumin8, which is being offered by Elsevier, in partnership with Netbase. The service is targeted at R&D knowledge workers looking to solve technical and business problems. According to Elsevier, knowledge workers spend more time per week trying to discover relevant content relating to a particular problem area than analyzing that information (5.5 hours/week accessing vs. 4.7 hours/week analyzing). These workers are usually using a google-like search engine. I think everyone can agree that the google-like search engine is not ideal for research purposes, so I won't belabor the point here. In the case of the R&D knowledge worker, often one goal is to gather information relating to a particular problem, finding products that solve that problem, as well as understanding the approach used to solve the problem.

Elsevier has aggregated 5 billion business sources, 3 million full text articles, 33 million scientific records, and 21 million patents as the source of information for this service. Using the Netbase semantic index, Elsevier crawls through the information and extracts solutions that solve a problem and the approaches used to address a specific issue. In this way, R&D can help answer the following questions:

  • Solutions that exist to solve a problem
  • New applications and processes that might exist to help solve a problem
  • Information about what competitors are doing in the particular problem space
  • What the experts are saying about a particular problem area

Below is a screen shot of what an end-user might see using this service. In this example, the user is interested in solving the problem of fuel efficiency in boats. He or she wants to see what products and approaches are out on the market to address this problem and what companies are providing these solutions.The user enters the topic (boats) and the benefit (fuel efficiency) in the search box and gets back information that is organized in a logical way. In this example, you can see that query returns information about products that address the problem as well as the companies that make the products, organizations that deal with energy, as well approaches to solving the problem (drag, stroke, etc). These are ranked. Users can then drill down on any of these areas to get snippets (and full text) associated with areas that he/she is interested in analyzing.

During the demo, I asked to see what would happen if we input "text analytics" as the problem space in the search box. I was actually impressed that what was returned was a good set of information about the players, organizations dealing with text analytics and other information about it. The service is not inexpensive, but it does cull a lot of information.

Syndicated Services
I believe that the number of syndicated services using text analytics will continue to grow. We're certainly seeing action in the brand monitoring space on this front. Vendors are also getting into the act. Expert System, for example, has its own service that is targeted at the auto industry. I believe that other vendors may get into the act if they determine that the financial benefits of offering syndicated services (as opposed to SaaS offerings) makes sense.

Reader Comments

We are no longer accepting comments against this item. We suggest contacting the author directly.

9th July 2008: 'T. Benson' said:

Thank you, Dr. Halper, for an informative article. We at Cognition Technologies (www.cognition.com) agree that keyword/pattern-matching search engines are inadequate and the next wave of Semantic Natural Language Processing technologies needs to go mainstream.

All Natural Language Processing companies are trying to solve the “relevancy issue” – meaning, how can the relevancy of the text they are processing (either in-bound -> reading, or out-bound -> delivering text out) be improved for the end-user? This issue is addressed both by technological adjustments to existing solutions (e.g. more sophisticated mathematical and statistical algorithms) and/or by market segmentation, such as through dataset specialization (e.g. automotive, music, videos, medical, etc.). Cognition, on the other hand, addresses the relevancy challenge by changing the NLP paradigm through its Semantic Map, a unique and complete combination of linguistic elements to optimize semantic understanding:

Morphology
• The various forms of word, e.g. singular, plural, tense

Syntax
• The grammatical structure, e.g. verbs, nouns

Semantics
• Word and sentence meaning

Spelling
• The various ways words are spelled (or misspelled)

You can try Cognition on three datasets: Wikipedia, Medline, and a US caselaw dataset. They can all be accessed from www.cognition.com.

Thanks again for a great article.

Reply to T. Benson?

The messages above were all contributed by IT-Director.com readers. Whilst we take care to remove any posts deemed inappropriate, we can take no responsibility for these comments. If you would like a comment removed please contact our editorial team.

  • Site Map
  • | Terms of Use
  • | Privacy

Published by: IT Analysis Communications Ltd.
T: +44 (0)203 051 5760 | F: +44 (0)870 345 9922