Sitewide
RSS Feed:
|
By: Dr Fern Halper, Partner, Hurwitz & Associates Published: 9th July 2008 Copyright Hurwitz & Associates © 2008 |
Over the past several weeks, I've been briefed by a number of text analytics vendors and companies in partnership with text analytics vendors about syndicated services that make use of text analytics. Of course, syndicated services such as brand monitoring and news services that make use of this technology to some degree have been around for a while. But, how about some of the newer services?
An interesting example of this is illumin8, which is being offered by Elsevier, in partnership with Netbase. The service is targeted at R&D knowledge workers looking to solve technical and business problems. According to Elsevier, knowledge workers spend more time per week trying to discover relevant content relating to a particular problem area than analyzing that information (5.5 hours/week accessing vs. 4.7 hours/week analyzing). These workers are usually using a google-like search engine. I think everyone can agree that the google-like search engine is not ideal for research purposes, so I won't belabor the point here. In the case of the R&D knowledge worker, often one goal is to gather information relating to a particular problem, finding products that solve that problem, as well as understanding the approach used to solve the problem.
Elsevier has aggregated 5 billion business sources, 3 million full text articles, 33 million scientific records, and 21 million patents as the source of information for this service. Using the Netbase semantic index, Elsevier crawls through the information and extracts solutions that solve a problem and the approaches used to address a specific issue. In this way, R&D can help answer the following questions:
Below is a screen shot of what an end-user might see using this service. In this example, the user is interested in solving the problem of fuel efficiency in boats. He or she wants to see what products and approaches are out on the market to address this problem and what companies are providing these solutions.The user enters the topic (boats) and the benefit (fuel efficiency) in the search box and gets back information that is organized in a logical way. In this example, you can see that query returns information about products that address the problem as well as the companies that make the products, organizations that deal with energy, as well approaches to solving the problem (drag, stroke, etc). These are ranked. Users can then drill down on any of these areas to get snippets (and full text) associated with areas that he/she is interested in analyzing.
During the demo, I asked to see what would happen if we input "text analytics" as the problem space in the search box. I was actually impressed that what was returned was a good set of information about the players, organizations dealing with text analytics and other information about it. The service is not inexpensive, but it does cull a lot of information.
Syndicated Services
I believe that the number of syndicated services using text analytics will continue to grow. We're certainly seeing action in the brand monitoring space on this front. Vendors are also getting into the act. Expert System, for example, has its own service that is targeted at the auto industry. I believe that other vendors may get into the act if they determine that the financial benefits of offering syndicated services (as opposed to SaaS offerings) makes sense.
We are no longer accepting comments against this item. We suggest contacting the author directly.
9th July 2008: 'T. Benson' said:
Thank you, Dr. Halper, for an informative article. We at Cognition Technologies (www.cognition.com) agree that keyword/pattern-matching search engines are inadequate and the next wave of Semantic Natural Language Processing technologies needs to go mainstream.
All Natural Language Processing companies are trying to solve the “relevancy issue” – meaning, how can the relevancy of the text they are processing (either in-bound -> reading, or out-bound -> delivering text out) be improved for the end-user? This issue is addressed both by technological adjustments to existing solutions (e.g. more sophisticated mathematical and statistical algorithms) and/or by market segmentation, such as through dataset specialization (e.g. automotive, music, videos, medical, etc.). Cognition, on the other hand, addresses the relevancy challenge by changing the NLP paradigm through its Semantic Map, a unique and complete combination of linguistic elements to optimize semantic understanding:
Morphology
• The various forms of word, e.g. singular, plural, tense
Syntax
• The grammatical structure, e.g. verbs, nouns
Semantics
• Word and sentence meaning
Spelling
• The various ways words are spelled (or misspelled)
You can try Cognition on three datasets: Wikipedia, Medline, and a US caselaw dataset. They can all be accessed from www.cognition.com.
Thanks again for a great article.
The messages above were all contributed by IT-Director.com readers. Whilst we take care to remove any posts deemed inappropriate, we can take no responsibility for these comments. If you would like a comment removed please contact our editorial team.
Published by: IT Analysis Communications Ltd.
T: +44 (0)203 051 5760 | F: +44 (0)870 345 9922