Tools for Text Analysis
Text analysis is a process in which semantic and other information can be collated so that it can be analyzed in a quantitative manner to arrive at some decisions. This is a process that has been used for years in traditional market research where open ended questions are collated, a code list is made and then the various response forms are coded before data entry can be done for analysis.
Depending on the depth to which you analyze text, the data can throw up some basic and some very insightful information that can be derived through patterns and trends. The techniques that are used in text analysis include linguistic, statistical and machine learning. It also involves retrieval of data from various sources on the Internet, lexical analysis, pattern recognition, annotations and other such data mining techniques, albeit in the language arena. There is also an element of text categorization, text clustering, concept extraction, and sentiment analysis and document summarization too.
There are various kinds of online, offline, free and paid text analysis tools that are fairly easy to use. These have been listed below.
Free Text Analysis Tools
Some free and open source text mining tools allow you to understand the kind of comments that are being made on your blog, microblog, social networking page or elsewhere on the Internet.
- GATE – This is an open source toolkit that delivers the results in a graphical environment
- INTEXT – A DOS version of TextQuest, this text mining tool has been in the public domain for more than 7 years now.
- Open Calais – Another open-source text analyzer that includes semantic functionality and can search and analyze text within a blog, content management system websites, applications or more.
- LingPipe – Part of the suite of Java libraries this tool is free and can be used for a variety of linguistic analysis.
- RapidMiner Text Mining – A great source that allows you to check out the comments on your networking page without having to view them manually.
- S-EM (Spy-EM) – A tool that helps in text classification and dividing them into positive, negative and neutral responses based on learning.
- The Semantic Indexing Project – An open source tool again that includes semantic analysis and search applications too.
- Text Analyzer – This online tool is extremely easy to use. All that you need to do is to enter the text that you want analyzed to give you a detailed analysis on the same web page in seconds. Data retrieval is not part of this text analysis tool, though.
- Tagul – Gorgeous tag clouds responsible for the above tag cloud of this post.
Commercial Text Analysis Tools
Some of the text analysis tools that you may want to consider if you are more serious about the depth of analysis that you perform on your site, social media pages and more have been detailed below.
- ActivePoint – A tool that offers natural language processing (NLP) and categories text based on contextual search.
- Alceste – An easy to use software that allows automatic analysis of all kinds of text.
- ClaraBridge – Text mining software for businesses.
- Crossminder – A good text analysis tool that used natural language processing and various other text analytics techniques.
- Eaagle text mining software – A tool that is used by many due to the speed with which it structures large volumes of data to give direction.
- ClearForest – This solution gives meaning to unstructured information by using data mining technologies.
- SPSS LexiQuest – The advanced text analysis tool from SPSS.
- Expert System – A tool that uses the proprietary COGITO platform and creates clusters of text that can be interpreted.
- Analyze Words – An intelligent software that can analyze the personality of a website, a brand or even a person based on the words that are used. It categorizes words and content into upbeat, worried, angry, depressed, arrogant, personable, sensory and more. Basically the three dimensions that are analyzed are emotional style, social style and thinking style.
- Lexalytics – Transforms unstructured text to structured information, almost magically.
- Lextek Profiling Engine – A tool that classifies, routes and filters electronic text based on user defined profile.
- Recommind MindServer – A tool that uses PLSA (Probablistic Latent Semantic Analysis) for accurate text retrieval and further classification.
- Attensity – A software that goes a step further and classifies text based on “who”, “what”, “where”, “when” and “why” facts.
- SAS Text Miner – An advanced and reliable tool that is used by many market researchers and web analyzers for text analysis.
- DiscoverText – A tool used by many market research and web analytic companies to create text analysis solutions.
- Xanalys Indexer, an information extraction and data mining library aimed at extracting entities, and particularly the relationships between them, from plain text.
- Wordstat – An easy and yet powerful tool that helps analysis textual information in responses, open-ended questions and interviews.
- OpinionEQ – A solution based on advanced semantic and linguistic research focusing on the problem of collecting, interpreting, and structuring both Web and real time communications.






One of the most common grouse of companies is the relative lack of data that is available for social media to assess the effectiveness of the dollar spent on the same. However, there are now various tools that exist that can help in collating data and then understanding the manner in which social media is being consumed.
