List of text mining software

Text mining computer programs are available from many commercial and open source companies and sources.

Commercial

  • Angoss – Angoss Text Analytics provides entity and theme extraction, topic categorization, sentiment analysis and document summarization capabilities via the embedded
  • AUTINDEX – is a commercial text mining software package based on sophisticated linguistics by IAI (Institute for Applied Information Sciences), Saarbrücken.
  • Autonomy – text mining, clustering and categorization software
  • Averbis – provides text analytics, clustering and categorization software, as well as terminology management and enterprise search
  • Citibeats – Language and data agnostic platform that provides a suite of text analysis modules based on graphs to extract insights
  • Basis Technology – provides a suite of text analysis modules to identify language, enable search in more than 20 languages, extract entities, and efficiently search for and translate entities.
  • DigitalMR – social media listening & text+image analytics tool for market research
  • Endeca Technologies – provides software to analyze and cluster unstructured text.
  • FICO Score – leading provider of analytics.
  • General Sentiment – Social Intelligence platform that uses natural language processing to discover affinities between the fans of brands with the fans of traditional television shows in social media. Stand alone text analytics to capture social knowledge base on billions of topics stored to 2004.
  • IBM LanguageWare – the IBM suite for text analytics (tools and Runtime).
  • IBM SPSS – provider of Modeler Premium (previously called IBM SPSS Modeler and IBM SPSS Text Analytics), which contains advanced NLP-based text analysis capabilities (multi-lingual sentiment, event and fact extraction), that can be used in conjunction with Predictive Modeling. Text Analytics for Surveys provides the ability to categorize survey responses using NLP-based capabilities for further analysis or reporting.
  • InfraNodus – text network analysis and visualization application. The software offers text mining and topic modeling functionalities, as well as structural analysis to identify the existing gaps in a discourse and generate new ideas.
  • Inxight – provider of text analytics, search, and unstructured visualization technologies. (Inxight was bought by Business Objects that was bought by SAP AG in 2008).
  • Language Computer Corporation – text extraction and analysis tools, available in multiple languages.
  • Lexalytics – provider of a text analytics engine used in Social Media Monitoring, Voice of Customer, Survey Analysis, and other applications. Salience Engine. The software provides the unique capability of merging the output of unstructured, text-based analysis with structured data to provide additional predictive variables for improved predictive models and association analysis.
  • Linguamatics – provider of natural language processing (NLP) based enterprise text mining and text analytics software, I2E, for high-value knowledge discovery and decision support.
  • Mathematica – provides built in tools for text alignment, pattern matching, clustering and semantic analysis. See Wolfram Language, the programming language of Mathematica.
  • MATLAB offers Text Analytics Toolbox for importing text data, converting it to numeric form for use in machine and deep learning, sentiment analysis and classification tasks.[1]
  • Medallia – offers one system of record for survey, social, text, written and online feedback.
  • NetOwl – suite of multilingual text and entity analytics products, including entity extraction, link and event extraction, sentiment analysis, geotagging, name translation, name matching, and identity resolution, among others.
  • RapidMiner with its Text Processing Extension – data and text mining software.
  • SAS – SAS Text Miner and Teragram; commercial text analytics, natural language processing, and taxonomy software used for Information Management.
  • Sketch Engine – a corpus manager and analysis software which providing creating text corpora from uploaded texts or the Web including part-of-speech tagging and lemmatization or detecting a particular website.[2]
  • Smartlogic – Semaphore; Content Intelligence platform containing commercial text analytics, natural language processing, rule-based classification, ontology/taxonomy modelling and information visualization software used for Information Management.
  • Sysomos – provider social media analytics software platform, including text analytics and sentiment analysis on online consumer conversations.
  • WordStat – Content analysis and text mining add-on module of QDA Miner for analyzing large amounts of text data.

Open source

  • Carrot2 – text and search results clustering framework.
  • GATE – general Architecture for Text Engineering, an open-source toolbox for natural language processing and language engineering.
  • Gensim – large-scale topic modelling and extraction of semantic information from unstructured text (Python).
  • Natural Language Toolkit (NLTK) – a suite of libraries and programs for symbolic and statistical natural language processing (NLP) for the Python programming language.
  • OpenNLP – natural language processing.
  • Orange with its text mining add-on.
  • Stanbol – an open source text mining engine targeted at semantic content management.
  • The programming language R provides a framework for text mining applications in the package tm.[3] The Natural Language Processing task view contains tm and other text mining library packages.[4]
  • The KNIME Text Processing extension.
  • The PLOS Text Mining Collection.[5]
  • Voyant Tools – a web-based text analysis environment, created as a scholarly project.
  • spaCy – open-source Natural Language Processing library for Python
  • KH Coder – for Quantitative Content Analysis or Text Mining

References

  1. "Text Analytics Toolbox". mathworks.com. Retrieved 2019-07-10.
  2. "Text analysis with Sketch Engine". Sketch Engine. LEXICAL COMPUTING CZ s.r.o. 14 December 2017. Retrieved 17 January 2018.
  3. "Introduction to the tm Package: Text Mining in R" (PDF).
  4. Wild, Fridolin (February 20, 2020). "CRAN Task View: Natural Language Processing" via cran.rstudio.com. Cite journal requires |journal= (help)
  5. "Table of Contents: Text Mining". PLOS. doi:10.1371/issue.pcol.v01.i14 (inactive 2020-06-01). Cite journal requires |journal= (help)
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.