Videos and Webinars

Video length is 2:14.

What Is Text Analytics Toolbox?

Text Analytics Toolbox™ provides tools for extracting text from documents, preprocessing raw text, visualizing text, and performing machine learning on text data. The typical workflow begins by importing text data from documents, such as PDF and Microsoft® Word® files, and then extracting meaningful words from the data. Once text is preprocessed, you can interact with your data in a number of ways, including converting the text into a numeric representation and visualizing the text with word clouds or scatter plots. 

Features created with Text Analytics Toolbox can also be combined with features from other data sources to build machine learning models that take advantage of textual, numeric, audio, and other types of data. You can import pretrained word-embedding models, such as those available in word2vec, fastText, and GloVe formats, to map the words in your dataset to their corresponding word vectors. You can also perform topic modeling and dimensionality reduction with machine learning algorithms such as latent Dirichlet allocation (LDA) and latent semantic analysis (LSA).

To get started transforming large sets of text data into meaningful insight, download a free trial of Text Analytics Toolbox. 

Text Analytics Toolbox provides tools for extracting text from documents, preprocessing raw text, visualizing text, and performing machine learning on text data.  

You can use Text Analytics Toolbox to analyze data from sources such as maintenance reports, operations logs, financial documents, and web and social media sources.

You can extract raw text from a variety of sources, including Microsoft Word, Microsoft Excel, and PDF files. Word clouds show the relative frequency of words, and interactive scatter plots help you understand the numeric relationships between words.
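As a minimal sketch of the extraction and word cloud steps, the snippet below writes a small plain-text file and reads it back with `extractFileText`, then plots a word cloud. The file contents are made-up sample data, and a plain-text file stands in here for a PDF or Word document. Text Analytics Toolbox is assumed to be installed.

```matlab
% Write a small, made-up sample report to a temporary text file.
filename = [tempname '.txt'];
fid = fopen(filename, 'w');
fprintf(fid, 'Hail damaged the roof. Hail also broke two windows.');
fclose(fid);

% Read the file contents into a string.
str = extractFileText(filename);

% Tokenize the text and show word frequencies as a word cloud.
documents = tokenizedDocument(str);
figure
wordcloud(documents);

% Clean up the temporary file.
delete(filename)
```

Words that occur more often, such as "Hail" in this sample, are drawn larger in the cloud.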

Text Analytics Toolbox provides functions for preprocessing raw text, such as removing common words and punctuation and tokenizing documents into individual words or phrases.
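The preprocessing steps described here could look roughly like the following. This is a sketch, not a recommended pipeline: the report strings are invented sample data, and the order of the steps is one reasonable choice among several.

```matlab
% Made-up sample maintenance reports.
rawText = [
    "The pump was leaking oil, and the seal had to be replaced."
    "Operator reported a loud noise from the motor; bearing replaced."];

documents = tokenizedDocument(rawText);   % split each string into tokens
documents = lower(documents);             % convert tokens to lowercase
documents = removeStopWords(documents);   % drop common words such as "the"
documents = erasePunctuation(documents);  % remove punctuation from tokens

disp(documents)
```

After these steps, each document holds only the lowercase content words, which is the form the numeric models expect as input.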

Once text is preprocessed, converting it to a numeric representation lets you run further analyses and visualizations to understand word frequencies, including:

  • Histograms to compare word counts
  • Bag-of-words and n-gram models for efficient visualization and computation
  • TF-IDF models for text mining and machine learning
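
The numeric representations listed above can be sketched with the toolbox functions `bagOfWords`, `bagOfNgrams`, and `tfidf`. The three short documents are invented for illustration and are assumed to be already preprocessed.

```matlab
% Made-up, already preprocessed sample documents.
documents = tokenizedDocument([
    "pump seal leaking oil"
    "motor bearing noise"
    "pump motor replaced"]);

% Bag-of-words model: one row of word counts per document.
bag = bagOfWords(documents);

% Most frequent words across all documents, returned as a table.
topWords = topkwords(bag, 3);
disp(topWords)

% Bag-of-n-grams model; the default n-gram length is 2 (bigrams).
bigrams = bagOfNgrams(documents);

% TF-IDF matrix: documents in rows, vocabulary words in columns.
M = tfidf(bag);
```

The word counts in `topWords` could be passed to a bar chart to compare frequencies, and `M` can serve as a feature matrix for a machine learning model.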

Statistics and machine learning algorithms can be used with text analytics to identify themes in documents through topic modeling, classify documents, and make predictions.

You can train machine learning models or use pretrained word-embedding models such as word2vec, fastText, and GloVe.

In this example, the latent Dirichlet allocation (LDA) algorithm is used to build a 60-topic model of storm reports to identify damage and weather patterns.
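
A topic model of this kind can be fitted with `fitlda`. The sketch below is not the code from the video: it uses four invented storm-report sentences and only 2 topics, because 60 topics need a far larger corpus than a toy example can provide.

```matlab
rng("default")   % make the random initialization repeatable

% Made-up sample storm reports.
documents = tokenizedDocument([
    "hail damaged roofs and cars"
    "large hail broke windows"
    "flood closed roads and bridges"
    "heavy rain caused flood on roads"]);
bag = bagOfWords(removeStopWords(documents));

% Fit an LDA model; the video uses 60 topics on a much larger data set.
numTopics = 2;
mdl = fitlda(bag, numTopics, 'Verbose', 0);

% Show the three highest-probability words of each topic.
for k = 1:numTopics
    disp(topkwords(mdl, 3, k))
end

% Topic mixture of each document: one row per document, one column per topic.
topicMix = mdl.DocumentTopicProbabilities;
```

With so little data the topics are not meaningful, so the point of the sketch is only the call sequence: preprocess, build a bag-of-words model, fit, then inspect the top words per topic.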

You can also use deep learning algorithms to build accurate classifiers when you have large sets of documents, and use parallel computing to speed up text processing and training.

For more information about Text Analytics Toolbox, see the product page, or choose a link below.

Related Products

  • Text Analytics Toolbox

Getting Started with Text Analytics in MATLAB (White Paper)

Documentation

© 1994-2022 The MathWorks, Inc.
