Welcome to the Text Analysis Project Wiki ¶
Table of Content ¶
- SubmittingPatches, How to submitting patches
- BuildFromSource, How to build from source
- TextMiningResources, Resources on Text Mining (links, papers, extracts of books)
- WordnetApi, our interface to the WordNet database of lexical relationships.
- Clustering, Guide to the clustering algorithms used in Text-Analysis
- InteractiveShell, Guide to the interactive shell and scripting features
- PCA, Guide to the Principal Component Analysis
- MultiFileUploader, An applet to upload multiple file to a remote server.
- LowLevelModules, a list of the low-level text mining/NLP abilities.
- TextExtraction, extraction of useful and relevant content from web pages.
- StringMatching, Aho–Corasick algorithm.
- AutomaticSummarization, Create a summary extracting the most significant sentences in a text (or in multiple documents).
- TextSimilarity, Measure the similarity between strings or senteces.
For a complete list of all the wiki pages, go to TitleIndex.