API Reference
Lang, Doc, Corpus
  Doc Extensions
Datasets and Resources
  Capitol Words Congressional speeches
  Supreme Court decisions
  Wikimedia articles
  Reddit comments
  Oxford Text Archive literary works
  IMDB movie reviews
  UDHR translations
  ConceptNet
  DepecheMood
Text Preprocessing
  Pipeline
  Normalize
  Remove
  Replace
Information Extraction
  Basics
  Matches
  Triples
  Acronyms
  KWIC
  Keyterms
Text Statistics
  Basic Stats
  Readability Stats
  Pipeline Components
Document Similarity
  Edit-based Metrics
  Token-based Metrics
  Sequence-based Metrics
  Hybrid Metrics
Document Representations
  Networks
  Sparse Vectors
  Vectorizers
Topic Modeling
File I/O
  I/O Utils
Visualization
Data Augmentation
Miscellany
  Language Identification
  Utilities
  Semantic Networks