Natural Language Processing (NLP)
- EDA and visualization of text data
- text2emotion to detect emotions from textual data
- Quantify information in statements with entropy from Information Theory
- NLTK
- Chatbot using rasa
- rasa: open source ML framework to automate text- and voice-based conversations
- WordNet for a lexical taxonomy of English words
- Clean text
- Gramformer for text correction
- Styleformer for text styling
- Data jobs description analyzed with scattertext
- Scattertext
- texthero
- yarl for URL processing
- Ecco for pattern visualization in text data
- Data QA to label data
- Text summarization
- Text similarity with Levenshtein distance
- Autocorrect for multilanguage spelling correction
- Neattext for cleaning textual data and text preprocessing
- Texthero tutorial
- Microsoft presidio for NER (Named Entity Recognition) and data anonymization
- SEER model for information extraction based on user-specified examples
- Textnets: text analysis with networks
- Universal romanizer tool
- Text summarization
- Sentence embedding
- Semantic search with txtai
- Arabica and Cappuccino for text EDA
- Simple spelling check in Python
- A guide to computational linguistics and conversational AI
- diff-match-patch: robust algorithms to perform the operations required for synchronizing plain text
- PolyFuzz: fuzzy string matching and string grouping
- Outlines: a library for neural text generation
- sense2vec: query vectors for multi-word phrases based on part-of-speech tags and entity labels
- NLP Course
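Two ideas named in the list above, entropy from Information Theory and Levenshtein distance, are small enough to sketch with the standard library alone. The following is a minimal illustration and is not taken from any of the linked resources. The function names `shannon_entropy` and `levenshtein` are hypothetical, and the entropy is computed over whitespace-separated tokens, which is only one of several reasonable choices.

```python
import math
from collections import Counter


def shannon_entropy(text: str) -> float:
    """Shannon entropy, in bits per token, of the whitespace-separated tokens.

    Assumption: tokens are produced by str.split(); a real pipeline would
    likely use a proper tokenizer.
    """
    tokens = text.split()
    if not tokens:
        return 0.0
    total = len(tokens)
    counts = Counter(tokens)
    # H = -sum(p * log2(p)) over the empirical token distribution.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, and
    substitutions needed to turn `a` into `b`."""
    # previous[j] holds the distance between the processed prefix of `a`
    # and the first j characters of `b`.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete ch_a
                    current[j - 1] + 1,      # insert ch_b
                    previous[j - 1] + cost,  # substitute or match
                )
            )
        previous = current
    return previous[-1]
```

For example, `levenshtein("kitten", "sitting")` evaluates to 3, and a statement that repeats a single word has zero entropy under this token-level definition. Dedicated libraries are usually a better fit for large inputs; this version keeps only two rows of the dynamic-programming table to stay short.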
Topic modeling
- Concept modeling to link text and images
- Topic coherence measures
- Intro to topic modeling with Latent Dirichlet Allocation (LDA)
- Topic modeling strategies comparison
- Hands-on topic modeling via LDA
- Advanced Topic Modeling with BERTopic
- Topic Modeling with Llama 2
- cluestar: visualisation tools to get started with text classification tasks
- Practical Guide to Topic Modeling with Latent Dirichlet Allocation (LDA)
- BERTrend: neural Topic Modeling for Emerging Trends Detection
- Turftopic: robust and fast topic models with sentence-transformers
- topicwizard: powerful topic model visualization in Python