[파이썬] textblob 단어 빈도수 계산

Have you ever wondered how to count the frequency of words in a given text using Python? Look no further, as Textblob, a powerful Python library, makes it easy to perform this task. In this blog post, we will explore how to use Textblob to calculate word frequencies.

Installation

Before we begin, make sure you have Textblob installed. If you haven’t, you can install it using pip:

pip install textblob

Getting started

First, we need to import the Textblob module:

from textblob import TextBlob

Next, we need to create a Textblob object with the text we want to analyze:

text = "The quick brown fox jumps over the lazy dog."
blob = TextBlob(text)

Calculating word frequencies

To calculate the word frequencies, we can use the word_counts attribute of the Textblob object. It returns a dictionary where the keys are the words and the values are their respective frequencies:

word_frequencies = blob.word_counts

We can print the word frequencies to see the results:

for word, frequency in word_frequencies.items():
    print(f"{word}: {frequency}")

Filtering stop words

Textblob also provides a list of common stop words that we can filter out if needed. Stop words are words that are commonly used and do not provide much meaningful information in the text.

To filter stop words, we can use the stopwords attribute of the Textblob object. We can access the list of stop words using TextBlob.DEFAULT_STOPWORDS. Here’s an example of how to filter stop words from the word frequencies:

stop_words = TextBlob.DEFAULT_STOPWORDS
filtered_word_frequencies = {word: frequency for word, frequency in word_frequencies.items() if word not in stop_words}

for word, frequency in filtered_word_frequencies.items():
    print(f"{word}: {frequency}")

Conclusion

Using Textblob, calculating word frequencies in a text becomes a simple task with just a few lines of code. This can be useful for a variety of applications, such as text analysis, natural language processing, and information retrieval.

Remember to install Textblob and give it a try in your next Python project!