워드클라우드

%pwd

!pip3 install WordCloud
!pip3 install konlpy

from wordcloud import WordCloud
from collections import Counter

from konlpy.tag import Okt
from konlpy.tag import Komoran

from PIL import Image
import matplotlib.pyplot as plt
import numpy as np

with open('./data/대한민국헌법.txt', 'r', encoding='utf-8') as f:
    text = f.read()

okt = Okt()

# 명사만 추출
nouns = okt.nouns(text) 
nouns

words = [n for n in nouns if len(n) > 1] # 단어의 길이가 1개인 것은 제외
words

# 위에서 얻은 words를 처리하여 단어별 빈도수 형태의 딕셔너리 데이터를 구함
wcnt = Counter(words) 
wcnt

wc = WordCloud(font_path='malgun', width=400, height=400, scale=2.0, max_font_size=250)
gen = wc.generate_from_frequencies(wcnt)
plt.figure()
plt.imshow(gen)

wc.to_file('법전_워드클라우드.png')

'Computer Engineering > Linux' 카테고리의 다른 글

DB, FTP 연결 (0)	2023.06.26
별 출력하기 (0)	2023.06.23
Notepad++ 연결 (0)	2023.06.23
리눅스 명령어 실습 (0)	2023.06.23
Linux 기본설정 (0)	2023.06.21

.

워드클라우드

'Computer Engineering > Linux' 카테고리의 다른 글

댓글

티스토리툴바

워드클라우드

'Computer Engineering > Linux' 카테고리의 다른 글

관련글

댓글

티스토리툴바