[Python] NLP : TreebankWordTokenizer, Pos Tag
Using the nltk module: improved word tokenization with TreebankWordTokenizer, and part-of-speech tagging with the pos_tag function.
import nltk
from nltk.tag import pos_tag
from nltk.tokenize import TreebankWordTokenizer
nltk.download('averaged_perceptron_tagger')  # POS tagging requires this data to be downloaded first
text = "some teacher don't know how to teach it in then way that students understand it. \n\r that causes students to fail and they may repeat the class. The X.G Compay"
tb_worktoken..
2022. 9. 17. 06:38
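The preview above is cut off before the tokenizer is actually called, so here is a minimal sketch of how the two pieces are typically combined. The sample sentence and the variable names (`sample`, `tokens`, `tagged`) are assumptions made for illustration and are not taken from the original post. The tagger step is wrapped in a try/except because it depends on downloaded data, and newer NLTK releases may expect a differently named resource than 'averaged_perceptron_tagger'.

```python
from nltk.tag import pos_tag
from nltk.tokenize import TreebankWordTokenizer

# Hypothetical sample sentence (not the text from the original post).
sample = "Some teachers don't know how to teach it."

# TreebankWordTokenizer is rule-based and needs no downloaded data.
# It splits contractions ("don't" -> "do", "n't") and separates the
# sentence-final period into its own token.
tokens = TreebankWordTokenizer().tokenize(sample)
print(tokens)
# ['Some', 'teachers', 'do', "n't", 'know', 'how', 'to', 'teach', 'it', '.']

# pos_tag needs the perceptron tagger data; if it is missing, NLTK raises
# LookupError, so fall back to a hint instead of crashing.
try:
    tagged = pos_tag(tokens)  # list of (token, tag) pairs
    print(tagged)
except LookupError:
    tagged = None
    print("Tagger data not found; run nltk.download('averaged_perceptron_tagger') first.")
```

Passing the Treebank tokens straight into pos_tag keeps "do" and "n't" as separate items, so each part of the contraction receives its own tag.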