Python | Natural Language Processing (Part 1)

2021-06-22 11:03

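As a beginner, I recently decided to enter an NLP competition, so I am diving into natural language processing and will use this blog to organize my notes as I learn.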

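First, install the nltk library: pip install nltk is all it takes. The corpora behind nltk.book are a separate one-time download, for example by running nltk.download("book") in a Python session.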

from nltk.book import *

*** Introductory Examples for the NLTK Book ***
Loading text1, ..., text9 and sent1, ..., sent9
Type the name of the text or sentence to view it.
Type: 'texts()' or 'sents()' to list the materials.
text1: Moby Dick by Herman Melville 1851
text2: Sense and Sensibility by Jane Austen 1811
text3: The Book of Genesis
text4: Inaugural Address Corpus
text5: Chat Corpus
text6: Monty Python and the Holy Grail
text7: Wall Street Journal
text8: Personals Corpus
text9: The Man Who Was Thursday by G . K . Chesterton 1908

If this banner appears, the library is installed and we can start learning.

Searching text

1. Concordance: text1.concordance("words") prints every occurrence of the given word in the text, together with its surrounding context.

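2. Dispersion plot: shows where a word occurs across the text and how frequently. In NLTK this is the dispersion_plot method of a text, which takes a list of words and needs matplotlib installed.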
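To make the two search operations above concrete, here is a minimal standard-library sketch of the ideas behind them. It is not NLTK's implementation: the function names concordance and offsets, the bracket formatting, and the sample sentence are all made up for illustration. The first function lists each match with a few tokens of context, and the second returns the token positions that a dispersion plot would draw.

```python
from typing import List


def concordance(tokens: List[str], word: str, width: int = 3) -> List[str]:
    """Return each occurrence of `word` with `width` tokens of context per side."""
    target = word.lower()  # match case-insensitively
    lines = []
    for i, tok in enumerate(tokens):
        if tok.lower() == target:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


def offsets(tokens: List[str], word: str) -> List[int]:
    """Return the token positions where `word` occurs (case-sensitive)."""
    return [i for i, tok in enumerate(tokens) if tok == word]


# Hypothetical sample sentence, split on whitespace for simplicity.
sample = "the whale swam and the boat chased the whale until the whale dived".split()

for line in concordance(sample, "whale", width=2):
    print(line)

print(offsets(sample, "whale"))
```

With the book corpora loaded, the same questions can be asked of text1 directly through its concordance and dispersion_plot methods.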

Computing with language: simple statistics

1. Frequency distribution

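From the output we can see that fdist behaves like a dictionary: each key is a token (a word or a punctuation mark) and each value is the number of times it occurs.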
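NLTK's FreqDist is a subclass of collections.Counter, so the dictionary-like behaviour described here can be tried with the standard library alone. The snippet below is a small sketch under that assumption. The token list is invented for illustration, and Counter stands in for FreqDist.

```python
from collections import Counter

# Hypothetical sample tokens; with NLTK loaded, a text such as text1 could be counted instead.
tokens = "the whale and the boat and the sea".split()

# Counting tokens yields a mapping from token to number of occurrences.
fdist = Counter(tokens)

print(fdist["the"])          # count for a single token
print(fdist.most_common(2))  # the two most frequent tokens with their counts
print(sum(fdist.values()))   # total number of tokens counted
```

Looking up a token that never occurs returns 0 rather than raising KeyError, which is convenient when comparing counts across texts.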

That wraps up a first look at the nltk library and a few of its basic functions.

Onward!

Original article: https://www.cnblogs.com/Virtual-Z/p/9678511.html
