【免费】基于Python的信息检索与信息抽取系统-课程设计.rar

共55个文件

css：10个

js：10个

png：10个

python

信息检索

信息抽取

课程设计

需积分: 0 112 浏览量更新于2023-06-15 4 收藏 110.92MB RAR 举报

本项目利用Python实现了一个信息检索与信息抽取系统，包括数据、前端和后端代码。信息检索（Information Retrieval）是用户进行信息查询和获取的主要方式，是查找信息的方法和手段。狭义的信息检索仅指信息查询（Information Search）。即用户根据需要，采用一定的方法，借助检索工具，从信息集合中找出所需要信息的查找过程。广义的信息检索是信息按一定的方式进行加工、整理、组织并存储起来，再根据信息用户特定的需要将相关信息准确的查找出来的过程。又称信息的存储与检索。一般情况下，信息检索指的就是广义的信息检索。信息抽取（Information Extraction: IE）是把文本里包含的信息进行结构化处理，变成表格一样的组织形式。抽取系统的输入信息是原始文本，输出的是固定格式的信息点。信息点从各种各样的文档中被抽取出来，然后以统一的形式集成在一起。这就是信息抽取的主要任务。信息以统一的形式集成在一起的好处是方便检查和比较。信息抽取技术并不试图全面理解整篇文档，只是对文档中包含相关信息的部分进行分析。至于哪些信息是相关的，那将由系统设计时定下的领域范围而定。

收起资源包目录

基于Python的信息检索与信息抽取系统-课程设计.rar （55个子文件）

基于Python的信息检索与信息抽取系统-课程设计

TouristScenicSpotSearchEngine

__init__.py 2KB

.gitattributes 93B

templates

show.html 7KB

404.html 292B

layout.html 1KB

index.html 8KB

algorithm2.py 12KB

words.py 3KB

algorithm.py 14KB

static

bootstrap.js 120KB

bootstrap.bundle.js.map 348KB

bootstrap.bundle.min.js.map 286KB

bootstrap.min.js 50KB

bootstrap-paginator.js 20KB

china.js 60KB

bootstrap.bundle.min.js 69KB

bootstrap.bundle.js 206KB

bootstrap.js.map 205KB

echarts.min.js 691KB

jquery.js 379KB

bootstrapv3.js 57KB

bootstrap.min.js.map 170KB

qunit-1.11.0.js 56KB

css

bootstrap-grid.css.map 95KB

bootstrap.min.css 138KB

bootstrap-grid.css 37KB

bootstrap-grid.min.css.map 67KB

bootstrap-reboot.min.css.map 25KB

bootstrap.css.map 416KB

bootstrap-responsive.css 22KB

site.css 72B

bootstrap.css 169KB

bootstrap-reboot.css 5KB

bootstrap-reboot.css.map 58KB

bootstrap-grid.min.css 28KB

bootstrapv3.css 117KB

qunit-1.11.0.css 5KB

bootstrap.min.css.map 547KB

bootstrap-reboot.min.css 4KB

images

404.png 636KB

README.md 3KB

introduction

TRAVELDB.sql.zip 62.09MB

3.png 286KB

IR-IE.docx 7.54MB

china.zip 39.6MB

1.png 229KB

6.png 259KB

5.png 260KB

4.png 275KB

plot_detail.png 284KB

7.png 258KB

stopwords.txt 20KB

2.png 266KB

plot.png 116KB

name.py 3KB

资源推荐

资源预览

资源评论

# 北京邮电大学暑期课程信息检索与信息抽取课程设计 <br><br> ## 文件介绍 <br> |文件名|作用| |:---|:---| | \_\_init\_\_.py|flask程序入口| |algorithm.py|信息检索与抽取算法1| |algorithm2.py|信息检索与抽取算法2| |words.py|算法1用到的向量空间模型计算方法| |name.py|命名实体识别| |static文件夹|css,image,js文件| |template文件夹|html文件| |introduction/china.zip|网络爬虫得到的保存着中国所有景点信息的TXT文件| |introduction/TRAVELDB.sql.zip|数据库建表sql文件| |introduction/IR-IE.docx|实验报告| |introduction/stopwords.txt|停用词表| ## 说明 <br> 默认使用的是算法2，要改为算法1，请将 *\_\_init\_\_.py* 文件中的 *import algorithm2 as ag* 改为 *import algorithm as ag*， <br> 并在words.py中设置 *mystopwords = stopwordslist(stopwords.txt文件路径)*。 <br> introduction/china.zip解压缩后的 *景点名.txt* 文件是该景点的总介绍，*景点名_detail.txt* 文件是该景点详细文字介绍的 <br> 分词结果。 <br> <div align=center><img width="480" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/plot.png"/></div> <br> <div align=center><img width="480" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/plot_detail.png"/></div> ## 程序运行截图 <br> ### 主界面 <br> <div align=center><img width="580" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/1.png"/></div> ### 搜索框输入历史悠久的红色旅游景点 <br> <div align=center><img width="580" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/2.png"/></div> <div align=center><img width="580" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/3.png"/></div> ### 搜索框输入令人心旷神怡的地方 <br> <div align=center><img width="580" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/4.png"/></div> <div align=center><img width="580" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/5.png"/></div> ### 搜索框输入江西革命圣地 <br> <div align=center><img width="580" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/6.png"/></div> <div align=center><img width="580" height="360" src="https://github.com/cswangyuhui/TouristScenicSpotSearchEngine/blob/master/introduction/7.png"/></div>