225 / 2017-01-13 19:41:13
PDF Crawler using Inverted Index and Interval lists
keyword search, key-phrase search, inverted index, interval list.
摘要录用
Snehal Kadwe / Yeshwantrao Chavan College of Engineering
Abstract- The search operation in PDF document has become very indispensable now a days and loads of research have being organised to store and process the index required for search operation in a very simple and effective manner. Whenever indexes are stored, its access time is large and it requires large amount of storage space. The above techniques has some limitation like it can be done only for small number of PDF documents. To increase the access time and to reduce the storage space we are using the concept of inverted index and interval list. With the help of inverted index of a keyword available in PDF it can easily retrieve the PDF document. It can assigns unique id to each and every document (docID) available in repository. Interval list is used for lower bound and upper bound of document present in repository. The ¬inverted index and interval list make it easy to retrieve an information of PDF document with the help of keyword. The combination of both can improves the information retrieval system (IR) and it allow us to search millions of PDF document.
重要日期
  • 会议日期

    03月22日

    2017

    03月24日

    2017

  • 02月15日 2017

    初稿截稿日期

  • 02月20日 2017

    初稿录用通知日期

  • 02月22日 2017

    终稿截稿日期

  • 03月24日 2017

    注册截止日期

移动端
在手机上打开
小程序
打开微信小程序
客服
扫码或点此咨询