A Framework For Aggregating And Retrieving Relevant Information Using TF-IDF And Term Proximity In Support Of Maize Production


dc.contributor.author Kasyoka, Philemon
dc.contributor.author Mwangi, Waweru
dc.contributor.author Kimwele, Michael
dc.date.accessioned 2015-01-12T05:55:20Z
dc.date.available 2015-01-12T05:55:20Z
dc.date.issued 2014-03
dc.identifier.citation International Journal of Scientific and Technology Research, volume 3, issue 3, March 2014 en_US
dc.identifier.issn 2277-8616
dc.identifier.uri http://www.ijstr.org/final-print/mar2014/A-Framework-For-Aggregating-And-Retrieving-Relevant-Information-Using-Tf-idf-And-Term-Proximity-In-Support-Of-Maize-Production.pdf
dc.identifier.uri http://hdl.handle.net/123456789/546
dc.description.abstract This paper presents a framework for aggregating and retrieving relevant maize information using Term Frequency-Inverse Document Frequency (TF-IDF) and term proximity. The framework aggregates information from agricultural websites and blogs using RSS technology. TF-IDF can retrieve relevant documents from the aggregated RSS feeds; however, the presence of a query term within a retrieved document does not necessarily imply relevance, and documents with the same similarity score do not necessarily have the same level of relevance. To mitigate this problem, we implement a term proximity scoring approach that improves relevance among the top-k documents returned by TF-IDF. The term proximity score uses both the span-based method and the pair-based method to ensure effective proximity scoring. The user preference profile is based on the keywords that form the user query, while text documents are composed of the RSS description content and the RSS title tag content. Stemming is applied to query and document terms for better precision. This framework will help ensure that maize farmers get the most relevant information from online sources. en_US
dc.language.iso en en_US
dc.publisher IJSTR en_US
dc.subject Inverse Document Frequency en_US
dc.subject Information Retrieval en_US
dc.subject RSS en_US
dc.subject Term Frequency en_US
dc.subject Term Proximity en_US
dc.title A Framework For Aggregating And Retrieving Relevant Information Using TF-IDF And Term Proximity In Support Of Maize Production en_US
dc.type Article en_US
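
The abstract describes a two-stage ranking: TF-IDF selects the top-k documents, and a term proximity score that combines a span-based and a pair-based measure separates documents whose similarity scores are equal. The record does not give the paper's formulas, so the following is only a minimal sketch under stated assumptions: whitespace tokenization without stemming, a span term of (matched terms / shortest covering window), a pair term of 1/d^2 per query-term pair, and proximity used purely as a tie-breaker. All function names are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def tokenize(text):
    # Assumption: lowercase whitespace split; the paper also applies stemming.
    return text.lower().split()


def tf_idf_scores(query, docs):
    """Sum of TF-IDF weights of the query terms, one score per document."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter(t for toks in tokenized for t in set(toks))
    q_terms = sorted(set(tokenize(query)))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        scores.append(sum((tf[t] / len(toks)) * math.log(n / df[t])
                          for t in q_terms if tf[t]))
    return scores


def positions(tokens, q_terms):
    """Map each query term found in the document to its token offsets."""
    pos = {}
    for i, t in enumerate(tokens):
        if t in q_terms:
            pos.setdefault(t, []).append(i)
    return pos


def min_span(tokens, q_terms):
    """Span-based: width of the shortest window holding every matched term."""
    needed = set(positions(tokens, q_terms))
    if len(needed) < 2:
        return None
    hits = [(i, t) for i, t in enumerate(tokens) if t in needed]
    counts, best, left = Counter(), None, 0
    for right_pos, term in hits:
        counts[term] += 1
        # Shrink the window from the left while it still covers all terms.
        while all(counts[t] > 0 for t in needed):
            width = right_pos - hits[left][0] + 1
            best = width if best is None else min(best, width)
            counts[hits[left][1]] -= 1
            left += 1
    return best


def pair_score(tokens, q_terms):
    """Pair-based: sum of 1/d^2 over query-term pairs, d = closest distance."""
    pos = positions(tokens, q_terms)
    total = 0.0
    for a, b in combinations(sorted(pos), 2):
        d = min(abs(i - j) for i in pos[a] for j in pos[b])
        total += 1.0 / (d * d)
    return total


def proximity_score(doc, query):
    tokens, q_terms = tokenize(doc), set(tokenize(query))
    span = min_span(tokens, q_terms)
    matched = len(positions(tokens, q_terms))
    span_part = 0.0 if span is None else matched / span
    return span_part + pair_score(tokens, q_terms)


def rank(query, docs, k=3):
    """Indices of the top-k documents by TF-IDF, ties broken by proximity."""
    base = tf_idf_scores(query, docs)
    top = sorted(range(len(docs)), key=lambda i: base[i], reverse=True)[:k]
    return sorted(top,
                  key=lambda i: (base[i], proximity_score(docs[i], query)),
                  reverse=True)


docs = [
    "maize farmers plan harvest after rain",
    "maize rain helps farmers plan harvest",
    "wheat prices rose again this week",
]
ranking = rank("maize rain", docs, k=2)
```

In this toy input the first two documents receive identical TF-IDF scores for the query "maize rain", so only the proximity score decides their order. In the framework described above, each document would be the title and description text of an aggregated RSS item.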

