Hybridised indexing for research based information retrieval

Author(s)
Fitzgerald, Kyle Andrew
Date Issued
2019
Type
Thesis
Publisher
Cape Peninsula University of Technology
Abstract
Information retrieval systems struggle when the vocabulary of a query does not match the vocabulary of candidate source documents. As a result, these systems may retrieve some documents that are non-relevant and miss some that are relevant. This lengthens research by forcing users to peruse unsatisfactory results and to repeat searches with alternative vocabularies, which renders information retrieval systems less effective than they could be and inhibits productive research.
The aim of this research was to design, build, and rigorously pilot test a hybrid indexing method that maintains phrase-term word ordinality and word proximity, and to compare the effectiveness of this method with the traditional inverted indexing method. The objectives were to prove statistically that the hybrid indexing method: i) increases the effectiveness of retrieving only those documents that are judged relevant by the user; ii) reduces errors in identifying documents the user judges relevant, thus reducing the number of documents for the user to peruse; and iii) improves the rejection of documents the user judges non-relevant, thus giving the user confidence in the judgement of the information retrieval system. A final objective was to determine whether this hybrid indexing method solves the problem of mismatching vocabulary between a query and a document, and satisfies the information needs of the user by retrieving only those documents from the collection that are relevant to the user. The results of the statistical analysis are not themselves the contribution to knowledge; the statistics are used to prove that the hybrid indexing method worked. The indexing method is the contribution to the body of knowledge.
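To make the contrast between the two indexing approaches concrete, the following is a minimal sketch only. It is not the thesis's hybrid token index, whose design is not described in this abstract. It uses a standard positional inverted index as a stand-in to illustrate what keeping word order and proximity adds over a plain inverted index. All function names and the sample documents (`DOCS`) are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample collection, invented for illustration only.
DOCS = {
    1: "information retrieval systems index documents",
    2: "retrieval of information from index systems",
}

def build_inverted_index(docs):
    """Plain inverted index: term -> set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

def build_positional_index(docs):
    """Positional index: term -> {doc_id: [word positions]}.
    Storing positions preserves word order and proximity."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, term in enumerate(text.lower().split()):
            index[term][doc_id].append(pos)
    return index

def bag_of_words_search(index, query):
    """Return documents containing every query term, in any order."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result

def phrase_search(index, query):
    """Return documents containing the query terms adjacent and in order."""
    terms = query.lower().split()
    if not terms or any(t not in index for t in terms):
        return set()
    hits = set()
    for doc_id, starts in index[terms[0]].items():
        for start in starts:
            # Term i of the phrase must sit exactly i words after the start.
            if all(start + i in index[t].get(doc_id, [])
                   for i, t in enumerate(terms)):
                hits.add(doc_id)
                break
    return hits

if __name__ == "__main__":
    print(bag_of_words_search(build_inverted_index(DOCS), "information retrieval"))
    print(phrase_search(build_positional_index(DOCS), "information retrieval"))
```

In this toy collection both documents contain the words "information" and "retrieval", so the plain inverted index returns both, while the positional lookup returns only the document in which the two words appear next to each other in that order.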
The strategy used was based on design science research and comprised both an exploratory and an explanatory study. Quantitative data were collected from the results of processing search queries through two information retrieval systems (one using the hybrid indexing method and the other the inverted indexing method) and from the results of a questionnaire completed by five participants during an experiment. The quantitative data were converted to binary and tested statistically using the mean values for precision, recall, and specificity, and the Kappa coefficient.
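The four measures named above can be sketched from binary data as follows. This is an illustrative sketch under stated assumptions, not the thesis's analysis procedure: it assumes each document has one binary system decision (retrieved or not) and one binary user judgement (relevant or not), and it computes Cohen's kappa between those two sequences. Which ratings the thesis actually compared with the Kappa coefficient is not stated in this abstract. The function names and sample values are hypothetical.

```python
def confusion_counts(retrieved, relevant):
    """Count (tp, fp, fn, tn) from two equal-length binary sequences."""
    pairs = list(zip(retrieved, relevant))
    tp = sum(1 for r, u in pairs if r and u)
    fp = sum(1 for r, u in pairs if r and not u)
    fn = sum(1 for r, u in pairs if not r and u)
    tn = sum(1 for r, u in pairs if not r and not u)
    return tp, fp, fn, tn

def precision(tp, fp, fn, tn):
    """Share of retrieved documents that are relevant."""
    return tp / (tp + fp) if (tp + fp) else 0.0

def recall(tp, fp, fn, tn):
    """Share of relevant documents that were retrieved."""
    return tp / (tp + fn) if (tp + fn) else 0.0

def specificity(tp, fp, fn, tn):
    """Share of non-relevant documents that were correctly rejected."""
    return tn / (tn + fp) if (tn + fp) else 0.0

def cohen_kappa(tp, fp, fn, tn):
    """Agreement between system and user, corrected for chance agreement."""
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    return (observed - expected) / (1 - expected) if expected != 1 else 1.0

if __name__ == "__main__":
    # Hypothetical data for eight documents: 1 = retrieved / relevant.
    system_retrieved = [1, 1, 1, 0, 0, 0, 0, 0]
    user_relevant = [1, 1, 0, 1, 0, 0, 0, 0]
    counts = confusion_counts(system_retrieved, user_relevant)
    for measure in (precision, recall, specificity, cohen_kappa):
        print(measure.__name__, round(measure(*counts), 3))
```

Averaging each measure over all queries would give per-system means of the kind the abstract describes.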
The hybrid indexing method was presented and proved, with statistical significance, to increase system effectiveness and specificity. The results indicate that the vocabulary mismatch problem between a query and a document was solved, but that the information needs of the user were not satisfied.
Additional information
Thesis (Doctor of Information and Communication Technology: Information Technology)--Cape Peninsula University of Technology, 2019
Subjects

Hybrid token index

Information retrieval...

Information storage a...

File(s)
Name

Fitzgerald_Kyle_205118801_Vol._2.pdf

Description
Appendices File
Size

5.8 MB

Format

Adobe PDF

Checksum

(MD5):7dec46c5768ff504350b81dcb9eea302

Name

Fitzgerald_Kyle_205118801_Vol_1.pdf

Description
Main Thesis File
Size

4.42 MB

Format

Adobe PDF

Checksum

(MD5):98201933e6a03645990d017d0426ab19
