"An Efficient Search Paradigm Implementing a Unique Identifier" by Devendra Manandhar

Document Type

Thesis - University Access Only

Award Date

2009

Degree Name

Master of Science (MS)

Department / School

Electrical Engineering and Computer Science

Abstract

The Internet is an ocean of information. One can get information on many subjects with sufficient depth; what and how deep, depends on the tools that the user has at their disposal. There have been many studies regarding tracking information about the Internet's contents. There is no best way to do this because the Internet is very dynamic. In addition, different Internet technologies are being developed every day. Searching forms an interesting branch of the Internet technology. There are many search techniques and search algorithms for the Internet and many more are being tried daily. In this thesis, a new paradigm of searching has been designed and tested. In a basic form, searching starts with crawlers downloading web documents, wrappers extract and index web content from those downloaded web documents in a database. The stored data can then be used by other applications for various purposes. There is lots of information in the Internet which are not accessible through simple navigation. To access this information parameterized navigation is required where the additional information to the web server must be provided through used of the web forms. This extra information is passed to the web server in the form of query string. In this thesis, a searching model has been designed and tested where the Uniform Resource Locator (URL) and the query string is constructed using the unique identifier of an item supplied by the web server in response to the query constructed for the particular item. The search domain is confined to book search from different vendors using the ISBN number as the unique identifier for constructing the URL as the ISBN numbers are universally unique.

Library of Congress Subject Headings

Internet searching

International Standard Book Numbers

Uniform Resource Identifiers

Format

application/pdf

Number of Pages

100

Publisher

South Dakota State University

Share

COinS