Dockerised Search Engine in Java

Description

The project comprises three distinct modules: a web crawler for downloading and managing books, an indexer that incorporates both expanded and aggregated data structures, and a query engine for processing search requests. To ensure reproducibility and platform independence, the entire system is encapsulated in portable containers using Docker. The benchmarking results highlight the advantages of the expanded index in terms of faster indexing speeds, while the aggregated index demonstrates slightly superior performance in single-word query scenarios. Ultimately, the expanded structure was selected for its balanced trade-off between speed and scalability, aligning well with the anticipated requirements of future project developments.
The objective of this project is to create an efficient and robust search engine using
an inverted index, a data structure designed to store words and their positions within
documents. This approach facilitates rapid and precise searches by not only identifying
which books contain specific terms but also pinpointing their exact locations within the
text.

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

No more offers for this product!

General Inquiries

There are no inquiries yet.

Regular price

Cokoliv