Improving Relevance Prediction for Focused Web Crawlers
Safran, Mejdl . 2012
Database advanced application Relevance Prediction Web Crawler IR
A key issue in designing a focused Web crawler is how to determine whether an unvisited URL is relevant to the search topic. Effective relevance prediction can help avoid downloading and visiting many irrelevant pages. In this paper, we propose a new learning-based approach to improve relevance prediction in focused Web crawlers. For this study, we chose Naïve Bayesian as the base prediction model, which however can be easily switched to a different prediction model. Experimental result shows that our approach is valid and more efficient than related approaches.
Publisher Name
IEEE
Publishing City
Shanghai, China
Conference Location
Shanghai, China
Conference Name
IEEE/ACIS 11th International Conference on Computer and Information Science (ICIS),
Cloud computing has demonstrated its effectiveness in handling complex data that requires substantial computational power, immediate responsiveness, and ample storage capacity.
Leaf diseases are a global threat to crop production and food preservation. Detecting these diseases is crucial for effective management. We introduce LeafDoc-Net, a robust, lightweight transfer-…