Improving Relevance Prediction for Focused Web Crawlers

د. مجدل سلطان بن سفران

Assistant Professor

أستاذ مساعد بقسم علوم الحاسب/المشرف على كرسي أبحاث الذكاء الاصطناعي في الحوار الالكتروني والتواصل الحضاري

علوم الحاسب والمعلومات

مبنى كلية الحاسب الآلي والمعلومات

Improving Relevance Prediction for Focused Web Crawlers

Safran, Mejdl . 2012

Database advanced application Relevance Prediction Web Crawler IR

A key issue in designing a focused Web crawler is how to determine whether an unvisited URL is relevant to the search topic. Effective relevance prediction can help avoid downloading and visiting many irrelevant pages. In this paper, we propose a new learning-based approach to improve relevance prediction in focused Web crawlers. For this study, we chose Naïve Bayesian as the base prediction model, which however can be easily switched to a different prediction model. Experimental result shows that our approach is valid and more efficient than related approaches.

Publisher Name

IEEE

Publishing City

Shanghai, China

Conference Location

Shanghai, China

Conference Name

IEEE/ACIS 11th International Conference on Computer and Information Science (ICIS),

more of publication

Energy-latency trade-off analysis for scientific workflow in cloud environments: The role of processor utilization ratio and mean grey wolf optimizer

Cloud computing has demonstrated its effectiveness in handling complex data that requires substantial computational power, immediate responsiveness, and ample storage capacity.

2024

Modern Subtype Classification and Outlier Detection Using the Attention Embedder to Transform Ovarian Cancer Diagnosis

2024

A Robust and Light-Weight Transfer Learning-Based Architecture for Accurate Detection of Leaf Diseases Across Multiple Plants using Less Amount of Images

Leaf diseases are a global threat to crop production and food preservation. Detecting these diseases is crucial for effective management. We introduce LeafDoc-Net, a robust, lightweight transfer-…

2024