Use this resource - and many more! - in your textbook!
AcademicPub holds over eight million pieces of educational content for you to mix-and-match your way.

Web Page Downloading and Classification
By: Le, D.X.; Moon, C.W.; Tran, L.Q.; Thoma, G.R.;
2001 / IEEE / 0-7695-1004-3
Description
This item was taken from the IEEE Conference ' Web Page Downloading and Classification ' Describes the processes of downloading and classifying Web-based articles in online medical journals as a preliminary step to extracting bibliographic data to populate MEDLINE/sup (R)/, the widely-used database of the National Library of Medicine (NLM). The processes are combined to develop an automated system named WPDC (""Web Page Downloading and Classification""). The system downloads the Web pages using Microsoft's Windows Internet API tool WinInet, and a combination of several artificial intelligence (AI) techniques, including the breadth-first search algorithm and the constraint satisfaction method. The breadth-first search algorithm and the constraint satisfaction method are then used to traverse the Web page's links, identify these pages as abstract, full text, PDF or image files, and recognize and generate the successors of the downloading pages.
Related Topics
Medical Information Systems
Application Program Interfaces
Internet
Tree Searching
Constraint Handling
Hypermedia
Downloading Page Successors
World Wide Web-based Articles
Online Medical Journals
Bibliographic Data Extraction
Medline
Wpdc
Web Page Downloading And Classification
Internet Api Tool
Wininet
Artificial Intelligence Techniques
Breadth-first Search Algorithm
Constraint Satisfaction Method
Web Page Link Traversal
Web Page Identification
Abstract
Full Text
Pdf File
Image File
Web Pages
Biomedical Imaging
Data Mining
Web Server
Libraries
Internet
Mars
Image Databases
Lab-on-a-chip
Moon
Bibliographic Systems
Classification
Information Resources
Document Handling
Engineering
Microsoft Windows