A web document is similar in concept to a web page, but also satisfies the following broader (W3C) definition:
- "... Every Web document has its own URI. Note that a Web document is not the same as a file: a single Web document can be available in many different formats and languages, and a single file, for example a PHP script, may be responsible for generating a large number of Web documents with different URIs. A Web document is defined as something that has a URI and can return representations (responses in a format such as HTML or JPEG or RDF) of the identified resource in response to HTTP requests. In technical literature ... the term Information Resource is used instead of Web document."[1]
The term "web document" has been used loosely in many sources (see [2], [3], [4], [5], [6] and others), but the W3C definition given above applies in all of them. Recent research in fields such as "Web Document Retrieval" and "Web Document Analysis" (see e.g. [7], [8], [9], [10], [11], [12]) has revived interest in clarifying the correct use of the term.
The key idea is that a single underlying resource in an HTTP system may have several different representations, which can be exposed through mechanisms such as content negotiation.
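As an illustration, the following minimal Python sketch shows how one resource might be served in several representations depending on the client's HTTP `Accept` header. It is a simplified approximation of content negotiation, not a full implementation of the HTTP specification: the function names (`parse_accept`, `specificity`, `quality`, `negotiate`) and the sample representations are hypothetical, and media-type parameters other than `q` are ignored.

```python
from typing import Optional

# Hypothetical representations of one Web document (one URI, many formats).
# Dictionary order expresses the server's own preference when the client is indifferent.
REPRESENTATIONS = {
    "text/html": "<p>Hello</p>",
    "application/rdf+xml": "<rdf:RDF/>",
}


def parse_accept(header: str) -> list[tuple[str, float]]:
    """Parse an Accept header into (media range, q value) pairs."""
    prefs = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_range = fields[0].lower()
        if not media_range:
            continue
        q = 1.0  # a missing q parameter means full preference
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # treat a malformed q value as "not acceptable"
        prefs.append((media_range, q))
    return prefs


def specificity(media_range: str, media_type: str) -> int:
    """Return 2 for an exact match, 1 for 'type/*', 0 for '*/*', -1 for no match."""
    if media_range == media_type:
        return 2
    if media_range == "*/*":
        return 0
    if media_range.endswith("/*") and media_type.startswith(media_range[:-1]):
        return 1
    return -1


def quality(media_type: str, prefs: list[tuple[str, float]]) -> float:
    """Use the q value of the most specific media range that matches media_type."""
    best_spec, best_q = -1, 0.0
    for media_range, q in prefs:
        spec = specificity(media_range, media_type)
        if spec > best_spec:
            best_spec, best_q = spec, q
    return best_q


def negotiate(accept: str, available: dict[str, str]) -> Optional[tuple[str, str]]:
    """Pick the available representation the client prefers most, or None."""
    prefs = parse_accept(accept)
    best, best_q = None, 0.0
    for media_type, body in available.items():
        q = quality(media_type, prefs)
        if q > best_q:  # strict '>' keeps the server's order on ties
            best, best_q = (media_type, body), q
    return best
```

Under these assumptions, `negotiate("application/rdf+xml, text/html;q=0.5", REPRESENTATIONS)` selects the RDF representation, while a client that sends `text/*` receives the HTML one; both responses describe the same Web document at the same URI.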
References
- [1] "Cool URIs for the Semantic Web", W3C (2008). http://www.w3.org/TR/cooluris/#oldweb
- [2] W3C Recommended list of XML-Web documents.
- [3] "Flexible Web Document Analysis for Delivery to Narrow-Bandwidth Devices", G. Penn, J. Hu, H. Luo, R. McDonald. Article abstract at ICDAR'01.
- [4] "On reliable and scalable peer-to-peer Web document sharing", L. Xiao, X. Zhang, and Z. Xu. See IEEE Symp.
- [5] "Web document based graphical user interface", Arthur A. Van Hoff. Patent number: 5802530.
- [6] WDA2001, the "First International Workshop on Web Document Analysis".
- [7] "Query-sets: using implicit feedback and query patterns to organize web documents", B. Poblete and R. Baeza-Yates (2008). doi:10.1145/1367497.1367504
- [8] "Modeling anchor text and classifying queries to enhance web document retrieval", A. Fujii (2008). doi:10.1145/1367497.1367544
- [9] "Web Document Analysis: Challenges and Opportunities", Apostolos Antonacopoulos. World Scientific, 2003. ISBN 981-238-582-7.
- [10] WDA2005, "Web Document Analysis 2005".
- [11] "Query type classification for web document retrieval". Article abstract at ACM SIGIR Conference.
- [12] "Web document clustering: a feasibility demonstration", O. Zamir and O. Etzioni. See ACM SIGIR Conference.