WDA2003
Call for Participation

Second International Workshop on Web Document Analysis
August 3, 2003
Edinburgh, Scotland, UK
(co-located with ICDAR2003)

http://www.csc.liv.ac.uk/~wda2003
email: wda2003@csc.liv.ac.uk

** Short Papers Due March 31, 2003 **

CALL FOR PARTICIPATION

Background
----------

With the ever-increasing use of the Web, a growing number of documents are published and accessed on-line. The emerging issues pose new challenges for Document Analysis. The need is evident for further discussion to identify the role of Document Analysis in Web applications.

While there has been active research on Web Content Extraction using text-based techniques, documents are in fact 2-dimensional entities, and often include multimedia content. Hence, techniques that have been developed for image-based documents could prove valuable in the realm of Web documents, and new methods for the analysis of multimedia content will be required.

Following the success of the First WDA workshop (WDA2001) in Seattle, USA, the series continues with WDA2003 in Edinburgh, UK. The aim of the workshop is to bring together researchers from Document Analysis and Web communities such as Web Content Extraction, Web Publishing, Digital Libraries and e-Commerce Security, to share experiences and discuss possible avenues for future collaboration.

Technical Focus
---------------

This workshop is intended as a forum for discussing emerging issues in document analysis in Web environments. Special attention will be given to new applications and requirements created by opportunities on the Internet (in the area of multimedia document analysis and management). We invite contributions in areas including but not limited to the following:

- Layout analysis of Web documents and its applications to content extraction, multimodal access and Web mining.
- Document understanding and semantic tagging for web services.
- Digital document models: homogeneous representation of structured documents, hypertext and multimedia components.
- Structural feature extraction for concept learning, extraction and retrieval.
- Automated and semi-automated wrapping methods for information extraction.
- Knowledge integration from heterogeneous collections of documents.
- Theme extraction and document clustering/visualization.
- Access of textual information embedded in Internet images.
- Document image processing for Internet: data compression, color analysis, representation and coding for multiresolution or resolution-independent images.
- Collaborative annotation and manipulation of documents on the Web.
- Authoring, editing and presentation systems for complex multimedia documents.
- Enterprise applications: intranet and workflow management.
- Document analysis and reformatting for multimodal interfaces.
- Web content summarization and repurposing for mobile access.
- Cross-language multi-web-document summarization / knowledge integration.
- Web Security and Image Understanding (CAPTCHAs).
- Data collection and evaluation methods.

Workshop Format
---------------

This will be a one-day single-track event. Participants are expected to give a short description of their work (submitted for review in the form of a short paper) and participate in the discussions. Sessions of short presentations will be followed by discussion sessions in small groups. In addition, all participants will join in a panel discussion session led by experts in the field. Discussions will be focused on specific issues identified as key to promote collaboration between researchers in Document Analysis and Web content technologies. At the end of the workshop the results of the discussions will be summarised.

Publications
------------

The accepted papers will be made available before the workshop.
The essence of the discussions, as captured by individual discussion chairs, will be published on-line, along with the papers, after the workshop. In addition, after the workshop, expanded versions of selected papers will be considered for inclusion in a book.

Submission Information
----------------------

We invite the submission of original, previously unpublished work. Papers should identify current/future needs, open problems and discuss the authors' view of the subject and overall direction. Papers describing work in progress are also encouraged.

We also welcome, with some restrictions, submissions that are closely related to work submitted also to ICDAR2003. Authors can use this workshop as a forum to present work that differs materially from their ICDAR presentations, in any of several ways:

- recent results too late for the ICDAR deadline;
- methodological issues facing the WDA community;
- proposals for community-wide data sets, experiments, competitions, websites etc.

Papers should be submitted via the Web (http://www.csc.liv.ac.uk/~wda2003) in camera-ready format and should not exceed 4 printed pages. The format adopted is that of the IEEE-CS Conference Publications and is the same as that of ICDAR2003. Full details of the formatting instructions, a sample document and templates for LaTeX and MS-Word users can be found at the ICDAR2003 submissions site (http://icdar.csc.liv.ac.uk/ICDAR03/format.html). Acceptable formats are PDF and Postscript.

Important Dates
---------------

Paper-submission due     31 March 2003
Author Notification      14 May 2003
Camera-ready copy due    16 June 2003

Please check the workshop web site at http://www.csc.liv.ac.uk/~wda2003 for more details and the latest update.