WDA2003 Call for Participation
         Second International Workshop on Web Document Analysis

                           August 3, 2003
                        Edinburgh, Scotland, UK
                      (co-located with ICDAR2003)

                   http://www.csc.liv.ac.uk/~wda2003
                     email: wda2003@csc.liv.ac.uk

                 ** Short Papers Due March 31, 2003 **


                         CALL FOR PARTICIPATION


Background
----------
With the ever-increasing use of the Web, a growing number of documents are
published and accessed on-line. This shift poses new challenges for Document
Analysis, and further discussion is needed to identify the role of Document
Analysis in Web applications.

While there has been active research on Web Content Extraction using
text-based techniques, documents are in fact 2-dimensional entities, and
often include multimedia content. Hence, techniques that have been developed
for image-based documents could prove valuable in the realm of Web
documents, and new methods for the analysis of multimedia content will be
required.

Following the success of the First WDA workshop (WDA2001) in Seattle, USA,
the series continues with WDA2003 in Edinburgh, UK. The aim of the workshop
is to bring together researchers from the Document Analysis community and
from Web communities such as Web Content Extraction, Web Publishing, Digital
Libraries and e-Commerce Security, to share experiences and discuss possible
avenues for future collaboration.


Technical Focus
---------------
This workshop is intended as a forum for discussing emerging issues in
document analysis in Web environments. Special attention will be given to
new applications and requirements created by opportunities on the Internet
in the area of multimedia document analysis and management. We invite
contributions in areas including, but not limited to, the following:

- Layout analysis of Web documents and its applications to content
  extraction, multimodal access and Web mining.

- Document understanding and semantic tagging for web services.

- Digital document models: homogeneous representation of structured
  documents, hypertext and multimedia components.

- Structural feature extraction for concept learning, extraction and
  retrieval.

- Automated and semi-automated wrapping methods for information extraction.

- Knowledge integration from heterogeneous collections of documents.

- Theme extraction and document clustering/visualization.

- Access to textual information embedded in Internet images.

- Document image processing for the Internet: data compression, color
  analysis, representation and coding for multiresolution or
  resolution-independent images.

- Collaborative annotation and manipulation of documents on the Web.

- Authoring, editing and presentation systems for complex multimedia
  documents.

- Enterprise applications: intranet and workflow management.

- Document analysis and reformatting for multimodal interfaces.

- Web content summarization and repurposing for mobile access.

- Cross-language multi-web-document summarization / knowledge integration.

- Web Security and Image Understanding (CAPTCHAs).

- Data collection and evaluation methods.


Workshop Format
---------------
This will be a one-day single-track event. Participants are expected to give
a short description of their work (submitted for review in the form of a
short paper) and participate in the discussions.

Sessions of short presentations will be followed by discussion sessions in
small groups. In addition, all participants will join in a panel discussion
session led by experts in the field. Discussions will be focused on specific
issues identified as key to promoting collaboration between researchers in
Document Analysis and Web content technologies. At the end of the workshop
the results of the discussions will be summarised.


Publications
------------
The accepted papers will be made available before the workshop. The essence
of the discussions, as captured by individual discussion chairs, will be
published on-line, along with the papers, after the workshop. In addition,
after the workshop, expanded versions of selected papers will be considered
for inclusion in a book.


Submission Information
----------------------
We invite the submission of original, previously unpublished work. Papers
should identify current and future needs and open problems, and discuss the
authors' view of the subject and its overall direction. Papers describing
work in progress are also encouraged.

We also welcome, with some restrictions, submissions that are closely
related to work also submitted to ICDAR2003. Authors can use this workshop
as a forum to present work that differs materially from their ICDAR
presentations, in any of several ways:

- recent results obtained too late for the ICDAR deadline;

- methodological issues facing the WDA community;

- proposals for community-wide data sets, experiments, competitions,
  websites, etc.

Papers should be submitted via the Web (http://www.csc.liv.ac.uk/~wda2003)
in camera-ready format and should not exceed 4 printed pages. The format
adopted is that of the IEEE-CS Conference Publications and is the same as
that of ICDAR2003.

Full details of the formatting instructions, a sample document and templates
for LaTeX and MS-Word users can be found at the ICDAR2003 submissions site
(http://icdar.csc.liv.ac.uk/ICDAR03/format.html). Acceptable formats are PDF
and PostScript.

Important Dates
---------------
Paper submission due   31 March 2003
Author notification    14 May 2003
Camera-ready copy due  16 June 2003


Please check the workshop web site at http://www.csc.liv.ac.uk/~wda2003 for
more details and the latest updates.