=============================================
  Document Recognition and Retrieval XI:
  Call for Papers and Announcement
=============================================

Please email questions to 


This conference is part of the IS&T/SPIE's International Symposium on
Electronic Imaging 2004, 18-22 January 2004, San Jose Marriott & San Jose Convention
Center, San Jose, California USA

Conference Chairs: Elisa H. Barney Smith, Boise State Univ.; Jianying Hu, Avaya Labs
Research; James Allan, Univ. of Massachusetts/Amherst.

Program Committee: Apostolos Antonacopoulos, Univ. of Liverpool (United Kingdom); Jamie
Callan, Carnegie Mellon Univ.; Francine R. Chen,
Palo Alto Research Ctr.; Xiaoqing Ding, Tsinghua Univ. (China); David S. Doermann, Univ.
of Maryland/College Park; Hiromichi Fujisawa, Hitachi,
Ltd. (Japan); David Grossman, Illinois Institute of Technology; Alexander G. Hauptmann,
Carnegie Mellon Univ.; Matthew F. Hurst, Intelliseek,
Inc.; Paul B. Kantor, Rutgers Univ.; Tapas Kanungo, IBM Almaden Research Ctr.; Daniel P.
Lopresti, Palo Alto Research Ctr.; Thomas A. Nartker,
Univ. of Nevada/Las Vegas; Kris Popat, Palo Alto Research Ctr.; Sargur N. Srihari, Univ.
at Buffalo; Kazem Taghva, Univ. of Nevada/Las Vegas;
George R. Thoma, National Library of Medicine; Karl Tombre, LORIA-INPL (France); Marcel
Worring, Univ. van Amsterdam (Netherlands);
Berrin A. Yanikoglu, Sabanci Univ. (Turkey); Jiangying Zhou, Summus Ltd.

The fields of document recognition and retrieval have grown rapidly in recent years.
This development has been fueled by rising accuracy rates for omnifont and handprint
optical character recognition (OCR), decreasing costs for the computational power needed
to run such sophisticated algorithms, and the emergence of new application areas such as
the World Wide Web (WWW), digital libraries, and video- and camera-based OCR. The use of
OCR is spreading from high-volume, niche domains to more general tasks, including the
processing of noisy "real-world" documents, photocopies, and faxes.

Beyond OCR, document recognition includes the recovery of a document's logical structure
and format. This encompasses decomposing a document into its various fundamental
components (sentences, paragraphs, figures, tables, etc.), tagging these units, and then
determining a higher-level structure for the document as a whole. Advanced machine
learning techniques may allow to fully recover the structure of tables and equations and
thus understand their content, or the conversion of line drawings from raster to a
vector format where the resulting graphical objects are endowed with semantic meaning.
Syntactic representation of logical structure (e.g. using grammars) and syntax-directed
recognition is another important area where research contributions are solicited.

One primary reason for digitizing existing paper materials is, of course, to simplify
retrieval and organization of information. Therefore we are particularly interested in
papers which address any of the following issues: (1) retrieval in the face of corrupted
readings of the terms in a document; (2) retrieval based on sketches, images, tables,
diagrams or other non-linguistic objects that appear in the document; (3) retrieval
based on text appearing with non-standard alignment, in images or graphics; (4)
recognition and tagging of mathematical arrays and equations which serve as indicators
of subject content or methodology used in the document; (5) novel methods for retrieval
and organization of information based on text or other information in a document. Papers
addressing retrieval-specific issues are encouraged to use a standard methodology from
either statistics (such as the ROC representation) or IR (such as precision versus
recall) to assess the effectiveness of proposed techniques against the endpoint goal of
correct recognition and retrieval of the entire document, or a section thereof.

Papers are solicited in the following areas:

Recognition
* algorithms and systems for machine-printed and handwritten character and word
recognition, especially for degraded documents (e.g., faxes or old/historical documents)
* character and word segmentation techniques
* identification and analysis of tables or equations
* page segmentation, including hierarchical decomposition of documents into text
regions, colored/textured background, halftones, line-art, etc.
* logical structure analysis, linguistic representation of structure and syntax-directed
recognition of logical structure
* raster-to-vector conversion of line-art, maps, and technical drawings
* filtering and enhancement techniques for document images
* document image compression
* document degradation models
* video- and camera-based OCR
* applications of document recognition to the WWW and digital libraries
* techniques to support spoken language access to document text (audio browsing of
document databases)
* multilingual character recognition
* other topics relating to document analysis and character recognition.

Retrieval
* impact of recognition accuracy on retrieval effectiveness
* recovery and use of logical structure for retrieval
* relevance feedback techniques for document retrieval
* cross-language and multi-lingual retrieval
* categorization of text documents and imaged documents
* summarization of text documents and imaged documents
* keyword spotting in document images
* approximate string matching algorithms for OCR text
* non-textual retrieval methods
* image and multimedia search
* interfaces for retrieval
* benchmarking and evaluation issues
* other topics relating to the retrieval of documents and document images.

Note: submissions to Document Recognition and Retrieval XI should be abbreviated papers
(4-6 pages). The paper should informative and address the following questions: i) What
is the paper about? ii) What is the original contribution? iii) What is the most closely
related work by others and how does this work differ? iv) How can others make use of
this work? v) What are the main experimental/theoretical results? Full papers (10-12
pages) will be needed for the final proceedings.

While June 23 is the official due date, late submissions may be permitted depending on
the program, at the discretion of the Conference Chairs.

For more information and submission instructions, please see:
http://electronicimaging.org/call/03/conferences/index.cfm?fuseaction=EI11

Abbreviated papers (4-6 pages) Due Date: 23 June 2003
Manuscript (10-12 pages) Due Date: 27 October 2003
Final Summary (500 word abstract for program book) Due Date: 17 November 2003

Proceedings of this conference will be published and available at the meeting.  The
Abstract and Manuscript due dates must be strictly observed.

Submissions imply the intent of at least one author to register, attend the symposium,
and present the paper (either orally or in poster format).

Your Abbreviated papers must include all of the following:

     1.   The phrase "SUBMIT TO: EI11,  BARNEYSMITH, HU, ALLAN"

     2.   SUBMIT ABSTRACT TO THIS CONFERENCE ONLY:
          Document Recognition and Retrieval XI (EI11)

     3.   PAPER TITLE

     4.   AUTHOR LISTING (principal author first) -- for each author:
          First (given) name (initials not acceptable), Last (family)nor other
information in a document. P
          name, Affiliation, Mailing address, Telephone, Fax, and
          Email address.

     5.   PRESENTATION
          Indicate your preference for "Oral Presentation" or "Poster
          Presentation." Final placement is subject to chairs' discretion.

     6.   BRIEF BIOGRAPHY ( principal/presenting author)
          Approximately 50 words.

     7.  ABBREVIATED PAPER TEXT
          6-pages/2,500 words maximum

     8.   KEYWORDS
          List a maximum of five keywords.

Conditions of Acceptance:

*   Authors are expected to secure registration fees and travel and
    accommodation funding, independent of IS&T/SPIE, through their
    sponsoring organizations before submitting abstracts.

*   Only original material should be submitted.

*   Commercial papers, descriptions of papers with no
    research/development content, and papers where supporting data or a
    technical description cannot be given for proprietary reasons will not
    be accepted for presentation in this symposium.

*   Abstracts should contain enough detail to clearly convey the approach
    and the results of the research.

*   Government and company clearance to present and publish should be
    final at the time of submittal. Authors are required to warrant to
    SPIE/IS&T in advance of publication of the Proceedings that all
    necessary permissions and clearances have been obtained, and that
    submitting authors are authorized to transfer copyright of the
    paper to SPIE/IS&T.

*   Applicants will be notified of acceptance by mail no later than 14
    October 2003. Early notification of acceptance will be placed on
    the Web site the week of 29 September 2003 at
    www.electronicimaging.org

Paper Review:

To ensure a high-quality conference, all abstracts and Proceedings
manuscripts will be reviewed by the Conference Chair/Editor for
technical merit and suitability of content. Conference Chair/Editors
may require manuscript revision before approving publication, and
reserve the right to reject for presentation or publication any paper
that does not meet content or presentation expectations. SPIE and
IS&T's decision on whether to publish manuscripts is final.

Instructions for Submitting Abstracts:

All authors are STRONGLY ENCOURAGED to submit their abstracts by the
due date using the "submit an abstract" link on the Web at:
www.electronicimaging.org Submitting directly on the web ensures that
your abstract will be immediately accessible to the conference chair
for review.

If you do not have web access, you may E-MAIL each abstract separately to:
abstracts@spie.org in ASCII text (not encoded) format. IMPORTANT: to ensure
receipt and proper processing of your abstract, the Subject line must
include only the following:

    SUBJECT: EI11,  BARNEYSMITH, HU, ALLAN


Final Summary:

Accepted authors will receive instructions for submission of the
200-word Final Summary in their author kit. The Final Summaries will
be published and available at the meeting.

Oral or Poster Presentation:

Instructions for Oral and Poster presentations will be included in
your author kit.  All Oral and Poster presentations are included in
the Proceedings and require a manuscript.

Proceedings:

These conferences will result in full-manuscript Chair/Editor-reviewed
volumes published in the Proceedings. Correctly formatted,
ready-to-print manuscripts are required from all accepted
authors. Electronic submissions are highly preferred, and result in
higher quality reproduction. Submission must be made in PostScript
created with a printer driver compatible with SPIE's online Electronic
Manuscript Submission system. Manuscripts must be submitted in English
by 27 October 2003. Authors are required to transfer copyright of the
manuscript to SPIE/IS&T or to provide a suitable publication
license. Papers published are indexed in leading scientific databases
including INSPEC, Compendex Plus, Physics Abstracts, Chemical
Abstracts, International Aerospace Abstracts, and Index to Scientific
and Technical Proceedings.

Publishing Policy:

Manuscript due dates must be strictly observed. Late manuscripts may
not be published in the Proceedings, whether the conference volume
will be published before or after the meeting. The objective of this
policy is to better serve the conference participants as well as the
technical community at large, by enabling timely publication of the
Proceedings. Papers not presented at the meeting will not be published
in the conference Proceedings, except in the case of exceptional
circumstances at the discretion of SPIE, IS&T, and the Conference
Chair/Editors.

Participant Registration Fee:

Authors, coauthors, program committee members, and session chairs are
accorded a reduced symposium registration fee.  If you are a current
IS&T or SPIE Member you will enjoy an additional discount on your
symposium registration fee and on educational short courses.