Monday, November 10, 2008

OPEN SOURCE TOOLS FOR MACHINE TRANSLATION

The Third Machine Translation Marathon, which will take place January 26-31 in Prague, Czech Republic, is hosting a Open Source Convention to advance the state of the art in machine translation.

We invite developers of open source tools to present their work and submit a paper of up to 10 pages that (a) describes the underlying methodology and (b) instructions how to use the tools. We are looking for stand-alone tools and extensions of existing tools, such as the Moses open source systems. Accepted papers will be presented during the MT Marathon and published as a special issue of the Prague Bulletin of Mathematical Linguistics (http://ufal.mff.cuni.cz/pbml).

Possible topics:

  • training of machine translation models
  • machine translation decoders
  • tuning of machine translation systems
  • evaluation of machine translation
  • visualization, annotation or debugging tools
  • tools for human translators
  • interfaces for web-based services or APIs
  • extensions of existing tools
  • other tools for machine translation
Program will be selected from papers submitted by a program committee that will ask additional reviewers from the PBML's editorial board for at least two reviews per paper.

Important dates:

Deadline for paper submission: December 12, 2008
Notification of acceptance: December 19, 2008
Camera-ready paper due: January 8, 2009
Presentations: January 26-30, 2009 (at the MT Marathon in Prague)

Please send non-anonymized full-version submissions in .pdf format to Philipp Koehn . Please use the PBML style files from http://ufal.mff.cuni.cz/pbml-instructions.html (follow the "short paper" track instructions) or download the style package directly from http://ufal.mff.cuni.cz/pbml-style-standard.html.

Program Committee

Philipp Koehn (chair)
Chris Callison-Bruch
Jan Hajic

About the MT Marathon

MT Marathon is organized yearly by the EuroMatrix machine translation research project funded by the European Union under its "Cooperation programme" as a STREP project FP6-IST-5-034291-STP. In January 2009, it will be third MT Marathon organized by EuroMatrix. MT Marathon consists of several events taking place at the same place to allow for free flow of thoughts and exchange of information and experience: a "spring school" (this time more like a "winter school") with associated lab lessons, invited research talks, and a hands-on experience with Open Source MT tools. Participants will also experience evaluating Machine Translation systems (with some hands-on experience in actual subjective evaluation of MT systems taking part in the WMT 2009 competition - see more at http://www.statmt.org/wmt09/). This year, talks presenting some of the available OpenSource tools in more detail will also be planned throughout the week (see the call for papers). Please find more about the current MT Marathon and the previous ones at http://ufal.mff.cuni.cz/euromatrix/mtmarathon.

About Euromatrix

The EuroMatrix project (http://www.euromatrix.net) aims at a major push in machine translation (MT) technology applying the most advanced MT technologies systematically to all pairs of EU languages. Special attention is being paid to the languages of the new and near-term prospective member states. As part of this application development, EuroMatrix designs and investigates novel combinations of statistical techniques and linguistic knowledge sources as well as hybrid MT architectures. EuroMatrix addresses urgent European economic and social needs by concentrating on European languages and on high-quality translation to be employed for the publication of technical, social, legal and political documents. EuroMatrix aims at enriching the statistical MT approach with novel learning paradigms and experiment with new combinations of methods and resources from statistical MT,rule-based MT, shallow language processing and computational lexicography/morphology.

The main objectives of the project are:

  • Translation systems for all pairs of EU languages, with a special focus on the languages of new and near-term prospective member states
  • Efficient inclusion of linguistic knowledge into statistical machine translation
  • The development and testing of hybrid architectures for the integration of rule-based and statistical approaches
  • Organization, analysis and interpretation of a competitive annual international evaluation of machine translation with a strong focus on European economic and social needs
  • The provision of open source machine translation technology including research tools, software and data
  • A systematically compiled and constantly updated detailed survey of the state of MT technology for all EU language pairs based on the developed systematic translation between all EU languages, the comparative MT evaluations and an inventory of available and needed tools, components, lingware and data.

No comments:

Post a Comment

You can leave a comment here using your Google account, OpenID or as an anonymous user.

Popular Posts