Helmut Schmid
Ludwig Maximilian University Munich
Center for Information and Language Processing
Oettingenstr. 67
D-80538 Munich
room: C112
tel.: +49 89 2180 9715
email: LastName@cis.uni-muenchen.de
 
 
 
Research Interests
  Probabilistic and Symbolic NLP, POS Tagging, Parsing, Finite-State Tools, Computational Morphology, Statistical Machine Translation
 
Teaching
  Angewandtes Programmieren in der Computerlinguistik (Applied Programming in Computational Linguistics)
  SNLP-Tutorium (SNLP Tutorial)
 
Software
  TreeTagger: a tool for automatic annotation of text corpora with part-of-speech and lemma information (POS tagger and lemmatizer).
  RFTagger: a POS tagger for fine-grained POS tagsets.
  SFST: a toolbox for the implementation of morphological analysers and other programs which are based on finite-state transducers.
  SMOR: a German finite-state morphology implemented in the SFST programming language.
  BitPar: an efficient parser for Treebank grammars.
  LoPar: a parser for head-lexicalized probabilistic context-free grammars.
  VPF: a graphical viewer for parse trees and parse forests, including parses with feature structures.
  LSC: statistical clustering software for predicate-argument tuples with a fixed number of arguments.
  PAC: statistical clustering software for predicate-argument tuples with a variable number of arguments. The selectional preferences are generalized by means of a WordNet hierarchy.
 
Publications
 

Nadir Durrani, Helmut Schmid, Alexander Fraser, Philipp Koehn, Hinrich Schütze (2015). The Operation Sequence Model - Combining N-Gram-based and Phrase-based Statistical Machine Translation. Computational Linguistics. 41(2).

Nadir Durrani, Philipp Koehn, Helmut Schmid, Alexander Fraser (2014). Investigating the Usefulness of Generalized Word Representations in SMT. In Proceedings of the 25th International Conference on Computational Linguistics (COLING), Dublin, Ireland.

Thomas Müller, Hinrich Schütze, Helmut Schmid (2013). Efficient Higher-Order CRFs for Morphological Tagging. In Proceedings of EMNLP. Seattle, USA.

Nadir Durrani, Alexander Fraser, Helmut Schmid, Hieu Hoang, Philipp Koehn (2013). Can Markov Models Over Minimal Translation Units Help Phrase-Based SMT? In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL). Sofia, Bulgaria, August, Poster.

Nadir Durrani, Helmut Schmid, Alexander Fraser, Hassan Sajjad, Richárd Farkas (2013). Munich-Edinburgh-Stuttgart Submissions of OSM Systems at WMT13. In Proceedings of the ACL 2013 Eighth Workshop on Statistical Machine Translation. Sofia, Bulgaria, August, Poster.

Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Durrani, Hassan Sajjad and Richárd Farkas (2013). Munich-Edinburgh-Stuttgart Submissions at WMT13: Morphological and Syntactic Processing for SMT. In Proceedings of the ACL 2013 Eighth Workshop on Statistical Machine Translation. Sofia, Bulgaria, August, Poster.

Hassan Sajjad, Svetlana Smekalova, Nadir Durrani, Alexander Fraser, Helmut Schmid (2013). QCRI-MES Submission at WMT13: Using Transliteration Mining to Improve Statistical Machine Translation. In Proceedings of the ACL 2013 Eighth Workshop on Statistical Machine Translation. Sofia, Bulgaria, August, Poster.

Nadir Durrani, Alexander Fraser, Helmut Schmid (2013). Model With Minimal Translation Units, But Decode With Phrases. In Proceedings of the 14th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT), Atlanta, Georgia, USA.

Alexander Fraser, Helmut Schmid, Richárd Farkas, Renjing Wang, Hinrich Schütze (2013). Knowledge Sources for Constituent Parsing of German, a Morphologically Rich and Less-Configurational Language. In Computational Linguistics, vol. 39, no. 1.

Thomas Müller, Hinrich Schütze and Helmut Schmid (2012). A Comparative Investigation of Morphological Language Modeling for the Languages of the European Union. In Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), Montreal, Canada.

Richárd Farkas, Veronika Vincze, Helmut Schmid (2012). Dependency parsing of Hungarian: baseline results and challenges. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Avignon, France.

Richárd Farkas, Helmut Schmid (2012). Forest Reranking through Subtree Ranking. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CONLL), Jeju, Korea.

Hassan Sajjad, Alexander Fraser, Helmut Schmid (2012). A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL), Jeju, Republic of Korea.

Richárd Farkas, Bernd Bohnet, Helmut Schmid (2011). Features for phrase-structure reranking from dependency parses. In Proceedings of the 12th International Conference on Parsing Technologies (IWPT), Dublin, Ireland.

Hassan Sajjad, Nadir Durrani, Helmut Schmid, Alexander Fraser (2011). Comparing Two Techniques for Learning Transliteration Models Using a Parallel Corpus. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand.

Nadir Durrani, Helmut Schmid, Alexander Fraser (2011): A Joint Sequence Translation Model with Integrated Reordering, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT), Portland, Oregon.

Hassan Sajjad, Alexander Fraser, Helmut Schmid (2011): An Algorithm for Unsupervised Transliteration Mining with an Application to Word Alignment, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT), Portland, Oregon.

Nadir Durrani, Hassan Sajjad, Alexander Fraser, Helmut Schmid (2010): Hindi-to-Urdu Machine Translation Through Transliteration. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL), pages 465-474, Uppsala, Sweden.

Fabienne Fritzinger, Max Kisselew, Ulrich Heid, Andreas Madsack, Helmut Schmid (2009): Werkzeuge zur Extraktion von signifikanten Wortpaaren als Web Service, in Wolfgang Hoeppner, editor, GSCL-Symposium Sprachtechnologie und eHumanities, Technischer Bericht Nr. 2009-01 Duisburg, Germany.

Hassan Sajjad, Helmut Schmid (2009): Tagging Urdu Text with Parts of Speech: A Tagger Comparison, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL). Athens, Greece.

Wiebke Wagner, Helmut Schmid, Sabine Schulte im Walde (2009): Verb Sense Disambiguation using a Predicate-Argument-Clustering Model, Proceedings of the CogSci Workshop on Distributional Semantics beyond Concrete Concepts. Amsterdam, The Netherlands, July 2009.

Helmut Schmid, Florian Laws (2008): Estimation of Conditional Probabilities with Decision Trees and an Application to Fine-Grained POS Tagging, COLING 2008, Manchester, Great Britain.

Sabine Schulte im Walde, Christian Hying, Christian Scheible, Helmut Schmid (2008): Combining EM Training and the MDL Principle for an Automatic Verb Classification Incorporating Selectional Preferences, ACL-HLT 2008, Columbus, Ohio.

Helmut Schmid, Bernd Möbius, Julia Weidenkaff (2007): Tagging Syllable Boundaries With Joint N-Gram Models, Interspeech 2007, Antwerp, Belgium.

Vera Demberg, Helmut Schmid, Gregor Möhler (2007): Phonological Constraints and Morphological Preprocessing for Grapheme-to-Phoneme Conversion, Proceedings of ACL 2007, Prague, Czech Republic.

Helmut Schmid (2006): Trace Prediction and Recovery With Unlexicalized PCFGs and Slash Features, Proceedings of COLING-ACL 2006, Sydney, Australia.

Helmut Schmid (2005): Disambiguation of Morphological Structure Using a PCFG, Proceedings of the Human Language Technology Conference and the Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), Vancouver, Canada.

Helmut Schmid (2005): A Programming Language for Finite State Transducers, Proceedings of the 5th International Workshop on Finite State Methods in Natural Language Processing (FSMNLP 2005), Helsinki, Finland.

Helmut Schmid, Michaela Atterer (2004): New Statistical Methods for Phrase Break Prediction, Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland.

Helmut Schmid (2004): Efficient Parsing of Highly Ambiguous Context-Free Grammars with Bit Vectors, Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland.

Helmut Schmid, Arne Fitschen, Ulrich Heid (2004): SMOR: A German Computational Morphology Covering Derivation, Composition, and Inflection, Proceedings of the IVth International Conference on Language Resources and Evaluation (LREC 2004), p. 1263-1266, Lisbon, Portugal.

Helmut Schmid (2002): Lexicalization of Probabilistic Grammars. Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), Taipei, Taiwan.

Helmut Schmid (2002): A Generative Probability Model for Unification-Based Grammars. Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), Taipei, Taiwan.

Helmut Schmid, Mats Rooth (2001): Parse Forest Computation of Expected Governors. Proceedings of the 39th Annual Meeting of the ACL (ACL 2001), Toulouse, France.

Helmut Schmid, Sabine Schulte im Walde (2000): Robust German Noun Chunking With a Probabilistic Context-Free Grammar. Proceedings of the 18th International Conference on Computational Linguistics (COLING 2000), August 2000.

Helmut Schmid (2000): LoPar: Design and Implementation. Arbeitspapiere des Sonderforschungsbereiches 340, No. 149, IMS Stuttgart, July 2000. (25 pages)

Helmut Schmid (2000): Unsupervised Learning of Period Disambiguation for Tokenisation. Internal Report, IMS, University of Stuttgart, May 2000. (16 pages)

Helmut Schmid (2000): YAP - Parsing and Disambiguation With Feature-Based Grammars. Ph.D. thesis, University of Stuttgart, January 2000, AIMS report 6(1). (197 pages)

Helmut Schmid (1997): Parsing by Successive Approximation. Proceedings of the International Workshop on Parsing Technologies (IWPT '97). Boston, USA.

Helmut Schmid (1995): Improvements in Part-of-Speech Tagging with an Application to German. Proceedings of the ACL SIGDAT-Workshop. Dublin, Ireland.

Helmut Schmid (1994): Probabilistic Part-of-Speech Tagging Using Decision Trees. Proceedings of the International Conference on New Methods in Language Processing, Manchester, UK.

Helmut Schmid (1994): Part-of-Speech Tagging with Neural Networks. Proceedings of the 15th International Conference on Computational Linguistics (COLING-94).