Integrating Natural Language Processing Components with XML and XSLT. Edition No. 1
- Published: April 2008
- 352 Pages
- VDM Publishing House
This book describes novel software architectures for the integration of deep and shallow natural language processing (NLP) components in language technology. The generic markup language XML and the XML transformation language XSLT are used for flexible combination of linguistic markup produced by multiple NLP components.
Shallow NLP components such as tokenizers, part-of-speech taggers, named entity recognizers and shallow parsers are combined with a deep parser, operating grammars written in the spirit of the Head-Driven Phrase Structure Grammar (HPSG) theory.
The integration paradigm enables synergy leading to more robust deep parsing with increased coverage.
It also constitutes a division of labor: the deep grammar models general, correct language use, while shallow systems are responsible for domain-specific extensions.
Applications are presented in question answering, information extraction, natural language understanding, ontologies and the Semantic Web.
The book is addressed to software engineers, computational linguists and language technology engineers.
Ulrich Schäfer, Dr.-Ing., Dipl.-Inform., studied Computer Science and Computational Linguistics at Saarland University, Saarbrücken, Germany. Since 2000, he has been working as a Senior Software Engineer at the German Research Center for Artificial Intelligence (DFKI) GmbH, Saarbrücken.