Temporal Expressions in Polish Corpus KPWr

This article presents the results of recent research on the interpretation of Polish expressions that refer to time. These expressions indicate when something happens, how often it occurs, or how long it lasts. Temporal information, which can be extracted from text automatically, plays a significant role in many information extraction systems, such as question answering, discourse analysis, event recognition and many more. We prepared PLIMEX, a broad description of Polish temporal expressions with annotation guidelines, based on state-of-the-art solutions for English, mainly the TimeML specification. We also adapted a solution, called LTIMEX, that captures the local semantics of temporal expressions. The temporal description also supports further event identification and extends the event description model, focusing on anchoring events in time, ordering events, and reasoning about the persistence of events. We prepared a specification designed to address these issues, and we annotated all documents in the Polish Corpus of Wrocław University of Technology (KPWr) using our annotation guidelines.
Year:
2015
Type of Publication:
Article
Keywords:
temporal expressions; machine learning; information extraction; TIMEX; TimeML; corpus annotation; corpus
Journal:
Cognitive Studies | Études cognitives
Volume:
15
Pages:
293-317