Traditional thematic role systems employ discrete categories like AGENT and PATIENT to characterize participant involvement in events. Following Dowty’s (1991) proto-role theory, this project decomposes these coarse categories into fine-grained semantic properties that capture dimensions of meaning such as volition, sentience, causation, and change of state.

The annotation framework operationalizes proto-role properties through simple binary questions about event participants. For example, rather than determining if an argument is an AGENT, annotators answer specific questions about whether the participant wanted the event to happen, was aware of the event, or caused the event. These responses are aggregated to produce scalar values representing the degree to which each property applies.

The datasets include annotations for both Penn TreeBank and Universal Dependencies corpora, with properties annotated for all predicate-argument pairs identified by PredPatt. The continuous-valued nature of the annotations enables modeling graded semantic phenomena and soft clustering of semantic roles.

Computational models trained on this data demonstrate that proto-role properties provide strong predictive signal for syntactic realization patterns, supporting theoretical claims about the syntax-semantics interface. The framework has been extended to model lexical semantic relationships and selectional preferences.

Datasets

Publications

  • Reisinger, Dee Ann, Rachel Rudinger, Francis Ferraro, Craig Harman, Kyle Rawlins, and Benjamin Van Durme. 2015. Semantic Proto-roles. Transactions of the Association of Computational Linguistics 3: 475-488.

  • White, Aaron Steven, Dee Ann Reisinger, Keisuke Sakaguchi, Tim Vieira, Sheng Zhang, Rachel Rudinger, Kyle Rawlins, and Benjamin Van Durme. 2016. Universal Decompositional Semantics on Universal Dependencies. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 1713–1723. Austin, Texas: Association for Computational Linguistics.

  • White, Aaron Steven, Kyle Rawlins, and Benjamin Van Durme. 2017. The Semantic Proto-Role Linking Model. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, 92–98. Valencia, Spain: Association for Computational Linguistics.

  • Teichert, Adam, Adam Poliak, Benjamin Van Durme, and Matthew Gormley. 2017. Semantic Proto-Role Labeling. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17), 4459-4465.

  • Rudinger, Rachel, Adam Teichert, Ryan Culkin, Sheng Zhang, and Benjamin Van Durme. 2018. Neural-Davidsonian Semantic Proto-role Labeling. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 944–955. Brussels, Belgium: Association for Computational Linguistics.