Semantic Role Labelling

Article Info

Contributed by
1 author

Last updated on
2020-01-10 08:42:21

Article Versions

Version 3, 2020-01-10 08:42:21, by arvindpdmn: Completing article.
Version 2, 2020-01-09 05:04:34, by arvindpdmn: Completed Summary. Work in progress.
Version 1, 2019-12-29 04:55:32, by arvindpdmn: First version with some questions. Content pending.

For the verb 'loaded', semantic roles of other words and phrases in the sentence are identified. Source: Lascarides 2019, slide 10.

In linguistics, the predicate refers to the main verb in a sentence. A predicate takes arguments. The role of Semantic Role Labelling (SRL) is to determine how these arguments are semantically related to the predicate.

Consider the sentence "Mary loaded the truck with hay at the depot on Friday". 'Loaded' is the predicate. Mary, truck and hay have respective semantic roles of loader, bearer and cargo. We can identify additional roles of location (depot) and time (Friday). The job of SRL is to identify these roles so that downstream NLP tasks can "understand" the sentence.
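As a minimal sketch, the labelled output for this sentence could be held in a small structure like the one below. The class name `RoleLabelling` and the helper `answer` are hypothetical, chosen only for illustration; they are not the API of any SRL toolkit.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class RoleLabelling:
    """One predicate and the roles of its arguments (hypothetical container)."""
    predicate: str
    roles: Dict[str, str] = field(default_factory=dict)


# Roles for "Mary loaded the truck with hay at the depot on Friday",
# written by hand to mirror the example in the text.
labelling = RoleLabelling(
    predicate="loaded",
    roles={
        "loader": "Mary",
        "bearer": "the truck",
        "cargo": "hay",
        "location": "at the depot",
        "time": "on Friday",
    },
)


def answer(labelling: RoleLabelling, role: str) -> Optional[str]:
    """Look up the phrase filling a role, or None if the role is absent."""
    return labelling.roles.get(role)
```

A downstream task could then ask, for instance, `answer(labelling, "cargo")` to find out what was loaded.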

SRL is also known by other names such as thematic role labelling, case role assignment, or shallow semantic parsing.

Discussion

Why do we need semantic role labelling when there's already parsing?
A TreeBanked sentence also PropBanked with semantic role labels. Source: Palmer 2013, slide 6.
Often an idea can be expressed in multiple ways. Consider these sentences that all mean the same thing: "Yesterday, Kristina hit Scott with a baseball"; "Scott was hit by Kristina yesterday with a baseball"; "With a baseball, Kristina hit Scott yesterday"; "Kristina hit Scott with a baseball yesterday".
Either constituent or dependency parsing will analyze these sentences syntactically. But syntactic relations don't necessarily help in determining semantic roles. One way to understand SRL is via an analogy. In image captioning, we extract main objects in the picture, how they are related and the background scene. This is precisely what SRL does but from unstructured input text. Such an understanding goes beyond syntax.
However, parsing is not completely useless for SRL. In a traditional SRL pipeline, a parse tree helps in identifying the predicate arguments.
But SRL performance can be impacted if the parse tree is wrong. This has motivated SRL approaches that completely ignore syntax. However, many research papers through the 2010s have shown how syntax can be effectively used to achieve state-of-the-art SRL.
What are some applications of SRL?
SRL is helpful for question answering. Source: Yih and Toutanova 2006, slide 2.
SRL is useful in any NLP application that requires semantic understanding: machine translation, information extraction, text summarization, question answering, and more. For example, predicates and heads of roles help in document summarization. For information extraction, SRL can be used to construct extraction rules.
SRL can be seen as answering "who did what to whom". Obtaining semantic information thus benefits many downstream NLP tasks such as question answering, dialogue systems, machine reading, machine translation, text-to-scene generation, and social network analysis.
Historically, early applications of SRL include Wilks (1973) for machine translation; Hendrix et al. (1973) for question answering; Nash-Webber (1975) for spoken language understanding; and Bobrow et al. (1977) for dialogue systems.
Which are the essential roles used in SRL?
Thematic roles with examples. Source: Jurafsky 2015, slide 10.
One of the oldest models is thematic roles, which date back to Pāṇini around the 4th century BC. Roles are assigned to subjects and objects in a sentence, and are based on the type of event. For example, if the verb is 'breaking', roles would be breaker and broken thing for subject and object respectively. Some examples of thematic roles are agent, experiencer, result, content, instrument, and source. There's no well-defined universal set of thematic roles.
A modern alternative from 1991 is proto-roles that defines only two roles: Proto-Agent and Proto-Patient. Using heuristic features, algorithms can say if an argument is more agent-like (intentionality, volitionality, causality, etc.) or patient-like (undergoing change, affected by, etc.).
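The idea of deciding whether an argument is more agent-like or patient-like can be sketched as a simple property count. The property names below are illustrative labels loosely based on the features mentioned above; they are an assumption for this sketch and not an established inventory, and real systems would learn such decisions from data.

```python
# Hypothetical property labels for each proto-role.
PROTO_AGENT = {"volitional", "sentient", "causes_change", "moves"}
PROTO_PATIENT = {"changes_state", "causally_affected", "stationary"}


def proto_role(properties):
    """Say which proto-role an argument resembles more, by counting properties."""
    agent_score = len(properties & PROTO_AGENT)
    patient_score = len(properties & PROTO_PATIENT)
    if agent_score > patient_score:
        return "Proto-Agent"
    if patient_score > agent_score:
        return "Proto-Patient"
    return "undetermined"


# "Kristina hit Scott": hand-assigned properties for each argument.
kristina = proto_role({"volitional", "sentient", "causes_change"})
scott = proto_role({"sentient", "causally_affected", "changes_state"})
```

Note that an argument can carry properties of both kinds (Scott is sentient but also affected), which is why the comparison is a matter of degree.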
How are VerbNet, PropBank and FrameNet relevant to SRL?
Comparing PropBank and FrameNet representations. Source: Jurafsky 2015, slide 37.
Verbs can realize semantic roles of their arguments in multiple ways. This is called verb alternations or diathesis alternations. Consider "Doris gave the book to Cary" and "Doris gave Cary the book". The verb 'gave' realizes THEME (the book) and GOAL (Cary) in two different ways. VerbNet is a resource that groups verbs into semantic classes and their alternations.
PropBank contains sentences annotated with proto-roles and verb-specific semantic roles. Arguments to verbs are simply named Arg0, Arg1, etc. Typically, Arg0 is the Proto-Agent and Arg1 is the Proto-Patient. Being also verb-specific, PropBank records roles for each sense of the verb. For example, for the word sense 'agree.01', Arg0 is the Agreer, Arg1 is Proposition, and Arg2 is other entity agreeing.
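To make the numbered-argument convention concrete, here is a minimal sketch of a roleset lookup. The dictionary is written by hand from the 'agree.01' example above and is not loaded from the actual PropBank files; the function name `describe` and the labelled example sentence are assumptions for illustration.

```python
# Hand-written roleset mirroring the 'agree.01' example in the text.
ROLESETS = {
    "agree.01": {
        "Arg0": "Agreer",
        "Arg1": "Proposition",
        "Arg2": "Other entity agreeing",
    },
}


def describe(roleset_id, labelled_args):
    """Replace numbered argument labels with verb-specific descriptions."""
    roles = ROLESETS[roleset_id]
    # Labels missing from the roleset (e.g. modifiers) are kept unchanged.
    return {roles.get(arg, arg): text for arg, text in labelled_args.items()}


# Illustrative labelling of "John agrees with Mary on everything".
example = describe(
    "agree.01",
    {"Arg0": "John", "Arg2": "with Mary", "Arg1": "on everything"},
)
```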
An idea can be expressed with similar words such as increased (verb), rose (verb), or rise (noun). PropBank may not handle this very well. FrameNet is another lexical resource, defined in terms of frames rather than verbs. For every frame, core roles and non-core roles are defined. Frames can inherit from or causally link to other frames.
What's the typical SRL processing pipeline?
SRL involves predicate identification, predicate disambiguation, argument identification, and argument classification.
Argument identification is aided by full parse trees. However, in some domains such as biomedical, full parse trees may not be available. In such cases, chunking is used instead.
When a full parse is available, pruning is an important step. Using heuristic rules, we can discard constituents that are unlikely arguments. In fact, full parsing contributes most in the pruning step. Pruning is a recursive process.
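One well-known pruning heuristic, described by Xue and Palmer (2004), collects the sisters of the predicate, then moves up to the parent and repeats until the root is reached; children of prepositional-phrase sisters are collected too. The sketch below is a simplified version of that idea (it ignores coordination), and the `Node` class and toy tree are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)
class Node:
    """Minimal constituent-tree node (hypothetical, for this sketch only)."""
    label: str
    text: str = ""
    children: List["Node"] = field(default_factory=list)
    parent: Optional["Node"] = field(default=None, repr=False)

    def __post_init__(self):
        # Give every child a pointer back to its parent.
        for child in self.children:
            child.parent = self


def prune(node: Node) -> List[Node]:
    """Recursively collect candidate arguments on the path from node to root."""
    if node.parent is None:
        return []
    candidates = []
    for sister in node.parent.children:
        if sister is node:
            continue
        candidates.append(sister)
        if sister.label == "PP":
            # For prepositional phrases, also keep the immediate children.
            candidates.extend(sister.children)
    return candidates + prune(node.parent)


# Toy parse of "Mary loaded the truck with hay".
verb = Node("VBD", "loaded")
tree = Node("S", children=[
    Node("NP", "Mary"),
    Node("VP", children=[
        verb,
        Node("NP", "the truck"),
        Node("PP", "with hay", children=[Node("IN", "with"), Node("NP", "hay")]),
    ]),
])
candidates = [(n.label, n.text) for n in prune(verb)]
```

Constituents that are not on this path, such as those nested deep inside unrelated clauses, never become candidates, which is what reduces the work for the later classification steps.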
If each argument is classified independently, we ignore interactions among arguments. A better approach is to assign multiple possible labels to each argument. Then we can use global context to select the final labels. This step is called reranking.
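The difference between independent and joint labelling can be sketched as follows. This is not a trained reranker: a brute-force search with a single global constraint (no core role used twice) stands in for it, and the scores are made-up numbers for illustration.

```python
from itertools import product


def independent_labelling(candidates):
    """Pick the top-scoring label for each argument on its own."""
    return {span: max(scores, key=scores.get) for span, scores in candidates.items()}


def joint_labelling(candidates, unique_roles=("Arg0", "Arg1", "Arg2")):
    """Pick the best-scoring combination in which no core role repeats."""
    spans = list(candidates)
    best, best_score = None, float("-inf")
    for combo in product(*(candidates[span].items() for span in spans)):
        labels = [label for label, _ in combo]
        if any(labels.count(role) > 1 for role in unique_roles):
            continue  # violates the global constraint
        score = sum(value for _, value in combo)
        if score > best_score:
            best, best_score = dict(zip(spans, labels)), score
    return best


# Hypothetical classifier scores for "Mary loaded the truck with hay".
candidates = {
    "Mary": {"Arg0": 0.9, "Arg1": 0.1},
    "the truck": {"Arg0": 0.5, "Arg1": 0.45},
    "hay": {"Arg1": 0.5, "Arg2": 0.4},
}
local = independent_labelling(candidates)
reranked = joint_labelling(candidates)
```

Here the independent choice assigns Arg0 to both "Mary" and "the truck", while the joint choice resolves the conflict by looking at all arguments together.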
Which are the main approaches to SRL?
Early SRL systems were rule-based, with rules derived from grammar. From the mid-1990s, statistical approaches became popular, helped by FrameNet and PropBank, which provided training data. Classifiers could be trained from feature sets. A set of features might include the predicate, constituent phrase type, head word and its POS, predicate-constituent path, voice (active/passive), constituent position (before/after predicate), and so on.
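A few of these features can be sketched on a toy parse tree. The `Node` class, the function names and the tree are assumptions for illustration; head-word features are left out because they need head-finding rules, and the voice is simply passed in rather than detected.

```python
class Node:
    """Minimal constituent-tree node with a parent pointer."""

    def __init__(self, label, children=()):
        self.label = label
        self.children = list(children)
        self.parent = None
        for child in self.children:
            child.parent = self


def ancestors(node):
    """Return the chain from node up to the root, node first."""
    chain = [node]
    while chain[-1].parent is not None:
        chain.append(chain[-1].parent)
    return chain


def leaves(node):
    """Return the leaf nodes under node, left to right."""
    if not node.children:
        return [node]
    return [leaf for child in node.children for leaf in leaves(child)]


def path_feature(predicate, constituent):
    """Labels going up from the predicate to the common ancestor, then down."""
    up, down = ancestors(predicate), ancestors(constituent)
    common = next(n for n in up if any(n is m for m in down))
    up_labels = [n.label for n in up[: up.index(common) + 1]]
    down_labels = [n.label for n in reversed(down[: down.index(common)])]
    return "↑".join(up_labels) + "".join("↓" + label for label in down_labels)


def extract_features(predicate, lemma, constituent, root, voice):
    """Build a feature dictionary for one predicate-constituent pair."""
    order = leaves(root)
    before = order.index(leaves(constituent)[0]) < order.index(predicate)
    return {
        "predicate": lemma,
        "phrase_type": constituent.label,
        "path": path_feature(predicate, constituent),
        "position": "before" if before else "after",
        "voice": voice,
    }


# Toy tree for "Mary loaded the truck"; words are omitted and NPs are leaves.
subject, verb, obj = Node("NP"), Node("VBD"), Node("NP")
root = Node("S", [subject, Node("VP", [verb, obj])])
subject_features = extract_features(verb, "load", subject, root, "active")
object_features = extract_features(verb, "load", obj, root, "active")
```

Such dictionaries would then be fed to a classifier that decides the role of each constituent.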
SRL has traditionally been a supervised task but adequate annotated resources for training are scarce. Research from early 2010s focused on inducing semantic roles and frames. There's also been research on transferring an SRL model to low-resource languages.
One novel approach trains a supervised model using question-answer pairs. Given a sentence, even non-experts can accurately generate a number of diverse pairs. We therefore don't need to compile a pre-defined inventory of semantic roles or frames.
Which are the neural network approaches to SRL?
Architecture and details of LISA for SRL. Source: Strubell et al. 2018, fig. 1-2.
Neural network approaches to SRL have been the state of the art since the mid-2010s. We note a few of them.
Roth and Lapata (2016) used the dependency path between a predicate and its argument. Words and relations along the path are represented and input to an LSTM. Another input layer encodes binary features. A hidden layer combines the two inputs using ReLUs. Finally, there's a classification layer.
He et al. (2017) used a deep BiLSTM with highway connections and recurrent dropout. The input is word-predicate pairs, and a softmax output layer predicts tags in BIO notation. GloVe embeddings were used for the input. Another research group also used a BiLSTM with highway connections but used CNN+BiLSTM to learn character embeddings for the input.
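BIO notation turns labelled spans into one tag per token, which is what lets SRL be treated as sequence tagging. The sketch below shows the conversion in both directions; the function names, the span format (start, exclusive end, label) and the example labels are assumptions for illustration, and the decoder assumes well-formed tag sequences.

```python
def spans_to_bio(tokens, spans):
    """Convert (start, end, label) spans, end exclusive, to one BIO tag per token."""
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        tags[start] = f"B-{label}"  # first token of the span
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"  # remaining tokens of the span
    return tags


def bio_to_spans(tags):
    """Recover (start, end, label) spans from a well-formed BIO tag sequence."""
    spans, start, label = [], None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a final span
        if start is not None and not tag.startswith("I-"):
            spans.append((start, i, label))
            start = None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


tokens = "Mary loaded the truck with hay".split()
spans = [(0, 1, "ARG0"), (1, 2, "V"), (2, 4, "ARG1"), (4, 6, "ARG2")]
tags = spans_to_bio(tokens, spans)
```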
Since 2018, self-attention has been used for SRL. Strubell et al. (2018) applied it to train a model to jointly predict POS tags and predicates, do parsing, attend to syntactic parse parents, and assign semantic roles. One of the self-attention layers attends to syntactic relations. Shi and Lin (2019) used BERT for SRL without using syntactic features and still got state-of-the-art results.

Milestones

350 BC

Indian grammarian Pāṇini authors Aṣṭādhyāyī, a treatise on Sanskrit grammar. It records rules of linguistics, syntax and semantics. His work is discovered only in the 19th century by European scholars. His work identifies semantic roles under the name of kāraka.

1965

In what may be the beginning of modern thematic roles, Gruber gives the example of motional verbs (go, fly, swim, enter, cross) and states that the entity conceived of being moved is the theme. The theme is syntactically and semantically significant to the sentence and its situation. A related development of semantic roles is due to Fillmore (1968).

1991

Dowty notes that all through the 1980s new thematic roles were proposed. There's no consensus even on the common thematic roles. A large number of roles results in role fragmentation and inhibits useful generalizations. As an alternative, he proposes Proto-Agent and Proto-Patient based on verb entailments. An argument may be either or both of these in varying degrees. He then considers both fine-grained and coarse-grained verb arguments, and 'role hierarchies'. Essentially, Dowty focuses on the mapping problem, which is about how syntax maps to semantics.

Sep 1993

Beth Levin publishes English Verb Classes and Alternations. This work classifies over 3,000 verbs by meaning and behaviour. She makes a hypothesis that a verb's meaning influences its syntactic behaviour. She then shows how identifying verbs with similar syntactic structures can lead us to semantically coherent verb classes. For example, "John cut the bread", "Bread cuts easily" and "John cut at the bread" are valid. But 'cut' can't be used in the form "The bread cut".

1997

FrameNet workflows, roles, data structures and software. Source: Baker et al. 1998, fig. 3.

FrameNet is launched as a three-year NSF-funded project. Semantic information is manually annotated on large corpora along with descriptions of semantic frames. Conceptual structures are called frames. Role names are called frame elements. For example, in the Transportation frame, Driver, Vehicle, Rider, and Cargo are possible frame elements.

2000

Kipper et al. at the University of Pennsylvania create VerbNet. This is a verb lexicon that includes syntactic and semantic information. In 2004 and 2005, other researchers extend Levin's classification with more classes. In 2008, Kipper et al. use Levin-style classification on PropBank with 90% coverage, thus providing a useful resource for researchers.

2000

Just as the Penn Treebank has enabled syntactic parsing, the Proposition Bank or PropBank project is proposed to build a semantic lexical resource to aid research into linguistic semantics. The idea is to add a layer of predicate-argument structure to the Penn Treebank II corpus. By 2005, this corpus is complete. It uses VerbNet classes. In time, PropBank becomes the preferred resource for SRL since FrameNet is not representative of the language.

2002

Making use of FrameNet, Gildea and Jurafsky apply statistical techniques to identify semantic roles filled by constituents. Their work also studies different features and their combinations. They also explore how syntactic parsing can integrate with SRL. Other techniques explored are automatic clustering, WordNet hierarchy, and bootstrapping from unlabelled data. In the coming years, this work influences greater application of statistics and machine learning to SRL.

2004

Swier and Stevenson note that SRL approaches are typically supervised and rely on manually annotated FrameNet or PropBank. They propose an unsupervised "bootstrapping" method. They start with unambiguous role assignments based on a verb lexicon. In further iterations, they use the probability model derived from current role assignments. This may well be the first instance of unsupervised SRL.

Jun 2008

Punyakanok et al. apply full syntactic parsing to the task of SRL. They show that this impacts most during the pruning stage. Based on CoNLL-2005 Shared Task, they also show that when outputs of two different constituent parsers (Collins and Charniak) are combined, the resulting performance is much higher. They call this joint inference.

Oct 2008

An example sentence with both syntactic and semantic dependency annotations. Source: Johansson and Nugues 2008, fig. 1.

Johansson and Nugues note that state-of-the-art use of parse trees is based on constituent parsing and not much has been achieved with dependency parsing. This is due to low parsing accuracy. They use the dependency-annotated Penn TreeBank from the 2008 CoNLL Shared Task on joint syntactic-semantic analysis. Using only dependency parsing, they achieve state-of-the-art results.

2009

Researchers propose SemLink as a tool to map PropBank representations to VerbNet or FrameNet. PropBank provides the best training data. VerbNet excels in linking semantics and syntax. FrameNet provides the richest semantics. SemLink allows us to use the best of all three lexical resources. For example, VerbNet can be used to merge PropBank and FrameNet to expand training resources. By 2014, SemLink integrates OntoNotes sense groupings, WordNet and WSJ Tokens as well. Shi and Mihalcea (2005) presented an earlier work on combining FrameNet, VerbNet and WordNet.

2015

Inspired by Dowty's work on proto-roles in 1991, Reisinger et al. produce a large-scale corpus-based annotation. They use PropBank as the data source and the Mechanical Turk crowdsourcing platform for annotation. They confirm that fine-grained role properties predict the mapping of semantic roles to argument position. In 2016, this work leads to Universal Decompositional Semantics, which adds semantics to the syntax of Universal Dependencies.

Nov 2017

Neural network architecture of the SLING parser. Source: Ringgaard et al. 2017, fig. 1.

Google open sources SLING, which represents the meaning of a sentence as a semantic frame graph. Unlike a traditional SRL pipeline that involves dependency parsing, SLING avoids intermediate representations and directly captures semantic annotations. It uses an encoder-decoder architecture. Simple lexical features (raw word, suffix, punctuation, etc.) are used to represent input words. The decoder computes a sequence of transitions and updates the frame graph. The system is based on the frame semantics of Fillmore (1982).

Sep 2019

SpanGCN encoder: red/black lines represent parent-child/child-parent relations respectively. Source: Marcheggiani and Titov 2019, fig. 2.

While dependency parsing has become popular lately, it's really constituents that act as predicate arguments. Marcheggiani and Titov use a Graph Convolutional Network (GCN) in which graph nodes represent constituents and graph edges represent parent-child relations. BiLSTM states represent start and end tokens of constituents. Their earlier work from 2017 also used a GCN but to model dependency relations.

Cite As

Devopedia. 2020. "Semantic Role Labelling." Version 3, January 10. Accessed 2023-11-12. https://devopedia.org/semantic-role-labelling
