Abstract
The notion of latent-variable probabilistic context-free derivation of syntactic structures is enhanced to allow heads and unrestricted discontinuities. The chosen formalization covers both constituency parsing and dependency parsing. By the new framework, one obtains a probability distribution over the space of all discontinuous parses. This lends itself to intrinsic evaluation in terms of cross-entropy. The derivational model is accompanied by an equivalent automaton model, which can be used for deterministic parsing.
Original language | English |
---|---|
Article number | 104619 |
Journal | Information and Computation |
Volume | In press |
Early online date | 10 Aug 2020 |
DOIs | |
Publication status | E-pub ahead of print - 10 Aug 2020 |
Keywords
- Parsing
- Grammars
- Weighted automata