Philip, Gill
(2005)
Identifying Multi-Word Units in Context.
[Preprint]
Full text disponibile come:
Abstract
Abstract: Far from being linguistic anomalies, multi-word expressions abound in natural language, yet their identification is surprisingly problematic. The same combination of words can occur as a compositional, fully lexical string or as a delexicalised multi-word unit (MWU). How can these different manifestations of a series of words be distinguished one from the other? To exacerbate the problem, the creativity of language users results in the appearance of non-canonical forms of MWUs. How can these innovative uses be retrieved so that they can be incorporated into a comprehensive analysis of the MWU under study? This paper sets forth procedures for retrieving non-canonical variants from large general reference corpora, and addresses the disambiguation of compositional and non-compositional multi-word strings from a collocational standpoint.
Abstract
Abstract: Far from being linguistic anomalies, multi-word expressions abound in natural language, yet their identification is surprisingly problematic. The same combination of words can occur as a compositional, fully lexical string or as a delexicalised multi-word unit (MWU). How can these different manifestations of a series of words be distinguished one from the other? To exacerbate the problem, the creativity of language users results in the appearance of non-canonical forms of MWUs. How can these innovative uses be retrieved so that they can be incorporated into a comprehensive analysis of the MWU under study? This paper sets forth procedures for retrieving non-canonical variants from large general reference corpora, and addresses the disambiguation of compositional and non-compositional multi-word strings from a collocational standpoint.
Tipologia del documento
Preprint
Autori
Parole chiave
canonical, variation; non-compositional; salience
Settori scientifico-disciplinari
DOI
Data di deposito
12 Set 2005
Ultima modifica
16 Mag 2011 11:42
URI
Altri metadati
Tipologia del documento
Preprint
Autori
Parole chiave
canonical, variation; non-compositional; salience
Settori scientifico-disciplinari
DOI
Data di deposito
12 Set 2005
Ultima modifica
16 Mag 2011 11:42
URI
Statistica sui download
Statistica sui download
Gestione del documento: