SOAS Research Online

A Free Database of the Latest Research by SOAS Academics and PhD Students

[skip to content]

Hill, Nathan W. and Garrett, Edward (2017) A part-of-speech (POS) lexicon of Classical Tibetan for NLP. [Datasets]

[img] Archive - Other
Download (88kB)

Abstract

This part-of-speech (POS) lexicon of Classical Tibetan was prepared in the course of the research project 'Tibetan in Digital Communication' (2012-2015) hosted at SOAS, University of London and funded by the UK's Arts and Humanities Research Council (grant code: AH/J00152X/1). The data for verbs comes from a digitized version of A Lexicon of Tibetan Verb Stems as Reported by the Grammatical Tradition (Munich: Bayerische Akademie der Wissenschaften, 2010) by Nathan W. Hill. Otherwise data comes from the manually part-of-speech tagged training data produced by the corpus and a few lexical items specifically added by hand to improve rule based tagging

Item Type: Datasets
SOAS Departments & Centres: Departments and Subunits > School of History, Religions & Philosophies > Department of Religions & Philosophies
Departments and Subunits > Department of East Asian Languages & Cultures
Legacy Departments > Faculty of Languages and Cultures > Department of the Languages and Cultures of China and Inner Asia
Legacy Departments > Faculty of Languages and Cultures > Department of Linguistics
Subjects: P Language and Literature > PI Oriental languages and literatures
P Language and Literature > PL Languages and literatures of Eastern Asia, Africa, Oceania
Date Deposited: 16 Jun 2017 08:00
URI: https://eprints.soas.ac.uk/id/eprint/24153
Funders: Arts and Humanities Research Council

Altmetric Data

There is no Altmetric data currently associated with this item.

Statistics

Download activity - last 12 monthsShow export options
Downloads since deposit
6 month trend
83Downloads
6 month trend
703Hits
Accesses by country - last 12 monthsShow export options
Accesses by referrer - last 12 monthsShow export options

Repository staff only

Edit Item Edit Item