This portion of the lexicography project deals with the only known body of literature in Mixtepec-Mixtec language which is comprised of series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). The target audience of these documents is primarily young MIX speakers and are designed for use as classroom handouts and/or lessons for primary/elementary school-level. The topical content of these publications contain both culturally specific, and non-culturally specific subject matter supplemented by illustrations. Topics covered in these documents include: vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, local Mixtec legends amongst others. These publications make up the second most important collection in this project and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety.
The encoding of this portion of the MIX Language resources can be divided into the following components:
i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties;
ii) Annotation and glossing of lexical information and structures in documents; tasks include;
glossing of orthographical lexical items;
assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.);
linking of various tokens of the same lexical item within each document;
linking corresponding bilingual items (Spanish-Mixtec) within each document;
entry of analyzed, glossed and categorized lexical information into TEI-Dictionary;
link orthographical forms of lexical items with phonetic transcriptions of spoken forms;
iii) Classification of Document Types and Ontology Linking;
replace current SIL document classification model with established and expansible document system which can be used to classify any new document or literature produced in the language in the future;
as per the goal of the overall project, the use of language-specific glosses (including 'Interlingua') will be replaced by grounding the semantic profile of each lexical item by referencing ontological entities (from existing ontologies such as DBpedia, SKOS, etc.);
Note: all of the SIL-Mexico collection documents have images associated with them and some are integral to the semantics of the documents themselves as the text references the images. Currently (as of 09/02/2014) the TAPAS system is still under construction and these images, while embedded in the document and uploaded onto the project directory are not yet visible in the viewing window to visitors. This is temporary and will be visible in the near future. The original documents (and their images) can be found at the following: http://www-01.sil.org/mexico/mixteca/mixtepec/00i-mixtecomixtepec-mix.htm