BS ISO 24613-1:2019
Language resource management. Lexical markup framework (LMF). Core model

Standard No.
BS ISO 24613-1:2019
Release Date
2020
Published By
British Standards Institution (BSI)  GB  /  BSI
Status
 2024-01
Replace By
BS ISO 24613-1:2024
Latest
BS ISO 24613-1:2024
 

Introduction

Standard Technology Evolution and Background

As a comprehensive revision of ISO 24613:2008, the 2019 version of the LMF standard is released in parts. The core model (Part 1) adds the LexiconInformation and GrammaticalInformation classes, refactors the original Representation class into the OrthographicRepresentation class, and introduces the cross-reference (CrossREF) metamodel, which significantly improves the support for multilingual dictionary interoperability.


Core Model Architecture Analysis

Core Classes Inherited Characteristics Key Data Categories
LexicalResource No Subclassing languageCoding/scriptCoding
LexicalEntry Allow Subclassing formType/partType
OrthographicRepresentation Allow Subclassing representationType/xml:lang

Data category selection mechanism

The standard achieves flexible modeling through DCS (Data Category Selection):

  1. Standardized allocation: mandatory use of ISO 639 language codes and ISO 15924 text codes
  2. User customization: extend special needs through User-defined data categories
  3. Typed implementation: equivalent modeling can be achieved through subclass instantiation (such as Lemma subclass) or through data category assignment

Application case: In the development of Arabic dictionaries, the same root word can be associated with multiple LexicalEntry objects, and morphological annotation is implemented through the formStructure=root data category.


Cross-model reference specifications

The CrossREF package provides three types of key constraints:

  • Reference type: internal/external
  • Relation type: 12 preset values such as synonym/antonym/variant
  • ID specification: Supports multiple identifier systems such as IRI/URI/URL

Implementation suggestions

Model simplification principle based on clause 5.5.6 of the standard:

Scenario Recommended solution Risk of data loss
Monolingual dictionary Prefer subclass inheritance Low
Multilingual database Focus on data category selection Requires verification of metadata compatibility
Machine Readable Dictionary (MRD) Combined with OrthographicRepresentation subclassing Medium

BS ISO 24613-1:2019 Referenced Document

  • ISO 15924 Information and documentation — Codes for the representation of names of scripts — Amendment 1
  • ISO 16642 Management of terminology resources — Terminological markup framework*2025-12-10 Update
  • ISO 639 Code for individual languages and language groups*2023-11-01 Update

BS ISO 24613-1:2019 history

  • 2024 BS ISO 24613-1:2024 Language resource management. Lexical markup framework (LMF) - Core model
  • 2020 BS ISO 24613-1:2019 Language resource management. Lexical markup framework (LMF). Core model
 Language resource management. Lexical markup framework (LMF). Core model

Standard and Specification




Copyright ©2026 All Rights Reserved
Update: Thu, 14 May 2026 04:49:41 +0000