To extend mappings and requests of letters, numbers and symbols to the full Unicode range or to the Basic Multilingual Plane (BMP) or to a part of the Unicode range to handle languages, module generators use information of the UnicodeData.txt.
The Unicode Consortium publishes in the Unicode Character Database (UCD) the text file UnicodeData.txt.
UCD (Unicode Standard Version 15.0.0 released 13-September-2022): https://unicode.org/Public/UNIDATA/UnicodeData.txt
or look for the latest version of the UCD at https://www.unicode.org/Public/
UnicodeData.txt is the database for the module generators UnicodeXXXGenerator (UnicodeCaseGenerator, UnicodeCaseGeneratorExt, UnicodeIsCaseGenerator, UnicodeLetterGenerator, UnicodeNumberGenerator, UnicodeSpecialsGenerator, UnicodeArabicGenerator, UnicodeDecompositionGenerator, UnicodeCharactersGenerator and UnicodeBidirectionalGenerator, UnicodeEastAsianWidthGenerator, UnicodeMathClassGenerator, UnicodeEmojisGenerator). The generated auxiliary files are simplified by UnicodeNormXXX (UnicodeNormCase, UnicodeNormIsCase, UnicodeNormLetter, UnicodeNormNumber, UnicodeNormSpecials, UnicodeNormArabic, UnicodeNormDecomposition100H, UnicodeNormDecomposition, UnicodeNormDecompositionIter, UnicodeNormCharacters and UnicodeNormBidirectional, UnicodeNormEastAsianWidth, UnicodeNormMathClass, UnicodeNormEmojis.
The result are the modules UnicodeXXX_Mapping (UnicodeCase_MappingGen, UnicodeIsCase_Mapping, UnicodeLetter_Mapping, UnicodeNumber_Mapping, UnicodeSpecials_Mapping, UnicodeArabic_Mapping, UnicodeDecomposition100H_Mapping, UnicodeDecomposition_Mapping, UnicodeDecompositionIter_Mapping, UnicodeUnassigned_Mapping, UnicodeCharacters, UnicodeBidirectional_Mapping, UnicodeEastAsianWith, UnicodeMathClass_Mapping, UnicodeEmojis_Mapping). The procedures in the generated modules consist of a CASE statement. The case expression can be of type character or integer.