Consultor Eletrônico



Kbase 20632: I18N: The DLC/PROLANG Directory Explained.
Autor   Progress Software Corporation - Progress
Acesso   Público
Publicação   5/14/2008
Status: Verified

GOAL:

Contents of the PROLANG directory explained

GOAL:

The DLC/PROLANG Directory Explained

FACT(s) (Environment):

Progress 9.x
OpenEdge 10
All Supported Operating Systems
Progress/OpenEdge Versions

FIX:

Each subdirectory of the PROLANG directory contains a promsgs file for the language that it represents, a .df file for each code page, and an empty database that is marked with a code page appropriate for the language.
Some directories have more than one .pf file because a language is used in more than one region. Although the language might be common to these regions, other customs, such as date or number format or collation, might be different, so other settings in the .pf file must be different.

Installation creates a startup.pf in the $DLC directory with settings that you are prompted for. The .pf files in these directories (below) are example settings:

Language Directory Region File Name
-------------------- --------- ---------------------- ---------------
Arabic arb Saudi Arabia arabic.pf
Czech cze Czech Republic czech.pf
Danish dan Denmark danish.pf
Dutch dut Netherlands dutch.pf
Dutch dut Belgium dutbel.pf
Dutch dut Netherlands startup.pf
English-American ame Canada engcan.pf
English-American ame United States engus.pf
English-American ame United States startup.pf
English-British eng Australia engaus.pf
English-British eng Canada engcan.pf
English-British eng United Kingdom enguk.pf
English-British eng United Kingdom startup.pf
Finnish fin Finland finnish.pf
French fre &nbsp.; Belgium frebel.pf
French fre France french.pf
French fre Quebec freque.pf
French fre Switzerland freswit.pf
French fre France startup.pf
German ger Austria geraus.pf
German ger Germany german.pf
German ger Switzerland gerswit.pf
German ger Germany startup.pf
Greek gre Greece greek.pf
Hebrew hbr Israel hebrew.pf
Hungarian hun Hungary hungary.pf
Icelandic isl Iceland iceland.pf
Italian ita Italy italian.pf
Italian ita Switzerland itaswit.pf
Italian ita Italy startup.pf
Japanese jpn Japan &nbsp.; japanese.pf
Japanese jpn Japan eucjis/eucjis.pf
Korean kor Korea korean.pf
Korean kor Korea cp949/cp949.pf
Korean kor Korea cp1361/cp1361.pf
Latin Spanish spl South America/
Latin America spanlat.pf
Macedonian mkd Macedonia makedoni.pf
Norwegian nor Norway norwegia.pf
Persian per persian.pf
Polish pol Poland polish.pf
Portuguese brz Brazil portbrz.pf
Portuguese por Portugal portug.pf
Russian rus Russia russian.pf
Serbian srb Serbia serbian.pf
Simplified Chinese sch PRC schprc.pf
Spanish mex Mexico spanmex.pf
Spanish spa Spain &nbs.p; spanish.pf
Swedish swe Sweden swedish.pf
Thai tai Thailand tai.pf
Traditional Chinese tch Taiwan tchroc.pf
Traditional Chinese tch Taiwan cp950/cp950.pf
Turkish tur Turkey turkish.pf
Convmap.dat:

Convmap.dat is now partitioned into several .dat files. It uses the new INCLUDE syntax that includes the smaller files but still builds a comprehensive convmap.cp. The portioning makes the management of the many tables easier and simpler.

convmap.dat and the other .dat files are located in the $DLC/prolang/convmap directory. The convmap.cp file remains in $DLC.

The convmap directory also contains some new files that are used for defining word break rules. These files are named <codepage>bas.wbt. The "bas" is for "basic", meaning the default behavior. There is one table for each code page
family.

The table can be used for the code page that it is named for, or for any code page that it is convertible to.

The table v8-deflt.wbt is the table provided with Progress Versions 7.x and 8.x and is provided for compatibility. Progress recommends installing the new table that is appropriate for your database character set, and to rebuild indexes in order to improvements word indexing (it is not required).

If you change any of the *.dat tables you can make a new convmap.cp with the command:

proutil -C codepage-compiler $DLC/prolang/convmap/convmap.dat

$DLC/convmap.cp:

If you do not put the new convmap.cp in the $DLC directory, you can point to this new file instead by using the -convmap startup parameter or by setting the PROCONV environment variable. This is useful for testing a modified convmap.cp,
before you install on a deployed system.

If you want to list the code pages and collations that are available in the current convmap.cp, you can use the procedure:

$DLC/prolang/list-cp.p.

If you want to list the supported code page conversions that are available in the current convmap.cp, you can use the procedure:

$DLC/prolang/listconv.p.

The following tables are output from these procedures.

Codepage Tables available in the convmap.dat file are:

- Case Tables.

For every code page, there is a table defining case rules (upper and lower case letter mappings) named BASIC. In addition, the following character sets provide case rules for French as used in France.

(In France, the uppercase of version of a letter does not have an accent. In French Canada, the uppercase letter retains its accent.)

Codepage Case
Name Name
------------------- -------------------
1252 FrenchR> IBM850 French
IBM858 French
SO8859-1 French
ISO8859-15 French
undefined French
- Collation Tables.

The following is a list of the collations provided for each code page:

Codepage Collation
Name Name
--------- --------------------
1250 CZECH, HUNGARIAN, POLISH, ROMANIAN
1251 RUSSIAN
1252 BASIC, BASIC9, DANISH, FINNISH,
GERMAN-LIBRARY, GERMAN9, ICELANDIC,
NORWEGIAN, SPANISH9, SWEDISH
1253 GREEK
1254 TURKISH
1256 BASIC
620-2533 THAI
708 BASIC
709 BASIC
710 BASIC
711 BASIC
714 BASIC
720 BASIC
721 BASIC
786 BASIC
BIG-5 BASIC
CP1361 BASIC
CP949 BASIC
CP950 BASIC
EUCJIS BASIC
GB2312 BASIC
IBM437 BASIC
IBM850 BASIC, DANISH, FINNISH,
GERMAN-LIBRARY, GERMAN9, ICELANDIC,
NORWEGIAN, SWEDISH
IBM851 GREEK
IBM852 CZECH, HUNGARIAN, POLISH, ROMANIAN
IBM857 TURKISH
IBM858 BASIC, DANISH, FINNISH, GERMAN9,
ICELANDIC, NORWEGIAN, SWEDISH
IBM861 ICELANDIC
IBM862 HEBREW
IBM866 RUSSIAN
ISO8859-1 BASIC, CZECH, DANISH, FINNISH, GERMAN-LIBRARY,
GERMAN9, ICELANDIC, NORWEGIAN, SPANISH9, SWEDISH
ISO8859-15 BASIC, CZECH, DANISH, FINNISH,
GERMAN9, ICELANDIC, NORWEGIA.N, SWEDISH
ISO8859-2 CZECH, HUNGARIAN, POLISH, ROMANIAN, RUSSIAN
ISO8859-8 HEBREW
KOI8-R RUSSIAN
KSC5601 BASIC
MAZOVIA POLISH
ROMAN-8 BASIC, CZECH
SHIFT-JIS BASIC
UNDEFINED BASIC
UTF-8 BASIC
- Conversion Tables.

The following is a list of every code page conversion provided in convmap.cp:

NOTE: BIG-5 to CP950, KSC5601 to CP949, and KSC5601 to CP1361, are one-way conversions.

SHIFT-JIS to EUCJIS and EUCJIS to SHIFT-JIS cannot be used between CPINTERNAL and CPSTREAM. These conversions are only for client-server-database conversions. Refer to the Progress Internationalization Guide for more information.

Source Target
Name Name
---------- --------------------
1250 IBM852, ISO8859-2, MAZOVIA, UTF-8
1251 IBM866, ISO8859-5, KOI8-R, UTF-8
1252 IBM037, IBM273, IBM277, IBM278,
IBM284, IBM297, IBM437, IBM500,
IBM850, IBM858, IBM861, ISO8859-1,
UTF-8
1253 IBM851, UTF-8
1254 IBM857, UTF-8
1256 708, 709, 710, 711, 714, 720, 721,
786, UTF-8
620-2533 UTF-8.
708 1256, UTF-8
709 1256, UTF-8
710 1256, UTF-8
711 1256, UTF-8
714 1256, UTF-8
720 1256, UTF-8
721 1256, UTF-8
786 1256, UTF-8
BIG-5 CP950, UTF-8
CP949 UTF-8
EUCJIS SHIFT-JIS, UTF-8
GB2312 UTF-8
IBM037 1252, IBM850, IBM858, ISO8859-1,
ISO8859-15, UTF-8
IBM273 1252, IBM850, IBM858, ISO8859-1,
ISO8859-15, UTF-8
IBM277 1252, IBM850, IBM858, ISO8859-1,
ISO8859-15, UTF-8
IBM278 1252, IBM850, IBM858, ISO8859-1,
&nbsp.; ISO8859-15, UTF-8
IBM284 1252, ISO8859-1, ISO8859-15, UTF-8
IBM297 1252, IBM850, IBM858, ISO8859-1,
ISO8859-15, UTF-8
IBM437 1252, IBM850, IBM858, ISO8859-1,
ISO8859-15, UTF-8
IBM500 1252, IBM850, IBM858, ISO8859-1,
ISO8859-15, UTF-8
IBM850 1252, IBM037, IBM273, IBM277, IBM278,
IBM297, IBM437, IBM500, IBM858,

IBM861, ISO8859-1, ISO8859-15, ROMAN-8, UTF-8
IBM851 1253, UTF-8
IBM852 1250, ISO8859-2, MAZOVIA, UTF-8
IBM857 1254, UTF-8
IBM858 1252, IBM037, IBM273, IBM277, IBM278,
IBM297, IBM437, IBM500, IBM850, IBM861,
ISO8859-15, ROMAN-8, UTF-8
IBM861 1252, IBM850, IBM858, ISO8859-1,
ISO8859-15, UTF-8
IBM862 ISO8859-8, UTF-8
IBM866 1251, ISO8859-5, KOI8-R, UTF-8
ISO8859-1 1252, IBM037, IBM273, IBM277, IBM278,
IBM284, IBM297, IBM437, IBM500, IBM850,
IBM861, ISO8859-15, ROMAN-8, UTF-8
ISO8859-15 IBM037, IBM273, IBM277, IBM278, IBM284,
IBM297, IBM437, IBM500, IBM850, IBM858,
IBM861, ISO8859-1, ROMAN-8, UTF-8
ISO8859-2 1250, IBM852, MAZOVIA, UTF-8
ISO8859-5 1251, IBM866, KOI8-R, UTF-8
ISO8859-8 IBM862, UTF-8
KOI8-R 1251, IBM866, ISO8859-5, UTF-8
KSC5601 CP1361, CP949, UTF-8
MAZOVIA 1250, IBM852, ISO8859-2, UTF-8
ROMAN-8 IBM850, IBM858, ISO8859-1, ISO8859-15, UTF-8
SHIFT-JIS EUCJIS, UTF-8
UTF-8 1250, 1251, 1252, 1253, 1254, 1256,
620-2533, 708, 709, 710, 711, 714, 720,
721, 786, BIG-5, CP1361, CP949, CP950,
EUCJIS, GB2312, IBM037, IBM273, IBM277,
IBM278, IBM284, IBM297, IBM437, IBM500,
&nb.sp; IBM850, IBM851, IBM852, IBM857, IBM858,
IBM861, IBM862, IBM866, ISO8859-1,
ISO8859-15, ISO8859-2, ISO8859-5,
ISO8859-8, KOI8-R, KSC5601, MAZOVIA,
ROMAN-8, SHIFT-JIS
.

FIX:

References to Written Documentation:

Progress Solutions:

19799, "Code Pages and Possible Conversions"
19146, "How to Find What i18N Parameters are Used"
20633 "I18N Internationalization Startup Parameters Explained"