Kbase P103198: In what length are characters stored within an UTF Database ?
Autor |
  Progress Software Corporation - Progress |
Acesso |
  Público |
Publicação |
  4/13/2005 |
|
Status: Unverified
GOAL:
In what length are characters stored within an UTF Database ?
GOAL:
How much the conversion of a Database to UTF may affect his growth ?
FIX:
The UTF-8 characters are stored in the database with variable length in terms of each character's property on byte counts.
4 is the maximum byte counts for UTF-8 characters. A character only can be stored with 4 bytes when it is a 4 bytes character
Which means that:
If the character is single byte then it will be stored as a single byte.
If the character is double byte then it will be stored as a double byte.
If the character is triple byte then it will be stored as a triple byte.
If the character is multi byte then it will be stored as a multi byte.
This also means that, when converting an existing Database to UTF, the Database size will not necessarily grow bigger if all the character are stored as single byte.