jadawin
Guest
Background: I decided I wanted to make some village names to read in order. I thought the whole numbering thing was a bit overdone, and I really didn't want to 'number with letters', so I thought, what can I do?
Punctuation! That's always worked well in the past. Use the ascii values from 33 through 63, skipping numbers, skipping anything the game complains about (like " perhaps, or ^), and it'll be fine.
So, I started. Only have 5 or 6 villages to name this way at a time right now. But....something is amiss! The standard collating sequence for 7-bit ascii just isn't working! I was wondering, then, exactly what sort of character set is being used? I went and looked up ISO 646, which is the daughter of ASCII on the international scene, thinking perhaps the DE variant would be a problem, but the only difference in the (33-47) range is that some of the values are usuable as diacritics.
So, I ask, what is the sorting order used? I would expect the following to give me a strict ordering:
!"$#%&'()*+,-./ (to begin with)
Now, it trims a leading space, so, I have to use ! as my first
But experimentally, I get:
!"#%$
And then everything I add in today, falls in after the !
Is it perhaps just IGNORING the punctuation? No, that can't be it, because the rest isn't sorted by the 2nd character, and when I leave out the leading character, it drops to the bottom, even starting with an 'a'.
Leaving out the character is my temporary solution, and sure, I can come up with another, but that doesn't answer the question: WTF character set is being used for a collation sequence? :icon_eek:
Punctuation! That's always worked well in the past. Use the ascii values from 33 through 63, skipping numbers, skipping anything the game complains about (like " perhaps, or ^), and it'll be fine.
So, I started. Only have 5 or 6 villages to name this way at a time right now. But....something is amiss! The standard collating sequence for 7-bit ascii just isn't working! I was wondering, then, exactly what sort of character set is being used? I went and looked up ISO 646, which is the daughter of ASCII on the international scene, thinking perhaps the DE variant would be a problem, but the only difference in the (33-47) range is that some of the values are usuable as diacritics.
So, I ask, what is the sorting order used? I would expect the following to give me a strict ordering:
!"$#%&'()*+,-./ (to begin with)
Now, it trims a leading space, so, I have to use ! as my first
But experimentally, I get:
!"#%$
And then everything I add in today, falls in after the !
Is it perhaps just IGNORING the punctuation? No, that can't be it, because the rest isn't sorted by the 2nd character, and when I leave out the leading character, it drops to the bottom, even starting with an 'a'.
Leaving out the character is my temporary solution, and sure, I can come up with another, but that doesn't answer the question: WTF character set is being used for a collation sequence? :icon_eek: