Jump to content

Problem with diacritics


sernikk

Recommended Posts

It's about time to write a topic about the problem with diacritics in the cache listing.

In Polish language there are some unique letters: ą, ż, ź, ś, ć, ę, ń, ó. Sadly, they are not showing up everywhere. There is no problem with using them in the main description, but in the hint or image description, there is.

 

What is happening?

When writing a hint on the edit cache page (lets use "ążźśćęńó" for test purposes) it looks like this:

obraz.png.37de89646a5de3db5b9e77685d6c151a.png

 

But looking into the cache listing:

obraz.png.5199306089333408e942286f633639f9.png
 

Yes. All letters except for the "ó" are converted into A-Z letters. The website is completely ignoring the dashes and little tails. Some words written that way sound completely different.

 

The same problem occurs in the "Edit Image" page (after uploading any photo to the cache listing). You just can't use these letters, and that's just sad. I guess it's about bad encoding and the core of the problem is somewhere deep in the web code. The exact same bug happens to be on the Waymarking.com page, but there it's even worse, because the website is getting rid of all these letters in the whole description. I can somewhat understand that Waymarking is not a focus for HQ (sad but understandable, I hope it changes soon), but geocaching.com in 2021 should allow to use all Polish letters, when it's even accepting emoji's (in hints, yeah!).

 

It would be really nice of you to fix it, not only for me, but for all my fellow Polish friends :grin:

Greetings

 

Edited by sernikk
correcting screenshot
  • Upvote 3
Link to comment

Since hints are managed by a simple ROT-13 substitution algorithm, which can only handle 26 characters, and expects an English alphabet, it comes as no surprise that it would mangle many other languages, including Polish.   That really is 'unfixable'.  However, it should not be a problem for image captioning.  That's something that should be looked into.

 

 

 

  • Upvote 1
Link to comment
7 minutes ago, ecanderson said:

Since hints are managed by a simple ROT-13 substitution algorithm, which can only handle 26 characters, and expects an English alphabet, it comes as no surprise that it would mangle many other languages, including Polish.   That really is 'unfixable'.  However, it should not be a problem for image captioning.  That's something that should be looked into.

 

You're not exactly right ;) It handles different languages easily. Example:

I used Spanish letters and symbols and here is the result:

obraz.png.ff4c800000e82a1cb4c557c11f05e387.png

 

They are showing up with no problems. Of course the ROT-13 is not going to work, but it shouldn't in this case so it's ok.

Edited by sernikk
typos
Link to comment
2 hours ago, ecanderson said:

How can there not be at least some issues

If you type ä in a hint, it will not be "encrypted" by ROT13, but the letter is preserved and when the hint is decrypted, the hint reads like it should. That is, ROT13 treats it the same way it treats numbers in English text.

 

But if you type one of these Polish letters, apparently the original character is not preserved at all and is replaced even in the "decrypted" form.

  • Upvote 2
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...