I can’t believe nobody has done this list yet. I mean, there is one about names, one about time and many others on other topics, but not one about languages yet (except one honorable mention that comes close). So, here’s my attempt to list all the misconceptions and prejudices I’ve come across in the course of my long and illustrious career in software localisation and language technology. Enjoy – and send me your own ones!

  • darklamer@lemmy.dbzer0.com
    link
    fedilink
    arrow-up
    68
    arrow-down
    1
    ·
    1 month ago

    Geolocation is an accurate way to predict the user’s language.

    Now that’s a pet peeve of mine, a bizarre belief surprisingly often held by people, who must be oblivious to the existence of tourism.

      • darklamer@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        9
        ·
        1 month ago

        Now there you have one more pet peeve of mine, language preference settings that pretend that all language versions are equal, that shitty translations aren’t abundant. If the original text is in a language that I can read then I want the original, not some shitty translation. It shouldn’t be that hard!

    • dohpaz42@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      ·
      1 month ago

      This. When I was in Mexico on my honeymoon, Google kept redirecting me to their .mx version of Google; despite my inability to read Spanish.

      • Lucy :3@feddit.org
        link
        fedilink
        arrow-up
        16
        ·
        1 month ago

        And I always want the english version instead of the german version, despite me being german. Literally only google fucks that up. Every other site, even the small local german Uni website or the canteens meal site, respects my browsers setting. Google does not, and serves me german.

    • peoplebeproblems@midwest.social
      link
      fedilink
      English
      arrow-up
      6
      ·
      edit-2
      1 month ago

      It would be a useful way to predict it possibly, but presumably the author meant if you have support for localization, you also provide an obvious and easy means of changing the language.

      More importantly, you should be using the language an existing user has already used in the past.

      Edit: come to think of it, this is less a programmer problem, and more of a UX problem. Obviously as programmers we need to take UX into consideration, but in all my products I’ve worked on, UX is specified already by a UX designer.

      • LaggyKar@programming.dev
        link
        fedilink
        arrow-up
        7
        ·
        1 month ago

        It’s not even that, there are multiple languages spoken in the same region. Webpages should just use the language the browser tells it to use.

        • peoplebeproblems@midwest.social
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 month ago

          I had assumed the author didn’t limit his statements to web browsers. If it’s an application on a user’s box, they should be using the language the OS provides.

          In the case of less complex hardware, IoT or embedded devices with localization support, you would likely have another strategy if it doesn’t have a setup process. For something without internet or GPS, you can’t do this obviously. For something without a GUI, it’s unlikely to have localization support without direct design consideration for it’s destination.

    • schnurrito@discuss.tchncs.de
      link
      fedilink
      arrow-up
      2
      ·
      1 month ago

      yup I too remember getting YouTube ads in Hungarian when I was there as a tourist - despite not understanding Hungarian at all and watching videos only in other languages, they really ought to know that

    • Kissaki@programming.dev
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 month ago

      I hate when apps use my number formatting setting to determine display language - despite Windows having a display language as well. Even Qt does (did?) that.

  • Redkey@programming.dev
    link
    fedilink
    arrow-up
    46
    arrow-down
    2
    ·
    1 month ago

    Icons that are based on English puns and wordplay are easily understood by speakers of other languages.

    This reminded me of one of those Top Gear “drive across a foreign country in weird vehicles” specials where Jeremy Clarkson needed to borrow a cable to jump-start his car, and laboriously mimed out jumping for “jump”, and walking a dog for “lead”, to a perplexed local. Richard Hammond was cracking up but finally managed to point out what a fool Clarkson was being.

    Geolocation is an accurate way to predict the user’s language.

    And as an addendum to this, in 2025 nobody should be using Windows’ “Non-latin/-unicode character set” setting to guess the user’s preferred language. That’s a pre-WinXP kludge. I’m specifically looking at you, Intel integrated graphics software writers, but you have plenty of company, don’t worry.

    • fluckx@lemmy.world
      link
      fedilink
      arrow-up
      39
      ·
      1 month ago

      Geolocation is an accurate way to predict the user’s language.

      This makes me so angry. It really really really really really does.

      Despite setting everything to English I still get my receipts in French. And all because my IP is CG-NAT to the capital which is marked as french speaking.

      What is so hard about letting me decide. The absolute fucking arrogance thinking you as a company know better than me in which language I would like to be served.

      Eat a dick Microsoft.

      • Ashen44@lemmy.ca
        link
        fedilink
        arrow-up
        7
        ·
        1 month ago

        For real. I’ve seen the fingerprinting info, I know your website can see what language I’ve set, so display your website in that language!

  • TehPers@beehaw.org
    link
    fedilink
    English
    arrow-up
    15
    ·
    1 month ago

    Another couple missing:

    • every language uses gendered nouns/verbs/adjectives/pronouns/etc
    • no language uses gendered nouns/verbs/adjectives/pronouns/etc
    • pronouns referring to people are always gendered
    • pronouns are always singular (1) or plural (2+)

    A fun language to learn regarding these is Hawaiian, where the language uses a-class and o-class rather than masculine and feminine, and which you use is largely based on how much control you have over it.

  • locuester@lemmy.zip
    link
    fedilink
    English
    arrow-up
    12
    arrow-down
    1
    ·
    1 month ago
    • There is always only one correct way to spell anything.

    “gray" and “grey” are both correct spellings of the color between black and white.

  • 2xsaiko@discuss.tchncs.de
    link
    fedilink
    arrow-up
    9
    ·
    1 month ago

    Segmenting a text into sentences is as easy as splitting on end-of-sentence punctuation.

    Is there a language this actually isn’t true for? It seems oddly specific like a lot of the others and I don’t think I know of one that does this. Except maybe some wack ass conlangs of course.

    • Giooschi@lemmy.world
      link
      fedilink
      English
      arrow-up
      24
      ·
      edit-2
      1 month ago

      Even in english this isn’t true, for example dots can appear inside a sentence for multiple reasons (a decimal number, an abbreviation, a quotation, three dots, etc, etc), which would make you split it into more than one piece.

    • TehPers@beehaw.org
      link
      fedilink
      English
      arrow-up
      16
      ·
      edit-2
      1 month ago

      English. I can go to the store and buy a sandwich for $8.99 all in one sentence, but splitting it on periods gives you two sentences.

        • Björn Lindström@social.sdfeu.org
          link
          fedilink
          arrow-up
          2
          ·
          1 month ago

          @2xsaiko @TehPers there’s other examples too. E.g. Thai has no spaces between words but spaces between phrases/sentences. However the spaces between phrases involve style choices similar to comma in English and many other Latin script writing systems. Also, Thai may have spaces around abbreviations special characters.

          I’m quite familiar with Thai so that’s close at hand but I guess it’s the same in a lot of other writing systems based on Brahmic scripts.

  • Bysmuth@lemmy.zip
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 month ago

    “All languages (that use the Latin alphabet) have the same alphabetical sorting order.” oh this one is very unexpected to me. Does someone know an example? I’m curious

  • MyNameIsRichard@lemmy.ml
    link
    fedilink
    arrow-up
    6
    arrow-down
    6
    ·
    edit-2
    1 month ago

    Every language has words for yes and no.

    Assuming yes and no means true and false, c has numbers (1, 0) for yes and no and c++ can use those numbers for yes and no because it is a superset of c.

    Technically, it’s 0 and non-0 but I always use 1. They are integers rather than keywords.

    • hddsx@lemmy.ca
      link
      fedilink
      arrow-up
      1
      arrow-down
      8
      ·
      edit-2
      1 month ago

      Technically, 0 is false and anything else is true. !0 is 1, though, IIRC

      • Crestwave@lemmy.world
        link
        fedilink
        arrow-up
        12
        ·
        1 month ago

        As far as C goes, 1 is true and 0 is false.

        In terms of POSIX exit codes, 0 is success and 1 is error.

      • MyNameIsRichard@lemmy.ml
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        1 month ago

        That’s what I meant so I’ve edited my comment to hopefully make that a bit clearer. Having actually tested it, because in C++ I use true/false, it is 0 that equals false and anything else is true. You’ll have to forgive my lack of clarity. It’s 03:02 and I’ve had about 2 hours sleep tonight and won’t be getting any more. Time for a coffee.