logo elektroda
logo elektroda
X
logo elektroda

Automatic text translation in Google Sheets (Excel online). Let's translate firmware.

p.kaczmarek2 2664 16
ADVERTISEMENT
Treść została przetłumaczona polish » english Zobacz oryginalną wersję tematu
  • Google Sheets columns with translations to various languages .
    Google Sheets offer a wide range of features, here I will focus on one of them - the GOOGLETRANSLATE function. This function allows us to automatically translate words and phrases between two languages, just like regular Google Translate, but its presence in the sheet can really save us a lot of time. In this particular case, I will demonstrate the translation of a potential home automation firmware, but I think it has many more applications and there is something for everyone.


    Basics of GOOGLETRANSLATE
    Open our Google sheet and type in A1 some example word - "Relay":

    Google Sheet with the word Przekaźnik entered in cell A1 .
    .
    In B1 type the equals sign and then start typing GOOGLE, a hint should already appear:
    GOOGLETRANSLATE function suggestion in Google Sheets .
    .
    Validate, now three arguments need to be entered:
    GOOGLETRANSLATE formula in Google Sheets .
    .
    The arguments are:
    - text to be translated (or a cell with it)
    - source language
    - target language
    Done:
    Google Sheet with GOOGLETRANSLATE formula translating the word Przekaźnik. .
    .
    Everything works:
    Google Sheet with translation function from Polish to English


    We are expanding the sheet .
    We already know how GOOGLETRANSLATE works, but it's time to start using it better. First a small change - in my firmware everything is in English, so the base language will be English. Apart from that, I'll throw out the language codes written "hard" in the cells so that they are in the first cell. So we are looking into the language codes:
    https://developers.google.com/admin-sdk/directory/v1/languages?hl=pl
    We create columns for each of them:
    Screenshot of a Google Sheet with technical translations in various languages. .
    .
    We enter the magic formula. The whole idea here is to use the character $ to enable absolute addressing for column A (source word), for the source language and for the target language row:
    Screenshot of Google Sheets showing the use of the GOOGLETRANSLATE function to translate the word Relay from English to Polish. .
    .
    Mark cell and stretch downwards:
    Google Sheets snippet with GOOGLETRANSLATE function. .
    .
    And then to the right:
    Google Sheets columns with translations to various languages .


    Related feature .
    While we're on the subject of translations, it's worth mentioning a related function, namely DetectLanguage - this returns the language code for a given word:
    Screenshot of a Google Sheet using the DETECTLANGUAGE function to identify the language of the word Achtung. .
    Thus we get:
    Google Sheets cell with the word Achtung and language code de .
    Can be useful if you do not recognise the language in question.

    Potential pitfall
    Is everything now perfectly translated? Unfortunately not, rather someone familiar with the language should review it. Google Translate isn't perfect and doesn't know the context, so strange translations still happen. Rather, what we have done should only be a base for further translations.


    Export results .
    Now it is a matter of further processing the data. A CSV file, or comma separated values, can be downloaded from Google Sheets.
    Screenshot of the download menu in Google Sheets showing the CSV option selection. .
    .
    This is how the file looks after downloading:
    Simple CSV file with technical translations into various languages
    .
    The structure of the file is really very simple. It can be loaded very easily in your own program, two steps are enough:
    - reading it line by line
    - divide the line into phrases according to commas
    Example code in C#:
    Code: C#
    Log in, to see the code
    .
    This code simply reads in such a file and extracts separate values from it. The loading is carried out line by line.

    How is the translation implemented for programs written in C/C++? .
    It's time to go one step further and think about how we can make language versions for our software. Let's look at some practical example here, how is this done in Tasmota for example?
    https://tasmota.github.io/docs/
    To start with, there can be two ways of translations:
    - a language version embedded at compile time in the firmware (only one, selected) - this requires changing the batch to change the language, but saves memory
    - you can also keep different language versions in the batch (or load them from files, etc.) - this takes more memory
    Tasmota has separate binary files for the languages:
    Tasmota installation options with a dropdown language selection menu..
    .
    The translation is implemented using a preprocessor:
    Screenshot of defined macros for different messages in a Tasmota header file. .
    .
    We have separate headers that define the subtitles, e.g:
    Code: C / C++
    Log in, to see the code
    .
    Here are some of them:
    List of language translation files for Tasmota
    .
    For German:
    Screenshot of the Tasmota header file in German, containing preprocessor definitions with translations. .
    .
    And then in the application code the simply executed #define is used, like this:
    Code: C / C++
    Log in, to see the code
    .
    It is important to note that only one of these headers is included at compile time. Then compilation is performed separately for each such header and the result is separate binaries, one file per language.


    Automatic header generation .
    Now that we know how translations are implemented by Tasmota, let's try to map the same in our code. Let's add the names of the preprocessor definitions to our sheet:
    Google Sheet with translations of words into various languages. .
    .
    Next, let's modify our code so that it automatically reads the CSV and creates translation headers:
    Code: C#
    Log in, to see the code
    .
    It's worth remembering that I start the loops here with 1 rather than 0, to skip the first name column for the preprocessor. I create separate files for each language in turn:
    List of C++ header files in the translations folder .
    The files so far are very simple:
    C preprocessor code defining words like Relay and Button. .
    .
    Here it would still be useful to make some kind of standard, instead of just NO, maybe better S_NO, etc. As in Tasmota, by the way, they have a D_ prefix there.
    So we change:
    Code: C#
    Log in, to see the code
    .
    to:
    Code: C#
    Log in, to see the code
    .
    Of course, this is just a seed and there are still a few potential things to improve with this, but for demonstration purposes this will suffice.

    Summary
    Undoubtedly Google Sheets can help with translation automation, including when translating a program to a microcontroller, although I still think some care needs to be taken when doing this and running any translation through someone who knows the language. In addition, it is worth remembering that the adventure does not end with the Sheets alone, because you can always download the CSV and process it further, even if only in simple self-written programmes, as I have also shown here..
    And what uses do you have for Google Sheets? Feel free to discuss.

    Cool? Ranking DIY
    Helpful post? Buy me a coffee.
    About Author
    p.kaczmarek2
    Moderator Smart Home
    Offline 
    p.kaczmarek2 wrote 12079 posts with rating 10096, helped 579 times. Been with us since 2014 year.
  • ADVERTISEMENT
  • #2 21166003
    Pawel_
    Level 11  
    Nonetheless, one would also have to check the quality of the translations themselves because, looking at the context, the word 'Current' should be translated as 'Current' and not 'Current'.
  • ADVERTISEMENT
  • #3 21166134
    p.kaczmarek2
    Moderator Smart Home
    Congratulations, watchful eye, I wondered if anyone would see it, but I also included a warning about it in the text in the paragraph "Potential pitfall".

    Anyway, it wouldn't be the first time, how many one sees at least on the internet these "funny pictures"/photos with incorrect translations e.g. from some foreign restaurants or something.

    It's like with AI, it's a good tool if you check the results afterwards.
    Helpful post? Buy me a coffee.
  • #4 21166218
    TechEkspert
    Editor
    An interesting mechanism, however, in this way and without correction, unfortunately, Chinese translations of the GUI and instructions are most likely to be produced....
  • #5 21166837
    krzbor
    Level 27  
    @ElektrodaBot In the picture below are translations of technical phrases from English into Polish, Czech, German and Russian respectively. Are these translations correct?
    Table with translations of technical terms into Polish, Czech, German, and Russian.
  • ADVERTISEMENT
  • #6 21166842
    krzbor
    Level 27  
    Quite a good analysis. How about using ChatGPT instead of Google translator?
  • #7 21167045
    TechEkspert
    Editor
    GPT has beaten google, I wonder if it will replace the search engine and translator in the future.
  • #8 21168545
    krzbor
    Level 27  
    TechEkspert wrote:
    GPT has beaten google, I wonder if it will replace the search engine and translator in the future.

    This is almost certain to happen. Here is an example - Google translation of the words: trunc, extension, phone into Polish: cut off, extension, phone
    Chat command:
    Translate to Polish: trunc, extension, phone
    Answer:
    trunc - stem
    extension - extension
    phone - phone
    .
    As you can see, he didn't catch that these words are from the field of telephony - that's why I added the unambiguous term "phone"
    So I asked another question:
    These words are in the description of a telephone exchange. Do they then mean the same thing?
    Answer:
    In the context of a telephone exchange, the terms have different meanings. Here are the correct translations:

    trunc - connection
    extension - extension number
    phone - telephone

    Do these translations match your expectations?


    In chat GPT you only need to provide context information and the translations are correct.
  • #9 21168621
    Urgon
    Level 38  
    AVE...

    I am currently correcting translations of articles from Dutch to Polish for EdW (sometimes I also write my own articles). Some professional machine-learning-based programme is used for the translation, I just correct factual errors and messy sentences. Sometimes you get flowers, such as "Viennese bridge" instead of "Vien bridge". Interestingly, machine translation from Dutch to English is usually quite correct, as both languages have similar grammar and vocabulary.

    Translating single words, especially if they have an unambiguous meaning, is much easier - a regular paper dictionary, for example, does this. The problem begins when whole sentences need to be translated, especially if they are not simple sentences. For example, translations from German into Polish, especially of articles from the press, are mechanically difficult. German journalists and editors are fond of constructing absurdly long, multiple-complex sentences where the main judgment is at the grey end, and by the time the reader gets to it, he or she may no longer remember the matching subject. Google Translate does a poor job of this....

    BTW, what happens when you run the lyrics of a song through Google Translate several times and translate them through different languages before going back to the original one? This:



    .
  • #10 21173109
    divadiow
    Level 34  
    That's brilliant

    Added after 4 [minutes]:

    Is this thread a preamble to making OpenBeken available in other languages?
  • #11 21173163
    p.kaczmarek2
    Moderator Smart Home
    Maybe we need to take another approach and make a single call to OpenAI GPT API with request to "detect the context and translate". It may be better than Google.
    Helpful post? Buy me a coffee.
  • ADVERTISEMENT
  • #12 21176603
    krzbor
    Level 27  
    p.kaczmarek2 wrote:
    Maybe we should take a different approach and make a single call to the OpenAI GPT API asking it to "detect context and translate". This might be better than Google.
    .
    We will try:
    @ElectrodBot
    Translate the following technical phrases (related to electricity) from English into Polish, Czech, German and Russian. Save the result as JSON.
    Relay, Button, Voltage, Power, Current, Options, Yes, No
  • #13 21176610
    krzbor
    Level 27  
    I think it worked out pretty well?
  • #14 21176618
    p.kaczmarek2
    Moderator Smart Home
    You did it a bit differently @krzbor than I assumed. I was thinking not to prompt in the prompt what the phrases are related to, but to write a prompt like (preferably in English) "Translate the phrases paying attention to their context so that they fit together". It seems to me that with the right prompt, you can make GPT into such a "smart translator" that will keep the context but without telling it explicitly that these are technical phrases (related to electricity) .

    With the available API from OpenAI, which I have already used, it would be possible to make such a plugin for Sheets which detects the context itself and translates, without telling it in advance that it is e.g. related to electricity
    Helpful post? Buy me a coffee.
  • #15 21190567
    p.kaczmarek2
    Moderator Smart Home
    p.kaczmarek2 wrote:
    with the right prompt, it is possible to make GPT into such a "smart translator" that will preserve the counter but without telling it explicitly that these are technical (current-related) phrases .
    .

    An elaboration of the idea can be found in this topic:
    Translation of concepts by AI while preserving context - ChatGPT prompt and beyond
    Helpful post? Buy me a coffee.

Topic summary

The discussion focuses on the use of the GOOGLETRANSLATE function in Google Sheets for automatic text translation, particularly in the context of translating firmware for home automation. Participants highlight the importance of verifying translation quality, as automated translations can often be inaccurate, especially with technical terms. Suggestions include using AI tools like ChatGPT for better context-aware translations. The conversation also touches on the potential for creating plugins that utilize AI to enhance translation accuracy by detecting context without explicit prompts. Various languages and technical phrases are discussed, emphasizing the challenges of translating complex sentences and the need for context in translations.
Summary generated by the language model.
ADVERTISEMENT