logo elektroda
logo elektroda
X
logo elektroda

Vision-based AI models for translating catalogue notes - we test Nano Banana, ChatGPT-Image and othe

p.kaczmarek2  19 567 Cool? (+3)
📢 Listen (AI):
Comparison of two screenshots translating L1050 datasheet using different AI models
Is a catalogue note in a foreign language a problem in 2026? Today I will test whether artificial intelligence can replace a translator and translate subtitles from electronic parts specifications into English. Importantly, the whole experiment will be based on screenshots (bitmaps) rather than PDF files, so AI will have no way to make its job easier. Will this form of specification translation be of any use? Let's find out!

The testing methodology will be very simple - I take a screenshot of the catalogue note and try to translate it into English by sending it as an attachment with a prompt to the AI. I will use the free LMArena website to run the AI models:
https://lmarena.ai/
After testing, I will try to subjectively evaluate and group the models according to their results.

Note - I have placed the images as generated by the AI. If something is cropped, it means that such a bitmap was created by the model.

Test 1 - constant-current LED controller
Inputs:
Chinese L1050 datasheet page with description, features, and pin layout
translate to english



seedream-4-high-res-fal
L1050 IC on blue datasheet background with technical description and pin annotation
Such a grand hallucination rather rules out this model.

reve-v1.1-fast
Screenshot of a Chinese LED driver datasheet for model L1050 with feature list and pinout diagram
No translations.

reve-v1.1
L1050 datasheet page with tables, description, and IC pinout diagram
Major unnecessary reworking of document, residual translations.

chatgpt-image-latest (20251216)
Screenshot of LED driver L1050 datasheet with features and pin diagram
Slightly better, occasional typos. Almost usable.

gpt-image-1.5
L1050 LED driver datasheet with text errors and pinout diagram
Slightly better, occasional typos. Almost usable.

flux-1-context-pro
Screenshot of a Chinese datasheet for L1050 IC in SOP-16 package
No translations.

flux-2-flex
Screenshot of L1050 datasheet with description, features list, and 16-pin package diagram.
Virtually no translations, except for the title above the document.

flux-2-flex-20251231
Screenshot of a datasheet for the L1050 LED driver chip
Translation failure - random letters and stamps, mostly no translations.

qwen-image-edit
Mosaic of small pastel-colored squares in turquoise, pink, yellow, and blue tones
Screenshot of the L1050 datasheet with Chinese text and a pin configuration diagram.
This model hallucinates a strange background and is able to spoil the document. Useless.


flux-2-max
Screenshot of L1050 LED driver datasheet with English and Chinese text sections
Vestigial translations, most are a meaningless string of letters. The headline translated.

flux-2-pro
Screenshot of L1050 datasheet showing description, features, ordering, and marking info
Nonsensical strings of letters, useless result.

flux-2-pro-20251231
Screenshot of L1050 datasheet with garbled English and Chinese text and SOP-16 pin layout
Screenshot of L1050 datasheet with mistranslated English text on a blue background.
Nonsense strings of letters, useless result.

gemini-2.5-flash-image-preview (nano-banana)
Screenshot of L1050 LED driver datasheet with description, features, and SOP-16 package diagram
Surprisingly the elder Banana did not want to translate anything.

gemini-3-pro-image-preview (nano-banana-pro)
L1050 LED IC datasheet with description, features, and SOP-16 pinout diagram
Best result to date. Text almost all correct, occasional errors and typos, only in paragraphs are some words nonsense.

Trial 2 - display controller
FD650 controller diagram with LED and keyboard interface specifications and pin layout
translate to english
gpt-image-1-mini
Screenshot of FD650 datasheet with Chinese text and a pinout diagram labeled FD-950
Model reworked image, renamed layout, not translated.

gemini-2.5-flash-image-preview (nano-banana)
Diagram and description of the FD650 chip functions in Chinese.
Elder Banana could not cope with the translation.

flux-2-pro
Datasheet page for FD650 IC showing features list and 16-pin diagram.
The model has attempted a translation but the result is unreadable, virtually only the title is helpful - LED Driver/Keyboard Scan.

flux-2-pro-20251231
Screenshot of FD650 datasheet with garbled English text and pinout diagram
Flux 2's primary keywords decrypted, but the rest are useless.


flux-1-context-pro
Screenshot of FD650 datasheet with Chinese text and pinout diagram labeled “Translate to english”
This model has superimposed the inscription translate to English on the image.



flux-2-flex-20251231
FD650 datasheet excerpt with block description and pinout diagram
Residual translations.

gpt-image-1.5
FD650 datasheet page with IC overview, features list, and pinout diagram
At first glance very good, but the introduction from the second/third sentence onwards fell apart.

reve-v1.1
FD650 datasheet excerpt with features list and pin diagram
The basics translated, but also damaged the lead-in diagram.

seedream-4-high-res-fal
Screenshot of FD650 datasheet showing functions and a pinout diagram
The title may be translated, but the model has added some strange background.

chatgpt-image-latest (20251216)
Screenshot of FD650 datasheet with device features and pinout diagram
Like the second one from OpenAI, it's not bad, only the introduction fell apart afterwards. In addition, I see a slightly damaged lead-in diagram.

gemini-3-pro-image-preview (nano-banana-pro)
FD650 datasheet page showing description, features, and pinout diagram
Nano Banana Pro has again performed very well.

Trial 3 - synchronous rectifier
This time a trial with a screenshot:
Circuit diagram and Chinese-language description of MT6706BL synchronous rectifier
translate to english

qwen-image-edit
Synchronous rectifier circuit diagram with MT6706L and MT6706BL ICs
Useless result.

chatgpt-image-latest (20251216)
Schematic with MT6706BL chip and description of flyback synchronous controller operation
The basic translation is there, but with lots of typos. Synchornous?

gpt-image-1.5
MT6706BL circuit diagram with incorrect OCR-translated English technical text
Same as the previous GPT.

seedream-4-high-res-fal
Rectifier diagram with MT6706BL IC and distorted section of translated specification
This model has redone the background again....

gpt-image-1
Screenshot of MT6760BL datasheet showing descriptive text and circuit diagrams.
Fragment of datasheet with technical text and MT3706BL block diagrams
Residual translation. In addition, with another attempt I received a strangely cropped image.

gpt-image-1-mini
Diagram of two MT6706BL circuits with bridge rectifiers and capacitors
And here what happened? A short circuit? And this is between two separate diagrams.... in addition, the model also cropped the picture.

flux-2-flex
Flyback power supply schematic with MT6706BL controller and technical description text.
Again, residual translation.

gemini-2.5-flash-image-preview (nano-banana)
Rectifier circuit diagrams using MT6706BL with Chinese text explanation.
No translation.

seedream-4.5
Application diagram for MT6706BL synchronous rectifier with technical description in English.
Application circuit diagram of MT6706BL used in synchronous rectification
It came out slightly better this time, but there are still shortcomings.

flux-1-context-pro
Block diagram with MT6706BL IC and Chinese-language functional description
No translation.

flux-2-pro
Rectifier schematics with MT6706BL chip and distorted English title and paragraph
The letters have been changed, but they don't make sense?

gemini-3-pro-image-preview (nano-banana-pro)
bb558ffb3
Another success story for Nano Banana Pro.


Final ranking of video models
I did additional tests, but did not put any more images in the topic, because the content with several of the same nonsense graphics would be unreadable. In the end, I grouped the models according to my overall feeling, although I noticed that occasionally a particular model might do better or worse - probably the generation has some randomness factor (seed - so called).
Right translations, occasional errors:
- gemini-3-pro-image-preview (nano-banana-pro)
Almost acceptable translations, but problems with some words, blurring of letters:
- chatgpt-image-latest (20251216)
- gpt-image-1.5
Sometimes it explains something, sometimes it hallucinates and creates nonsense:
- reve-v1.1
- gpt-image-1
Bare translation attempts, meaningless letter composition:
- flux-2-max
- flux-2-pro
- flux-2-pro-20251231
Something tries to translate, but hallucinates and rearranges images:
- seedream-4-high-res-fal
Can spoil the image:
- qwen-image-edit

In summary , only the latest Nano Banana Pro seems to give acceptable results in terms of translating images from the catalogue notes, although it still happens to have artefacts. Just behind it is still GPT-Image 1.5 and ChatGPT-Image (20251216), but it is no match for it. The rest of the models are useless, although some of them try to remake the image and some ignore the text completely.
There doesn't seem to be much left to do with AI in this context. It seems to me that as early as 2026 there will be much better models that can handle such translations even better, and even if not, the Nano Banana Pro is still satisfactory.
Do you see a use for artificial intelligence in the role of an image translator? Or do you know of other practical applications for the Nano Banana Pro and similar models?

About Author
p.kaczmarek2
p.kaczmarek2 wrote 13701 posts with rating 11511 , helped 623 times. Been with us since 2014 year.

Comments

fachman1964 02 Jan 2026 09:02

And so very good for a machine. It will still take some time before it reaches perfection. Nevertheless, you can read the information you need from such translations, definitely better than Chinese "bushes".... [Read more]

szeryf3 02 Jan 2026 09:53

Artificial intelligence is learning by the day and I suspect that by the end of this year there will be a visible difference in this subject. It wasn't so long ago that this was black magic, and now peasants... [Read more]

MikeC 02 Jan 2026 11:16

Mi chatgpt 5.2 still did things differently: https://obrazki.elektroda.pl/7460791500_1767348997_bigthumb.jpg And this one for English and pseudo Polish: https://obrazki.elektroda.pl/8048532500_1767349615_thumb.jpg... [Read more]

gulson 02 Jan 2026 12:06

Nano banana the best, as usual. With hundreds of pages, however, it is best to do OCR from such a document, i.e. get the Chinese, and then translate yourself with the language model, already without... [Read more]

Mateusz_konstruktor 02 Jan 2026 12:53

In my opinion, artificial intelligence will also lead to a standard in China for the use of English in electronic component documentation. By a circuitous route, but nevertheless this is the aspect I see... [Read more]

gulson 02 Jan 2026 13:18

The idea of a global language has been around for quite a long time, so far nothing has changed. The indicated documentation can be released in English in parallel, now they got a tool where it is done... [Read more]

p.kaczmarek2 02 Jan 2026 14:13

@fachman1964 my sense is that with them there is often vestigial and untranslated documentation, even when they make it available. In the SDK for Beken and other IoT chips it is similar. @MikeC I think... [Read more]

Mateusz_konstruktor 02 Jan 2026 14:17

@gulson It is not a global language, but the equivalent of a technical drawing, i.e. a language that is universal in its assumptions and understood by everyone regardless of the mother tongue used. English... [Read more]

p.kaczmarek2 02 Jan 2026 14:23

What I am wondering is what is the actual cost of such a reworking of one image by Nano Banana Pro. Say, this situation from my presentation. Has anyone seen such information somewhere? A quick web search... [Read more]

Mateusz_konstruktor 02 Jan 2026 14:58

@pkaczmarek2 Any rates, even those actually paid, are not authoritative. Here, the decisive factor is something other than the actual cost. The end-user price is the result of an activity that has as... [Read more]

PPK 02 Jan 2026 16:45

Something seems to me that in the case of Asian 'bushes', AI rather searches on the manufacturer/other sites for a ready-made English translation...Mainly the complete ones.... [Read more]

MikeC 02 Jan 2026 17:11

It translates from these bushes without any problem ... [Read more]

p.kaczmarek2 05 Jan 2026 23:33

Test with Polish. translate all to polish https://obrazki.elektroda.pl/7213900200_1767652236_thumb.jpg gpt-image-1.5 https://obrazki.elektroda.pl/1381551100_1767652255_bigthumb.jpg... [Read more]

Mateusz_konstruktor 06 Jan 2026 00:26

Why do images two and three have the left and right sections cut off? [Read more]

p.kaczmarek2 06 Jan 2026 01:04

All in all, I think we've talked about this before, but this is how the AI generates. Below is a short video of what it looks like to me: gpt-image-1.5 cuts off images for me always, no matter if... [Read more]

Mateusz_konstruktor 06 Jan 2026 02:54

I encounter such a problem in cases of incompatible settings or incompatible web browsers themselves. Parts of web pages sometimes get cut off as a result, and this happens especially with unusual settings. Can... [Read more]

p.kaczmarek2 06 Jan 2026 10:51

The website address is the same all the time, as in the first paragraph: Somewhere in the fifteen years I've been doing the frontend and backend, and I haven't encountered the browser truncating... [Read more]

Mateusz_konstruktor 06 Jan 2026 11:47

This could be linked to the frame size of the item depending on the screen size and then automatically adjusting the dynamically generated image. Sometimes a slightly unusual web browser or some setting... [Read more]

willyvmm 06 Jan 2026 12:35

If the attached images are exactly what AI got, then I'm not surprised. Shit in => Shit out. [Read more]

%}