How to run latest Gemma3 models with Ollama WebUI? 500 Internal Server Error fix

p.kaczmarek2

TL;DR

  • Gemma 3 multimodal models in Ollama WebUI trigger a 500 Internal Server Error when the Docker-based Ollama core is too old.
  • The fix is to stop the Docker Ollama container, download Ollama from GitHub Releases, and run ollama.exe serve on the same port.
  • The broken container reported client version 0.6.1, while the standalone replacement uses 0.6.2.
  • After switching cores, WebUI can reach the new backend and Gemma 3 runs locally again, though models must be redownloaded.
  • The 1B model cannot handle images, so 4B or larger is recommended; 27B worked best in the tests.
Profile screen of the gemma3 library with an illustration of a llama.
Are you trying to run the latest Gemma 3 multimodal AI models, but keep getting error 500 in Ollama WebUI?
Here's a solution, but first a few words about Gemma 3. Gemma 3 is a collection of lightweight, open models built from the same research and technology that powers the Gemini 2.0 models. Gemma 3 models are designed to run fast directly on devices and come in a range of sizes (1B, 4B, 12B and 27B), allowing you to choose the best model for your specific hardware and performance needs.

These models are very easy to download from the Ollama Library and run, but the Ollama Docker package comes with the obsolete Ollama version 0.6.1, so you can't run the new models directly, at least until the Docker package is updated. I'll show a simple workaround here.
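For reference, once you have a working, up-to-date Ollama install, pulling and running a model from the Ollama Library takes one command per step; gemma3:4b below is one of the published size tags:

ollama pull gemma3:4b
ollama run gemma3:4b "Hello, what can you do?"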

    Error 500 issue
So, I'm assuming you already have a Docker setup like this:
    Screenshot of container management panel with CPU and memory usage data.
You have the Ollama core and the Ollama web interface both running in Docker.
    If not, you can get Ollama WebUI here.
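If you're starting from scratch, the Open WebUI docs suggest a single container that bundles both the interface and the Ollama core; a typical invocation looks like this (port and volume names follow the documented defaults, add --gpus=all for NVIDIA GPU support):

docker run -d -p 3000:8080 -v ollama:/root/.ollama -v open-webui:/app/backend/data --name open-webui --restart always ghcr.io/open-webui/open-webui:ollama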
You have also probably already downloaded Gemma 3 in your Ollama WebUI, but when you try to run it, you'll get:
    
    500: Ollama: 500, message='Internal Server Error', url='http://host.docker.internal:11434/api/chat'
    

Just as in the screenshot.
Screenshot showing an internal server error with the message 500: Ollama: 500, message='Internal Server Error'.

    The cause of the problem
This is caused by the Docker image using an obsolete Ollama core, namely version 0.6.1, at least in my case. You can check it by running:
    
    C:\Users\user>docker run --rm ghcr.io/open-webui/open-webui:ollama ollama --version
    Warning: could not connect to a running Ollama instance
    Warning: client version is 0.6.1
    

I tried to update it, but found no way to do so. Luckily, there is a workaround...

    Easiest solution
    Download Ollama directly from the Releases tab:
    https://github.com/ollama/ollama/releases/tag/v0.6.2
Choose the package for your OS; in my case it was ollama-windows-amd64.zip.
    Screenshot of a webpage showing a list of downloadable assets with file size and update date information.
Shut down Ollama in Docker first:
    Screenshot showing a list of containers with their details, including names, images, status, CPU usage, ports, and last started time.
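From the command line this is a single stop command; open-webui below is only an example name, so list your containers first to find the right one:

docker ps
docker stop open-webui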
Extract it and run it, as in:
    
    ollama.exe serve
    

Now, as long as the port settings match, your Ollama WebUI from Docker should be able to reach the new Ollama core. You can also check its version:
    
    W:\TOOLS\ollama-windows-amd64>ollama.exe --version
    Warning: could not connect to a running Ollama instance
    Warning: client version is 0.6.2
    

    So, now you're running a newer Ollama.
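A quick sanity check that the backend is up, assuming the default port 11434 (Ollama exposes a simple version endpoint):

curl http://localhost:11434/api/version

If the Dockerized WebUI still can't reach the host process, note that ollama.exe serve binds to localhost by default; Ollama reads the OLLAMA_HOST environment variable, so binding to all interfaces is a common workaround (make sure that's acceptable on your network):

set OLLAMA_HOST=0.0.0.0:11434
ollama.exe serve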
This means that you'll have to redownload your AI models. I've only downloaded the smallest Gemma so far.
    Screenshot of AI model selection menu with the gemma3:1b 999.89M option selected.
    You can also download a bigger model:
    Interface of a program with a search and download of gemma3 file.
    File download interface showing progress for the file gemma3:4b.
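If you prefer the terminal to the WebUI download dialog, the same pulls work against the new core; the tags are as listed in the Ollama Library:

ollama.exe pull gemma3:4b
ollama.exe list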
    Let's check if it works
    Screenshot of a conversation with the Gemma language model.
Now, a word of warning - the smallest 1B model will not work with images, so I suggest starting with 4B.

    Some first Gemma 3 tests
It's time for a little Gemma 3 testing. I've played around with it a bit and decided to showcase the 27B model, as it is, unsurprisingly, more reliable than the smaller ones. Yet I can still run it on my 7-year-old ROG gaming notebook.
    Digital clock in a yellow casing showing the time 20:22.
    Nice, it can read the time correctly.
    A broken LED light bulb lies on a wooden surface.
Not bad, it even noticed the slight damage to the bulb.
    Let's try something harder.
    USB Tester displaying voltage and current on screen
Well, unfortunately it still makes mistakes and can give confusing results, but it's still better than LLaVA, which I tested in the past...

    Summary
It turns out that it is very easy to run the new Gemma 3 models locally. The only issue I encountered was the obsolete Ollama version in Docker, but hopefully the Docker package will be updated soon as well, so you won't encounter this problem in the future.
Regarding Gemma 3 itself, it seems very promising, especially the larger versions. They seem better than LLaVA at first glance, but I'm going to perform more tests.
I'll leave them for another topic.
Did you also try to run Gemma 3, and if so, what are your experiences?
If you're more interested in Gemma 3, you can also just post an image or a prompt here and I'll test Gemma with it.


FAQ

TL;DR: If Gemma 3 fails with a 500 error in Ollama WebUI, the fix is simple: replace Docker’s Ollama 0.6.1 with host-based 0.6.2. As the post says, "The cause of the problem" is the outdated core inside the container. This FAQ helps local AI users run newer Gemma 3 models, including multimodal variants, through Open WebUI without changing their whole setup. [#21488942]

Why this matters: A single version mismatch can block new Gemma 3 models entirely, even when Open WebUI and Docker appear to be working normally.

Option | Version / status | Gemma 3 status | Notes
Docker-based Ollama core | 0.6.1 | Fails with 500 error | Obsolete version in the shown setup
Host-installed Ollama | 0.6.2 | Works | Replaces the backend while WebUI stays in Docker
Gemma 3 1B | Runs, but limited | Not suitable for images | Author recommends starting higher
Gemma 3 4B+ | Better starting point | Better for multimodal tests | 27B looked more reliable

Key insight: The failure is not a Gemma 3 model bug. Open WebUI works once it points to a newer Ollama core on the host, with matching ports. [#21488942]

Quick Facts

  • Gemma 3 is presented as a family of lightweight models in 1B, 4B, 12B, and 27B sizes, so users can match model size to available hardware and performance goals. [#21488942]
  • The shown failure happens at http://host.docker.internal:11434/api/chat, returning 500: Internal Server Error when WebUI calls an outdated Ollama backend. [#21488942]
  • The container check reports client version 0.6.1, while the working standalone Windows install reports client version 0.6.2 after extraction and launch. [#21488942]
  • The host-side replacement uses ollama.exe serve after stopping the Docker Ollama service, so Docker Open WebUI can talk to the newer core on the same port. [#21488942]
  • In early local tests, the author says the 27B Gemma 3 model felt more reliable than smaller versions and still ran on a 7-year-old ROG gaming notebook. [#21488942]

1. Why does Ollama WebUI show a 500 Internal Server Error when I try to run the latest Gemma 3 model in Docker?

Ollama WebUI shows the 500 error because the Docker setup uses an outdated Ollama core, version 0.6.1, which the post says cannot run the newer Gemma 3 models correctly. The failing request shown is /api/chat on host.docker.internal:11434, so WebUI is reachable, but the backend version is too old for that model. [#21488942]

2. How can I fix the Gemma 3 error 500 issue in Ollama WebUI when the Docker image uses Ollama 0.6.1?

Fix it by replacing the Docker-based Ollama backend with a newer standalone Ollama 0.6.2 install on the host:
1. Stop the Ollama service in Docker.
2. Download and extract Ollama 0.6.2 for your OS.
3. Start it with ollama.exe serve and keep the same port mapping so Open WebUI can reach it. [#21488942]
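A condensed sketch of those steps on Windows; the container name below is only an example (check yours with docker ps), and the path depends on where you extracted the ZIP:

:: stop the Docker-based backend (use your actual container name)
docker stop open-webui
:: from the folder where ollama-windows-amd64.zip was extracted
ollama.exe serve
:: in a second terminal, confirm the new version
ollama.exe --version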

3. What is Gemma 3, and how is it different from other lightweight multimodal AI models?

Gemma 3 is a collection of lightweight open models built from the same research and technology that powers Gemini 2.0 models. The post lists four sizes—1B, 4B, 12B, and 27B—and highlights fast on-device use plus multimodal capability, which makes it practical for local text-and-image tests on varied hardware. [#21488942]

4. What is Ollama WebUI, and how does it connect to the Ollama core running locally or in Docker?

"Ollama WebUI" is a web interface that sends chat requests to an Ollama backend, usually over a local HTTP endpoint, and can run separately from the core service inside Docker or on the host. In the shown setup, WebUI calls http://host.docker.internal:11434/api/chat, so the interface and model runtime are connected over a port, not bundled into one process. [#21488942]

5. Which Ollama version is required to run newer Gemma 3 models correctly?

The working version in the post is Ollama 0.6.2. Version 0.6.1 inside the Docker image triggered the 500 error, while the host-installed 0.6.2 backend allowed the author to launch Gemma 3 successfully through the same WebUI. [#21488942]

6. How do I check the Ollama version inside the open-webui Docker container?

Run docker run --rm ghcr.io/open-webui/open-webui:ollama ollama --version to check the Ollama version bundled with that container image. In the post, that command prints a warning about no running instance and then reports client version is 0.6.1, which confirms the outdated backend. [#21488942]

7. What are the exact steps to replace the outdated Docker-based Ollama core with Ollama 0.6.2 on Windows?

Use a host-side Windows install and leave Open WebUI in Docker:
1. Stop Ollama in Docker.
2. Download the Windows amd64 ZIP for Ollama 0.6.2 and extract it.
3. Run ollama.exe serve, then verify with ollama.exe --version, which in the post reports client version is 0.6.2. [#21488942]

8. Where can I download the newer Ollama release needed for Gemma 3, and which package should I choose for Windows amd64?

Download it from the Ollama Releases page linked in the thread, specifically the 0.6.2 release. For a 64-bit Windows machine, the post says to choose ollama-windows-amd64.zip, extract it, and run the executable directly. [#21488942]

9. Why do I need to shut down the Ollama Docker service before starting ollama.exe serve on the host machine?

You shut down the Docker Ollama service to avoid backend conflict and let Open WebUI talk to the newer host-based Ollama instead. The post states that, once port settings match, Docker WebUI can reach the new core, so leaving the old container service active could keep traffic pointed at version 0.6.1. [#21488942]

10. How do port settings between Docker Open WebUI and a host-installed Ollama affect whether Gemma 3 works?

Gemma 3 works only if Open WebUI can reach the correct Ollama backend on the expected port. The post uses host.docker.internal:11434, and explicitly says the Docker WebUI should reach the newer host core "as long as port settings are matching," so a wrong port leaves WebUI connected to the failing backend or no backend at all. [#21488942]
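For reference, when Open WebUI runs without the bundled backend, it is pointed at the core through the OLLAMA_BASE_URL variable. A sketch based on the Open WebUI docs (the :main image has no bundled Ollama; on Linux you may also need --add-host=host.docker.internal:host-gateway):

docker run -d -p 3000:8080 -e OLLAMA_BASE_URL=http://host.docker.internal:11434 -v open-webui:/app/backend/data --name open-webui ghcr.io/open-webui/open-webui:main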

11. What causes previously downloaded AI models to need redownloading after switching from Docker Ollama to a newer standalone Ollama?

Previously downloaded models need redownloading because the author switched from the Docker-based Ollama environment to a separate standalone Ollama installation. That new host install has its own model storage context, so the post notes that moving to the newer backend means downloading the AI models again. [#21488942]

12. Why doesn’t the Gemma 3 1B model work with images, and why is the 4B model a better starting point for multimodal tests?

The post says the smallest Gemma 3 1B model will not work with images, so it is a poor choice for multimodal testing. The author recommends starting with the 4B version instead, because it supports the image-oriented tests shown in Open WebUI and avoids that specific 1B limitation. [#21488942]

13. Gemma 3 vs LLaVA: which model gives better local image understanding results based on early tests?

Gemma 3 looked better than LLaVA in the author’s early local image tests. The post says the 27B Gemma 3 model seemed more reliable, correctly read a clock image, noticed slight bulb damage, and still made some mistakes, but the author judged it better than LLaVA at first glance. [#21488942]

14. What kind of hardware is practical for running larger Gemma 3 models like 27B locally, especially on an older gaming laptop?

A larger Gemma 3 model such as 27B can still be practical on older enthusiast hardware. The author reports running the 27B model on a 7-year-old ROG gaming notebook, which suggests local testing is possible on aging gaming-class laptops, though the post does not provide exact RAM or GPU figures. [#21488942]

15. What have users experienced when running Gemma 3 locally through Ollama WebUI, especially with image prompts and larger models?

Users can get Gemma 3 running locally through Ollama WebUI after fixing the backend version, and larger models give better image results. In the post, the 27B model handled simple image prompts well, including reading time and spotting slight bulb damage, while the author still warns that results can be confusing on harder tasks. [#21488942]