Want to deploy powerful open-source AI models like Qwen 2.5, Llama 3, and DeepSeek-R1 locally, but struggling to find a simple and easy-to-use method?
Don't worry! The golden combination of Ollama + Open WebUI will clear all obstacles for you.
This article provides a comprehensive guide on how to easily set up a local AI environment using Ollama + Open WebUI, giving you a dedicated and powerful AI assistant to explore the endless possibilities of AI!
Pro Tip: Due to hardware limitations, local deployments typically cannot run the largest versions of DeepSeek-R1 (e.g., 67B). But don't worry, smaller models (e.g., 1.3B or 7B) can run smoothly on most personal computers and provide excellent reasoning capabilities. More importantly, you can choose the version that best suits your needs!
Why Choose Ollama + Open WebUI?
Among many local deployment solutions, the Ollama + Open WebUI combination stands out as the preferred choice for many AI enthusiasts. What makes them so appealing?
- Ollama: A Simplified Model Engine
- Ollama is like an "AI model treasure box". With just one command, you can download, install, and run various mainstream large language models, such as Llama 3 and DeepSeek-R1!
- Open WebUI: An Elegant and Easy-to-Use Interactive Interface
- Open WebUI provides a gorgeous interface for Ollama. It offers a beautiful and intuitive web interface.
- Completely open source and free.
After deployment, simply open http://127.0.0.1:8080
in your browser to start chatting with your AI assistant:
Exclusive for Windows Users: One-Click Startup Package, Say Goodbye to Cumbersome Configurations!
Considering the difficulties that Windows users may encounter when configuring the Docker environment, we have thoughtfully prepared an integrated package that can be used after downloading and decompressing, truly achieving "out-of-the-box" functionality!
Download and Decompress the Integrated Package:
Integrated Package Download Address: https://www.123684.com/s/03Sxjv-JmvJ3
- If you haven't installed Ollama yet, please double-click the
ollama-0.1.28-setup.exe
file in the package to install it. The installation process is very simple, just click "Next" all the way.
- If you haven't installed Ollama yet, please double-click the
Start WebUI:
- Double-click the
启动webui.bat
file in the package to start Open WebUI.
- When you start it for the first time, the system will prompt you to set up an administrator account. Please follow the prompts to complete the registration.
- Double-click the
Select the Model You Want to Use
After entering Open WebUI, you will see the model selection area in the upper left corner. If there are no models in the list, don't worry, it means you haven't downloaded any models yet.
You can directly enter the model name in the input box to download it online from Ollama.com:
Model Selection Tips:
- Model Vault: Go to https://ollama.com/models to browse the rich model resources officially provided by Ollama.
- Parameter Scale: Each model has different versions (e.g., 1.3B, 7B, 67B, etc.), representing different parameter scales. The more parameters, the more powerful the model is, but it also requires more computing resources (memory and video memory).
- Do What You Can: Choose the appropriate model based on your hardware configuration. Generally, if your "memory + video memory" size is greater than the model file size, you can run the model smoothly.
- Deepseek-R1 Selection: Search for
deepseek-r1
in Ollama's model library to find it.
Taking the deployment of the deepseek-r1
model as an example:
Select Model Specification: On the https://ollama.com/library page, find the model version you want to deploy (e.g.,
deepseek-r1
).Download the Model: Paste the model name (e.g.,
deepseek-r1
) into the input box in the upper left corner of Open WebUI, and click the "Pull from ollama.com" button to start downloading.Wait for the Download to Complete: The download time depends on your network speed and model size, please be patient.
Start Your AI Journey
After the model is downloaded, you can chat with DeepSeek-R1 in Open WebUI! Explore its powerful features!
If the model supports it, you can also upload pictures, files, etc., for multimodal interaction. Let your AI assistant not only be able to talk but also "read and understand pictures"!
Advanced Exploration: Open WebUI's Hidden Treasures
Open WebUI has more features than just that! Click the menu button in the upper left corner to find more surprises:
Personalized Customization: In the "Settings" panel, you can adjust the interface theme, font size, language, etc., according to your preferences to create an exclusive AI interaction experience.
- You can also customize prompts to make your AI assistant understand you better!
Multi-User Management: In the "Admin" panel, you can set user registration methods, permissions, etc., to facilitate multiple people sharing your local AI resources.
Adjust Detailed Parameters: Click in the upper right corner to set advanced parameters
Multi-Model Comparison: Who is Better?
Open WebUI also supports a multi-model comparison function, allowing you to easily compare the output results of different models and find the one that best suits your needs!
GPU Acceleration: Squeeze the Performance of Your Graphics Card! (Optional)
If you have an NVIDIA graphics card and have installed the CUDA environment, then congratulations, you can use Ollama to accelerate model inference through simple operations, greatly improving the response speed of the AI assistant!
- Double-click the
GPU-cuda支持.bat
file in the package to install CUDA dependencies.
Ollama + Open WebUI, this golden combination, opens a door to the local AI world for you. Now, you can get rid of cloud constraints, create your own AI think tank, and explore the infinite possibilities of AI!