With the proliferation of artificial intelligence large language models (LLMs), I've categorized them based on their primary functions to help you choose according to your needs. Below are the categories and some recommendations that I find easy to use, focusing on practicality!
1. Text Generation: All-Around Writers, Chatbots, and Polishers
These models specialize in text understanding and generation. Whether it's writing articles, translating, polishing copy, or just chatting, they can handle it.
Good & Free Options (China):
- DeepSeek Chat (chat.deepseek.com): A versatile player for text tasks, simple and easy to get started.
- Tencent Yuanbao (yuanbao.tencent.com): Feature-rich, handling daily text processing with ease.
- Tongyi Qianwen (Qwen) (chat.qwen.ai): Stable and reliable, suitable for various text needs.
Worth Trying (International):
- Grok (grok.com): Especially suitable for tasks requiring real-time search, seamlessly accessing data from the X platform for up-to-the-minute information.
- Gemini (via Google AI Studio, aistudio.google.com): Almost free and unlimited, excelling in text generation, translation, and polishing. It can even handle mixed text and image layouts, just not video generation.
2. Text-to-Image: Turning Sentences into Art
These models can directly generate images based on your text descriptions, making them great tools for unleashing your creativity.
Recommended (China):
- JiMeng (jimeng.jianying.com): In addition to generating images, it can also create videos, maximizing creativity, but the free quota is a bit stingy.
- Tongyi Wanxiang (wan.video): Delivers both images and videos with good results, but the number of free uses is also limited.
Strong Contenders (International):
- Grok: Not only is it proficient with text, but it also excels at image generation.
- Gemini: Offers even more powerful features, generating images independently and also inserting them directly into text, providing a super cool experience.
3. Text/Image-to-Video: From Idea to Short Film
These models can turn text or images into short videos of a few seconds, suitable for those who want quick results.
- Recommended:
- JiMeng (jimeng.jianying.com)
- Tongyi Wanxiang (wan.video)
4. Audio/Video Understanding: Listening, Watching, and Summarizing
These models can analyze audio or video content and output text transcriptions or summaries, offering maximum practicality.
My Experience and Recommendations
The above are some of the large models I often use, and my experience with them has been quite good. Domestically, DeepSeek Chat, Tencent Yuanbao, and Tongyi Qianwen are solid performers in text tasks, and they're free and easy to use.
Internationally, Grok is particularly effective for information searches thanks to real-time data from the X platform. Gemini, on the other hand, is an all-rounder, almost free and powerful, especially excelling in mixed text and image layouts. If you want to play with text-to-image or video, JiMeng and Tongyi Wanxiang are good choices in China, just use the free quota sparingly.
If you're too lazy to choose and just want to use one, I highly recommend Gemini. Except for video generation, its other features are almost perfect, offering an incredibly high cost-performance ratio that developers will love!