Skip to content

There are many kinds of artificial intelligence large models now. According to their main functions, I simply divide them into several categories to make it easier for everyone to choose according to their needs. Here are the classifications and some recommendations that I find easy to use, easy to get started with, and focus on practicality!

1. Text Generation: All-rounders for Writing, Chatting, and Polishing

This type of model specializes in text understanding and generation. Whether it's writing articles, translating, polishing copy, or just chatting, they can handle it.

  • Good and Free Domestically:

    • DeepSeek Chat (chat.deepseek.com): A versatile player for text tasks, simple and easy to use.
    • Tencent Yuanbao (yuanbao.tencent.com): Feature-rich, no pressure for daily text processing.
    • Tongyi Qianwen (Qwen) (chat.qwen.ai): Stable and reliable, suitable for various text needs.
  • Worth a Try Abroad:

    • Grok (grok.com): Particularly suitable for tasks that require real-time search, it can seamlessly call data from the X platform, with excellent information freshness.
    • Gemini (via Google AI Studio, aistudio.google.com): Almost free and unlimited, it's good at text generation, translation, and polishing, and can even handle mixed text and image layouts, just no video generation.

2. Text-to-Image: Turn a Sentence into Artwork

This type of model can directly generate images based on your text descriptions, a great helper for brainstorming.

  • Domestic Recommendations:

    • Ji Meng (jimeng.jianying.com): In addition to generating images, it can also make videos, full of creativity, but the free quota is a bit stingy.
    • Tongyi Wanxiang (wan.video): Both images and videos are great, and the effect is good, but the number of free trials is also limited.
  • Strong Foreign Players:

    • Grok: Not only is it good at text, but image generation is also a piece of cake.
    • Gemini: More powerful, in addition to generating images independently, it can also insert pictures directly into the text, which is a super cool experience.

3. Text or Image-to-Video: From Idea to Short Film

This type of model can turn text or images into short videos of a few seconds, suitable for friends who want to get results quickly.

4. Audio and Video Understanding: Understand, Watch, and Summarize

This type of model can analyze audio or video content and output text transcripts or summaries, with full marks for practicality.

My Usage Experience and Recommendations

The above are several large models that I often use, and the experience is quite good. Domestically, DeepSeek Chat, Tencent Yuanbao, and Tongyi Qianwen are stable and reliable in text tasks, and they are free and easy to use.

The foreign Grok relies on real-time data from the X platform, which is particularly powerful for searching for information; Gemini is an all-rounder, almost free and powerful, especially the mixed text and image layout is very eye-catching. If you want to play with text-to-image or video, Ji Meng and Tongyi Wanxiang are good domestic choices, but you have to use the free quota sparingly.

If you are too lazy to choose and just want to use one, I highly recommend Gemini. In addition to not being able to generate videos, other functions are almost perfect, and the cost performance is outrageously high, and it is also great for developers to use!