Many AI large language models are available today. I've grouped them by primary function to help you choose the right one for your needs. Here are my practical, easy-to-use recommendations for getting started!

1. Text Generation: Writing, Chatting, and Polishing Powerhouse

These models specialize in text understanding and generation. Whether it's writing articles, translating, refining copy, or just chatting, they can handle it all.

  • Good & Free (China):

    • DeepSeek Chat (chat.deepseek.com): A versatile tool for text-based tasks, simple and easy to use.
    • Tencent Yuanbao (yuanbao.tencent.com): Feature-rich and handles daily text processing with ease.
    • Tongyi Qianwen (Qwen) (chat.qwen.ai): Stable and reliable, suitable for various text-related needs.
  • Worth Trying (International):

    • Grok (grok.com): Especially suitable for tasks requiring real-time search, seamlessly integrating with X (formerly Twitter) data for up-to-date information.
    • Gemini (via Google AI Studio, aistudio.google.com): Almost free and nearly unlimited, it excels at text generation, translation, and polishing. It can even produce responses that interleave images with text, but it doesn't generate videos.

2. Text-to-Image: Turn Words into Art

These models can generate images directly from your text descriptions, making them a great tool for brainstorming and creative exploration.

  • Recommended (China):

    • Ji Meng (jimeng.jianying.com): Besides generating images, it can also create videos, maximizing creativity, but the free quota is a bit stingy.
    • Tongyi Wanxiang (wan.video): Handles both image and video generation, with good results, but the number of free uses is limited.
  • Strong Contenders (International):

    • Grok: Not only is it good at text, but it also excels at image generation.
    • Gemini: More powerful still. It generates standalone images and can also insert images directly into its text responses, which makes for a cool experience.

3. Text or Image-to-Video: From Idea to Short Film

These models can turn text or images into short videos of a few seconds, suitable for those who want quick results.

4. Audio and Video Understanding: Listen, Watch, and Summarize

These models can analyze audio or video content and output text transcriptions or summaries, making them highly practical.

My Experience and Recommendations

These are some of the models I use frequently, and my experience with them has been quite positive. Among the Chinese options, DeepSeek Chat, Tencent Yuanbao, and Tongyi Qianwen are solid choices for text-based tasks, and they're free to use.

Internationally, Grok is particularly effective for information retrieval, thanks to its real-time data from X (formerly Twitter). Gemini, on the other hand, is an all-rounder: almost free and feature-rich, with its ability to interleave images and text in a single response being particularly outstanding. If you want to play with text-to-image or video, Ji Meng and Tongyi Wanxiang are good Chinese options, but be mindful of the limited free quota.

If you're too lazy to choose and just want to use one, I highly recommend Gemini. Apart from video generation, its features are nearly perfect, it offers excellent value for money, and it's a pleasure for developers to use!