Kuaishou’s Kling AI platform generates video from textual content and nonetheless photos.
Nurphoto | Nurphoto | Getty Photographs
BEIJING — China’s video-heavy leisure world has yielded a trove of knowledge for firms — they usually’re now ramping up money-making synthetic intelligence instruments for producing advertisements and movie clips.
TikTok dad or mum ByteDance holds the primary and third spots in analysis agency Artificial Analysis‘ top-ranked text-to-video generative AI fashions, which have been launched within the final two months. Google holds the second and fourth spots, whereas Beijing-based brief video app Kuaishou’s Kling AI ranks fifth.
Regardless of some consolidation in different elements of the AI business, “competitors in AI video technology fashions is at an earlier stage, and a few Chinese language firms have emerged as early leaders on this house,” stated Wei Xiong, China web analyst at UBS Securities.
“We consider AI video technology has the potential to reshape the content material business,” she stated, “by enhancing manufacturing effectivity, reducing limitations to creation and unlocking new monetization fashions.”
With such AI instruments, customers can add a single picture or a number of ones, and direct the AI to generate a video clip primarily based on them. Different instruments enable customers to enter textual content, from which the AI will generate the video clip.
Greater than 20,000 companies from advertisers to film animators already use Kling AI for producing video, the Beijing-based firm claimed this week through the World AI Convention in Shanghai. The newest model, Kling 2.1, can mechanically add related sound results to match the AI-generated video.
It isn’t only for customers in China.
“Whether or not it is person scale or industrial income, abroad accounts for almost all,” Zeng Yushen, head of operations at Kling AI, instructed CNBC in Mandarin, translated by CNBC. She stated the corporate plans to boost its assist for the instrument in locations comparable to Japan, South Korea and Europe.
“That is one thing we have noticed, AI massive fashions are more and more globalized,” she stated. “Individuals do not appear to care which nation’s product it’s.”

Kuaishou claimed Kling AI made over 150 million yuan ($20.83 million) in revenue within the first three months of the yr, and that every day promoting spend on generative AI instruments was 30 million yuan throughout that point. The corporate has but to announce when it’ll launch second-quarter outcomes. Zeng declined to share Kling AI’s mannequin coaching prices.
Whereas the lowered manufacturing price implies a “sizeable” market, UBS’ Xiong stated, “present mannequin capabilities stay constrained by clip size, movement consistency and controllability.”
Chinese language video AI firms additionally face competitors from the U.S., past the Trump administration’s restrictions on China’s entry to superior semiconductors wanted for coaching AI fashions.
Amazon and Google have launched tools for generating video from photos or textual content. The releases come as Microsoft-backed OpenAI launched its video technology mannequin Sora to ChatGPT subscribers in December — practically a yr after it had revealed its capabilities in February 2024.
Nonetheless, Kling AI had already launched to the general public in June 2024. Customers subscribe and purchase credit to generate movies.
Vidu, a rival instrument from Beijing-based startup Shengshu, launched to world customers roughly 12 months in the past, and round March this yr stated it anticipated annual income of $20 million primarily based on person subscription charges.
“Chinese language corporations have a tendency to try to first determine a industrial ‘ache level’ …, areas the place firms pays for providers, which has been a problem for AI purposes,” stated Paul Triolo, accomplice and senior vp for China at advisory agency DGA-Albright Stonebridge Group.
He pointed to how Chinese language startup 3DStyle makes use of generative AI to design new clothes types and combine them with internet-connected, automated manufacturing.
U.S. firms have additionally been making use of AI to particular industries, Triolo stated, however Chinese language companies are sometimes in a position to combine AI extra shortly as a result of they face a really aggressive atmosphere and may recruit from a “very certified” native base of software program engineers.
‘AI as filmmaker’
Chinese language e-commerce large Alibaba has additionally stayed on high of the pattern by releasing the most recent model of its video technology AI mannequin this week known as Wan2.2. The corporate claimed that with the open-source mannequin, customers can management lighting, time of day, colour tone, digital camera angle, body measurement, composition and focal size.
Open supply permits customers to obtain a mannequin without cost, and customise, if not commercialize, merchandise with it. Alibaba claimed that since open sourcing the “Wan” mannequin collection in February, the fashions have been downloaded greater than 5.4 million occasions from the Hugging Face platform and an analogous one in China known as ModelScope.
“The age of AI in movie is over. We have entered the age of AI as filmmaker,” stated Winston Ma, adjunct professor at NYU College of Legislation. He identified that China’s 1.4 billion inhabitants has given native firms “monumental” quantities of video-watching information to work with.
“Similar to TikTok took the worldwide markets by storm with brief movies within the cell web age, Chinese language AI firms might effectively lead the Generative AI revolution in visible digital leisure,” stated Ma, writer of “The Digital Struggle: How China’s Tech Energy Shapes the Way forward for AI, Blockchain and Our on-line world.”
Avatars and gaming
Chinese language firms are additionally constructing AI instruments for extra than simply producing movies.
Prior to now week, Baidu introduced that its latest AI-powered digital human know-how — which powered gross sales of $7.65 million throughout an interactive livestreaming session of over six hours in June — can be launched for broader business use in October.
In 3D visualization, Tencent launched its Hunyuan World mannequin for creating digital panoramic photos of scenes, generated from textual content and visible prompts. The visuals use a “mesh” file format which gamer builders can then use to edit particular elements of the picture.
“Past supporting [Tencent’s] inner improvement groups, the platform demonstrates Tencent’s ambition to standardize high-fidelity recreation asset technology and develop its affect throughout China’s recreation improvement panorama,” stated Daniel Ahmad, director of analysis and insights at Niko Companions.
Niko discovered that greater than half of recreation improvement studios in China already use AI for content material technology and decreasing improvement time and prices.
However recreation improvement displays broader challenges in utilizing AI at scale for producing movies and graphics.
“Whereas curiosity in AI is excessive,” Ahmad stated, “we have already seen some backlash to video games which have poorly applied the know-how.”
