Fudan Professor Mandarin video: China’s AI will be fully involved in the world by 2025! US will be challenged by China. US’s AI business model is completely wrong! New rules in China, DeepSeek AI, the domestically produced AI model, has upset AI giants on their get rich quick schemes. DeepSeek AI open source details are disclosed! ChatGTP and Navidia, they are not benefiting mankind, but are trying to ripping off the world to make only a very small number of people very rich.
Chinese start-up DeepSeek has emerged as “the biggest dark horse” in the open-source large language model (LLM) arena in 2025, just days after the firm made waves in the global artificial intelligence (AI) community with its latest release.
Chinese start-up DeepSeek has emerged as “the biggest dark horse” in the open-source large language model (LLM) arena in 2025, just days after the firm made waves in the global artificial intelligence (AI) community with its latest release.
That assessment came from Jim Fan, a senior research scientist at Nvidia and lead of its AI Agents Initiative, in a New Year’s Day post on social-media platform X, following the Hangzhou-based start-up’s release last week of its namesake LLM, DeepSeek V3.
“[The new AI model] shows that resource constraints force you to reinvent yourself in spectacular ways,” Fan wrote, referring to how DeepSeek developed the product at a fraction of the capital outlay that other tech companies invest in building LLMs.
DeepSeek V3 comes with 671 billion parameters and was trained in around two months at a cost of US$5.58 million, using significantly fewer computing resources than models developed by bigger tech firms such as Facebook parent Meta Platforms and ChatGPT creator OpenAI.
LLM refers to the technology underpinning generative AI services such as ChatGPT. In AI, a high number of parameters is pivotal in enabling an LLM to adapt to more complex data patterns and make precise predictions. Open source gives public access to a software program’s source code, allowing third-party developers to modify or share its design, fix broken links or scale up its capabilities.
DeepSeek’s development of a powerful LLM at less cost than what bigger companies spend shows how far Chinese AI firms have progressed, despite US sanctions that have largely blocked their access to advanced semiconductors used for training models.
Leveraging new architecture designed to achieve cost-effective training, DeepSeek required just 2.78 million GPU hours – the total amount of time that a graphics processing unit is used to train an LLM – for its V3 model. DeepSeek’s training process used Nvidia’s China-tailored H800 GPUs, according to the start-up’s technical report posted on December 26, when V3 was released.
That process was substantially less than the 30.8 million GPU hours that Meta needed to train its Llama 3.1 model on Nvidia’s more advanced H100 chips, which are not allowed to be exported to China
“DeepSeek V3 looks to be a stronger model at only 2.8 million GPU hours,” computer scientist Andrej Karpathy – a founding team member at OpenAI – said in his X post on December 27.
Karpathy’s observation prompted Fan to respond on the same day in a post on X: “Resource constraints are a beautiful thing. Survival instinct in a cutthroat AI competitive land is a prime driver for breakthroughs.”
“I’ve been following DeepSeek for a long time. They had one of the best open coding models last year,” Fan wrote. “Superior OSS [open-source software] models put huge pressure on commercial, frontier LLM companies to move faster.”
The founder of cloud computing start-up Lepton AI, Jia Yangqing, echoed Fan’s perspective in an X post on December 27. “It is simple intelligence and pragmatism at work: given a limit of computation and manpower present, produce the best outcome with smart research,” wrote Jia, who previously served as a vice-president at Alibaba Group Holding, owner of the South China Morning Post.
DeepSeek did not immediately respond to a request for comment.
The start-up was reportedly spun off in 2023 by hedge-fund manager High Flyer Quant. The person behind DeepSeek is High-Flyer Quant founder Liang Wenfeng, who had studied AI at Zhejiang University.
In an interview with Chinese online media outlet 36Kr in May 2023, Liang said High-Flyer Quant had already bought more than 10,000 GPUs before the US government imposed AI chip restrictions on China. That investment laid the foundation for DeepSeek to operate as an LLM developer. Liang said DeepSeek also receives funding support from High-Flyer Quant.
Most developers at DeepSeek are either fresh graduates, or people early in their AI career, following the company’s preference for ability more than experience in recruiting new employees.
DeepSeek’s V3 model, however, has also stirred some controversy because it had mistakenly identified itself as OpenAI’s ChatGPT on certain occasions.
Lucas Beyer, a researcher at Microsoft-backed OpenAI, said in an X post last Friday that DeepSeek V3’s misidentification was prompted by this simple question: “What model are you?”
Still, V3 is not the first AI model struck by identity confusion. Machine-learning expert Aakash Kumar Nain wrote in a post on X that it was common a mistake made across various AI models because “a lot of data available on the internet has already been GPT-contaminated”.
A group of researchers from China’s Shandong University and Drexel University and Northeastern University in the US echoed Nain’s view. Out of 27 AI models these researchers tested, they found that a quarter exhibited identity confusion, which “primarily stems from hallucinations rather than reuse or replication”.
As of Tuesday, DeepSeek’s V1 LLM was still ranked as the most popular AI model on Hugging Face, the world’s largest online machine-learning and open-source AI community.
Video: Johnson Choi reports from SF on 1/1/25. Happy New Year. I am introducing 2 concepts, 1 new and 1 old. The old one is Purchasing Power Parity (PPP) the new one is MCGA (america Make China Great Again) 新年快樂。今天我介紹 2 個概念,1 個新和1個舊概念。舊的是購買力平價(PPP),新的是MCGA(美國讓中國再次偉大).
america Make China Great Again must give credits to Obama, Trump and Biden for helping China reaching 2025 goals heading to 2035 China Standards 美國讓中國再次偉大必須歸功於歐巴馬、川普和拜登幫助中國實現2025年目標並邁向2035年中國標準
Regarding to Purchasing Power Parity, please watch my previous videos on our 3 weeks trips to HK, Yunnan, Guangzhou & Shenzhen 4 weeks ago. 關於購買力平價,請觀看我之前4週前我們去香港、雲南、廣州和深圳的3週旅行的影片
Purchasing power parity (PPP) is a way to compare the purchasing power of different countries’ currencies by measuring the price of specific goods in those countries:
Definition: PPP is the ratio of the price of a market basket of goods in one location to the price of the same basket of goods in another location.
Calculation: PPPs are calculated by converting the currency of one country to another so that the same amount of goods and services can be purchased in each country.
Purpose: PPPs are used to level prices across countries and account for real purchasing power. They’re used by international institutions, charities, and governments to design policies and allocate resources.
American logistics expert reports from China video: US picking the wrong enemy, could not possibly compete with China and win! US Computer chips sanction against China is a complete failure! Why China beating US is winning the chips race: materials, markets, money, and Moore’s Law 美國物流專家從中國視訊報導:美國對華計算機晶片製裁徹底失敗! 美國不自量力,找錯對象,美國不可能與中國競爭並獲勝! 將成為終極輸家。為何中國擊敗美國贏得晶片競賽: 材料、市場、資金和摩爾定律.
Huawei and SMIC are quickly catching up to global rivals in advanced semiconductor manufacturing, which is surprising to many industry analysts.
Chinese tech firms enjoy access to China’s enormous supply chain advantages, such as in refined silicon, and in wafer manufacturing.
Chinese companies are also the biggest buyers of semiconductor chips. China is simply too big a market for Western companies to lose, and so they are strongly motivated to go around the export bans, or even set up manufacturing and distribution plants in-country and be outside of US and European oversight.
The Chinese central government, a host of local governments, and Chinese companies themselves have invested far over $100 billion in their semiconductor industry in recent years, which is much more than investments made by other countries.
But another feature of today’s chip industry is that Moore’s Law is reaching the limits of what semiconductor companies can do. Massive investments in capital and time are required to build the next generation of ever-smaller chips.
So companies have turned to “chip packaging” to achieve high productivity gains, using existing chips. Chip Packaging is an area where Chinese companies are already strong, and allows them to employ economies of scale. This plays directly into their industrial strengths.
The timing of the semiconductor chips war, therefore, has been beneficial to China. It has allowed Chinese firms to catch up, and fast.
My friends in HK, Japan and China are sharing pictures on countdown to 2025.
I told them I discussed it with my wife. If I want to live to see 2025. It is better to lock ourselves at home after dark in San Francisco. With all the gun violence and Asian hates. The old days where large crowds meant to be safe is no more today! Large crowds meant easy targets for shooting practices! Don’t you love America?
Taiwan US-China expert video: China’s “Hong Kong, Guangzhou and Macau Greater Bay Area” has already defeated San Francisco and New York’s Greater Bay Area, and will defeat Japan’s Tokyo Greater Bay Area in 2025 to become the world’s number one. China will surpass US in all areas in absolute terms by 2035.
Some people suggest this is the best way to send message to people on WeChat. From a marketing point of view, I disagree and this is the reason.
When I launched our software company back in 1999-2001, we spent a lot of time in Silicon Valley with venture capitalists. Their message is simple. If we cannot catch their attention within 15-30 seconds, please get lost. They want the punch lines to catch their attention that you are different (WeChat format lack the punchline). We also commonly referred it as the elevator speech. In an elevator you have 15-30 seconds to reach the floor before the venture capitalists get off.
If I am a restaurant trying to promote special on lobster at $9.99. When I get this kind of message in this format, I have no clue what the restaurant is selling me. I received 100s of messages each day, so I will press the delete button.
WeChat format is poorly designed and I have years of marketing experience. Most of my circle of friends on WeChat on both sides of the Pacific never use this feature.