Thailand News and Discussion Forum | ASEANNOW

Chinese start-ups such as DeepSeek are challenging global AI giants

Featured Replies

Their models are cheaper thanks to US export restrictions that have inadvertently spurred innovation

Last month, a Chinese start-up called DeepSeek astonished the international tech community with its latest open-source artificial intelligence model. DeepSeek-V3 delivers a performance comparable to that of better-funded US rivals such as OpenAI. This week it impressed once again with R1, its foray into AI reasoning.

https://archive.ph/51WIt#selection-2415.0-2419.1

 

I see lots of comments about how China just produces cheap copycat goods. Clearly this is an outdated view. Japanese manufacturing was regarded in the same way for about 25 years after WW2.


And just in time, Musk has made Grok subscription-only again. It was free last night but only shows a subscription prompt now. ChatGPT was already limited in the number of images it would generate for free, and Gemini won't generate images of people unless you pay. They are driving people to DeepSeek, although DS has no image function yet.

On 1/25/2025 at 4:39 PM, placeholder said:

Last month, a Chinese start-up called DeepSeek astonished the international tech community with its latest open-source artificial intelligence model. [...] I see lots of comments about how China just produces cheap copycat goods. Clearly this is an outdated view.

That's smart of the Chinese, to make it open source. By doing so, they will be able to freely use any improvement made by anyone in the world.

 

In the end, the total development resources dedicated to it may well exceed what is expected to be invested in ChatGPT and other LLMs.


DeepSeek was reportedly developed for almost nothing (~$5.6 million), and apparently you won't need the fancy chips NVIDIA is making to get full value out of it. Thus, NVIDIA lost around $560 billion in market cap on 1/27/25. The subscription models of ChatGPT and others are also undercut, since anybody can choose open-source DeepSeek for free, or close to it.

 

I'm beginning to think the $500 billion AI plan the President announced last week is just a thinly disguised Oligarch bailout, as the tech sector suddenly isn't worth the multiple the market gave it.

 

I wonder if tech insiders are going to be dumping even more. I wonder how bad the margin calls are going to be for all the leveraged buyers of tech. And I wonder what company Musk is actually going to run, or whether he will go full DOGE: Twitter is worth very little compared to what he paid, $40-80,000 Teslas are competing in a market with $12-25,000 BYDs, and his rocket just blew up last week (2nd stage) and rained debris all over the Caribbean.

 

I think we've seen lots of peaks in the last week: Peak Musk, Peak Equity Market, and already Peak 47's Presidency. It's all downhill from here.

 

 

They're doing it for a fraction of the cost the US companies are spending.

 

See the much more extensive post on this above mine.

On 1/25/2025 at 4:39 PM, placeholder said:

 

I see lots of comments about how China just produces cheap copycat goods. Clearly this is an outdated view.

 

Actually that's exactly what happened with DeepSeek. 

 

It may be the case it has managed to cut costs and compute, but we do know that it is built at least in part on the shoulders of the giants: it uses Nvidia chips – albeit older, cheaper versions – and utilises Meta's open-source Llama architecture, as well as AliBaba's equivalent Qwen.

 

https://www.bbc.com/news/articles/cx2jxvn0r51o

 

They copied Meta's open source. And by all accounts DeepSeek is not as good as the US competition; for instance, you can't ask it questions about Tiananmen Square. It censors itself when it comes to Chinese topics.

4 hours ago, Cameroni said:

Actually that's exactly what happened with DeepSeek. [...] it uses Nvidia chips – albeit older, cheaper versions – and utilises Meta's open-source Llama architecture, as well as AliBaba's equivalent Qwen.

Actually, it's broader than China vs the USA; it's also about open-source vs commercial AI. The open-source AI community is quite dynamic, and this is an additional contribution which will be improved not only by China but also by other open-source AI participants.

 

The choice of open source by the Chinese has two advantages:

- as you pointed out, it allows them to benefit from previous open-source knowledge and resources,

- it channels a share of open-source AI innovation by other actors towards complementary work that will help China cope with its current limitations on chips and other resources.

Here is an interesting video on the topic.

 

 


The above video by Microsoft engineer David Plummer is a clear intro to what DeepSeek is, and isn't. The clever part is its distillation process, which turns larger models into smaller, good-enough models that use less hardware. But I see a possible catch, and asked Grok to confirm (we're already a mix of human and AI!). I post my question and its answer for clarity. However it goes, DeepSeek's arrival will be transformative.

 

Me: My question is about the newly released DeepSeek AI software and its operation. The question is specific. Preface: DeepSeek is said to use larger AI models to train a number of smaller models that can run on smaller, and thus cheaper, hardware. Does this mean that 1) larger hardware and training is still required when training the smaller models? And 2) does the DeepSeek open-source code they released include the initial large training model, or does it only include code for running the smaller models?

 

(Spoiler: 1) yes, still required, and 2) no, not released.)

 

Grok:

  1. Larger Hardware and Training for Smaller Models: Yes, the process of training smaller models through distillation or transfer learning still requires the use of larger hardware and computational resources initially. DeepSeek uses a large model (like DeepSeek-R1 or DeepSeek-V3) to train these smaller models. The initial training of the large model necessitates significant computational resources, but once trained, the knowledge can be distilled into smaller models that require less computational power for deployment. This is evident from the strategies mentioned where DeepSeek-R1 knowledge is distilled into smaller models, which implies that the heavy lifting is done during the training of the larger model.

  2. Open Source Code and Model Inclusion: The open-source release by DeepSeek includes models that are distilled from the larger models, particularly the DeepSeek-R1 series. However, the training code for the initial large models (like DeepSeek-V3 or the base models used for distillation) does not seem to be open-sourced. The comments on X and technical reports indicate that while the distilled models and possibly some of their training code are available, the actual training pipeline for the larger models, including the data and the original training code, is not part of the open-source release. This means users can run and possibly fine-tune the smaller models but cannot replicate the training of the large models without additional resources or information.

In summary, while you can utilize and potentially further train the smaller models with less hardware, the groundwork of training the larger models still requires substantial resources. The open-source materials provided by DeepSeek focus on the smaller, distilled models rather than the full spectrum of their training process for the large models.

End of Grok.
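The distillation idea discussed above can be sketched in a few lines. To be clear, this is not DeepSeek's actual training code (which, per the Grok answer, was not released); it is a minimal, self-contained illustration of the soft-target loss commonly used in knowledge distillation, with made-up logit values standing in for real model outputs.

```python
import math

def softmax(logits, temperature=1.0):
    """Turn raw logits into a probability distribution; temperature > 1 softens it."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q): how far the student's distribution q is from the teacher's p."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Soft-target distillation loss: match the student's softened
    distribution to the teacher's. The T^2 factor keeps gradient
    magnitudes comparable across temperatures."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return (temperature ** 2) * kl_divergence(p, q)

# Made-up logits: a student that already mimics the teacher incurs zero loss,
# a contradicting student incurs a clearly positive loss.
teacher = [2.0, 1.0, 0.1]
print(distillation_loss(teacher, [2.0, 1.0, 0.1]))  # 0.0
print(distillation_loss(teacher, [0.1, 1.0, 2.0]))  # clearly positive
```

During real distillation, this loss is minimized over many training examples, which is why (as Grok notes) the expensive teacher model must exist first, even though only the cheap student is deployed.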
 

I think the scariest part is that at the moment nobody really knows the details. People claim this and that, but is all the information true? The market is obviously reacting already, because maybe it is.

 

AI is so hyped, and so many people invest in anything that looks promising, that the bubble is very unstable.

And I guess what just happened could happen again and again. Or maybe not. It's a high-risk investment, somewhere between making huge amounts of money and losing it all.

12 minutes ago, rabas said:

The above video by Microsoft engineer David Plummer is a clear intro to what DeepSeek is, and isn't. [...] I post my question and its answer for clarity.
 

Do Grok, and other AIs, have up-to-date data?

Not long ago I asked an AI about a recent event, and the answer was something like: "I don't have any data about that event."

11 minutes ago, OneMoreFarang said:

Do Grok, and other AIs, have up-to-date data?

Not long ago I asked an AI about a recent event, and the answer was something like: "I don't have any data about that event."

 

Good question. From my post you can see Grok is well aware of DeepSeek and how it works. How so, if it takes so long to train?

 

Learning and knowledge break down into three stages.

 

Stage 1: Build an initial model (the links held in memory), which takes many months at full power. Done once per major revision.

Stage 2: These large models can be updated at significant but lesser cost. Done periodically, as new information requires, not too often.

Stage 3: On the fly. When Grok answered my question, it first looked at a number of websites and X posts to learn. This is probably served from some current index; i.e., it is not yet trained on the material, but it can go 'read up' before answering my question.
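Stage 3 above can be sketched with a toy example. This is not how Grok is actually implemented; it is a hypothetical miniature of "retrieve fresh documents at question time, then answer from them", with invented document texts standing in for indexed websites and X posts.

```python
import re

def tokenize(text):
    """Lowercase bag of words; a crude stand-in for real indexing."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, documents, top_k=1):
    """Stage 3 lookup: rank already-indexed documents by word overlap with the question."""
    q = tokenize(question)
    scored = sorted(documents, key=lambda d: len(q & tokenize(d)), reverse=True)
    return scored[:top_k]

def build_prompt(question, documents):
    """Prepend retrieved context so a frozen (Stage 1/2) model can answer about recent events."""
    context = retrieve(question, documents)
    return "Context:\n" + "\n".join(context) + "\n\nQuestion: " + question

# Invented documents standing in for freshly indexed web pages:
docs = [
    "DeepSeek released its R1 reasoning model in January 2025.",
    "Bangkok traffic was heavy on Monday morning.",
]
print(build_prompt("What model did DeepSeek release?", docs))
```

The key point the sketch makes is that the model's weights never change at question time: only the prompt does, which is why an AI with an old training cutoff can still discuss last week's news, or reply "I don't have any data about that event" when nothing relevant was indexed.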

 
