• Let’s begin by discussing your very first experience of using DeepSeek. When it launched last week, people were quite amazed by it. Could you tell us what that first encounter was like?

  • Mm, you mean DeepSeek R1, right? The latest model?

  • Yes, the newest version.

  • I believe I noticed it about 13 days ago—around two weeks back. At the time, there wasn’t any media coverage; it had simply been uploaded to Hugging Face.

  • Because I regularly download new models from Hugging Face, I just downloaded it and naturally started trying it out myself.
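
  (For readers who want to reproduce that step, below is a minimal sketch of pulling a checkpoint from Hugging Face and prompting it locally with the transformers library. The checkpoint name is an assumption: it uses one of the smaller distilled R1 variants rather than the full R1 weights, which are far too large for most local machines.)

```python
# Minimal sketch: download a distilled R1 checkpoint from Hugging Face and run one prompt.
# The checkpoint name is an assumption standing in for whichever model was actually tested;
# the full DeepSeek-R1 weights require far more hardware than a typical workstation.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"  # assumption: small distilled variant
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

prompt = "Briefly explain what an open-weights language model is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```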

  • When you first used R1, what were your impressions? Did you feel it had truly… perhaps surpassed OpenAI’s latest model?

  • Well, I personally use O1 Pro on a daily basis, quite frequently in fact. Naturally, compared to O1 Pro, R1 isn’t more accurate, nor does it show more creativity. In most respects, it doesn’t quite reach O1 Pro’s level.

  • I see. So, last week when many media outlets were comparing R1 to O1 Pro, how did you feel when you saw those comparisons?

  • Because O1 Pro is the version of O1 that uses more computing power, most users of O1 are using versions with lower compute—like O1 Mini or just plain O1. Compared with those, especially for processing Chinese, DeepSeek does feel more fluent than O1, most likely because it was trained on more Chinese-language data.

  • However, if you use O1 Pro, then for Chinese—as well as for region-specific features of different forms of written Chinese—O1 Pro still has more depth of understanding.

  • Understood. Earlier, we also noted that when you responded to people, you explained certain “unlocking” methods for it. Did you figure out how to bypass censorship because you knew that the offline version lacks some of the safeguards present in the online or app versions? How did you go about circumventing its content moderation?

  • First, to be clear, I haven’t used the website or the app versions, so I haven’t really dealt with that aspect. Typically, I go straight for the model on Hugging Face.

  • I did, however, notice that the model seemingly has two layers: one layer detects political or sensitive topics and, without much thought, provides an “official narrative.” You can observe that for ordinary queries, it first appears to “think” before answering; but if it encounters certain keywords, it skips that thinking process and immediately produces a standard official line. That’s quite unusual—clearly a deliberately designed feature.

  • Another issue, which people are also discussing, is that because the training dataset for political or social science questions might come from sources with a specific ideology, it will think and respond, but the answers often reveal that ideological bent. That aspect also exists, but what first caught my eye was the skipping mechanism I just mentioned.

  • Right. So at the time, once you realised that difference, you surmised that perhaps the offline version is less subject to the heavy moderation found in the online and app versions. Also, because you mentioned that it displays its “thinking” process, some users say that when they try to input sensitive terms in the online version, it starts to provide an answer, but within two seconds the text is deleted. Does that match your observations?

  • Yes, that’s yet another issue. In total, it seems there are three layers of systems at play:

  • 1. Online or app-only layer: even if the model begins to answer, the response is overridden.

  • 2. Built-in censorship in the offline model: as mentioned, the offline version still has triggers for certain keywords or historical events, causing it to skip its normal thinking process and produce unnatural evasions. Unsurprisingly, this undermines trust; people may feel the censorship is too pervasive and not want to use it.

  • 3. Underlying ideological perspective: in its “natural” mode, it will answer, but the answer reflects a particular ideology.

  • Right. So that underlying ideology is essentially because of the way it was trained—“fed” by data that inherently contains those biases?

  • Precisely. But we’ve already seen, for instance, that Perplexity.ai’s embedded R1 version has removed those unnatural censorship and avoidance behaviours through changes to the model weights. Even if you disable search on Perplexity and use what they call “assistive writing mode,” when you ask about politically sensitive keywords or historical events, it responds normally.

  • This shows that, because its weights are open source, one can modify them to remove these unnatural censorship routines.

  • We touched on how different instructions or keywords might produce drastically different results. For instance, “Tiananmen Incident” might yield a starkly different answer from “Tiananmen Massacre.”

  • I haven’t tried “Tiananmen Massacre,” but I did try “Tiananmen Tragedy.” Essentially, language models work by “continuing” a text prompt, so if you phrase your question or topic differently, it often provides a different description. The keyword itself can carry a particular connotation.

  • Even when we write news headlines, how we phrase the headline suggests how we’ll describe it in the body. Because a language model is fundamentally about text continuation, the prompt steers the tone of its response.
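
  (A tiny illustration of the continuation point above, as a sketch: the same model, given two differently phrased prompts, tends to continue in correspondingly different tones. The checkpoint name is an assumption; any small, locally runnable causal language model shows the effect.)

```python
# Sketch: the same model continues differently depending on how the prompt is phrased.
# The checkpoint name is an assumption; greedy decoding keeps the comparison deterministic.
from transformers import pipeline

generator = pipeline("text-generation", model="deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B")

for prompt in ["The Tiananmen incident was", "The Tiananmen tragedy was"]:
    result = generator(prompt, max_new_tokens=60, do_sample=False)
    print(result[0]["generated_text"])
    print("---")
```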

  • Yes. But regarding the online version, do you think censorship has gone so far that, as some reports mention, it might even refuse to answer “Who is Xi Jinping?” It appears many queries aren’t answered at all.

  • That’s why I emphasised that in Taiwan, we value freedom of expression and pluralistic participation. If a product or service exhibits such tendencies, people may neither trust nor want to use it, and they’ll also look for the kind of technical workarounds I mentioned.

  • For example, besides Perplexity, Hugging Face itself—where I initially discovered the model—has started something called “Open-R1”: it uses the training approach of R1 but without the censorship module. It’s not a matter of downloading R1 and removing the censorship code; rather, they retrain the model from scratch in the same style as R1 but without adding those censorship mechanisms. As a result, the Open-R1 model naturally won’t respond with that sudden, unnatural avoidance.

  • Because of this, we can anticipate that an open-source model like Open-R1—which is more transparent—may be integrated into major companies’ solutions. They wouldn’t necessarily have to take DeepSeek-R1, remove parts, and so forth. That’s one option.

  • Another is that we’re seeing different countries obtaining R1 and adding their own language data. That approach can address the third layer—the ideological bias—because if the model’s initial information was one-sided, it might reflect a specific worldview even if it’s willing to answer. For example, Japan’s CyberAgent added Japanese corpora to train it further, so if you ask politically sensitive questions in Japanese, it’s no longer limited by the original ideological framing.

  • So using English might also yield different results in the online version?

  • Yes, relatively speaking. But as I mentioned, because the weights are open source, once you download and train it further with, say, a Japanese dataset, it tends to adopt whatever additional training you provided.
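
  (The “train it further” step just described can be sketched as continued fine-tuning on a corpus in the target language. Everything named below is an assumption: a small distilled R1 checkpoint and a placeholder Japanese text file stand in for whatever checkpoint and corpus a team such as CyberAgent actually used.)

```python
# Sketch of continued fine-tuning an open-weights checkpoint on an additional-language corpus.
# All names are assumptions: the distilled checkpoint and "japanese_corpus.txt" are placeholders.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token  # make sure padding works for the collator
model = AutoModelForCausalLM.from_pretrained(model_id)

# Placeholder corpus: plain text in the target language, one document per line.
corpus = load_dataset("text", data_files={"train": "japanese_corpus.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

tokenized = corpus["train"].map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="r1-continued",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
        num_train_epochs=1,
        bf16=True,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
trainer.save_model("r1-continued")
```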

  • So it seems there are essentially two main approaches: 1. Completely new: Retrain from scratch following the R1 paper’s methodology without the censorship module. 2. Incremental: Take the official R1, remove or replace its second-layer censorship module, and add new training data to correct the third-layer ideological bias. That way, the second and third layers are handled. The first layer—where you must use their website or app—ceases to matter since you already have the model locally.

  • Yes, I see. But for users who aren’t very tech-savvy and only use the app or website version, do you think its censorship approach might affect which AI product they choose? In other words, what are the hidden downsides for more “lay” users?

  • I’m not entirely sure what you mean. For instance, if you visit chatgpt.com now, it won’t offer R1; it offers O3 Mini, which is freely accessible with a “think” button. In most respects, O3 Mini is actually as good as or better than R1.

  • So are you suggesting that a user must choose between free O3 Mini or free DeepSeek R1, and you’re wondering how they’d weigh their options?

  • Yes, precisely. If the DeepSeek app or website clearly imposes ideological filters whenever you ask about public affairs or society, might users just decide not to bother with DeepSeek?

  • In other words, since users have access to options like O3 Mini that are functionally equivalent or even superior, and which also avoid this ideological bias, most people will naturally gravitate towards using the ones without such issues.

  • A real trade-off would only exist if DeepSeek R1 were demonstrably superior to all other products, forcing users to consider tolerating its ideological censorship. But that’s simply not the situation we face.

  • Besides OpenAI, Gemini’s AI Studio offers something called Flash Thinking, which many rankings consider top-tier. It similarly doesn’t impose ideological bias. People who want a completely free solution can simply use Gemini’s Flash Thinking.

  • Understood. In non-Chinese-speaking regions, the DeepSeek app soared to second place in app download charts last week. I’m curious: for people who download the app, where do you think the appeal lies, given the heavy content moderation?

  • Partly, it’s the media buzz. It doesn’t take much effort to download an app. People see the headlines and think, “Might as well try it out.”

  • In Taiwan, we’ve also seen the Ministry of Digital Affairs very quickly advise that if you use its app or web service, the storage and subsequent use of your data are entirely controlled by DeepSeek’s parent company. The Ministry declared that in public agencies or government-funded schools, this product is categorised as a “risky product,” similar to TikTok, because it can be substantively controlled by the Chinese government. Therefore, it’s not permitted for official use. This specifically applies to DeepSeek’s web and app versions.

  • Are you saying the risk is analogous to other Chinese apps, such as Xiaohongshu (Little Red Book)?

  • Exactly. Starting in 2019, Taiwan implemented the “Principles for Restricting the Use of Products that Endanger National Cybersecurity,” which apply to any product substantively controlled by hostile foreign regimes, regardless of the product’s nominal country of origin. Government agencies, whether operating in-house, outsourcing, or providing venues for public activities, are required to restrict the use of such products for that reason: essentially, because the controlling power lies outside our jurisdiction.

  • However, as I mentioned, if Hugging Face retrains a new Open-R1 from the same technical foundation but is not subject to Beijing’s control, then it doesn’t pose that same national security concern.

  • I see. So the security concerns now raised in various places essentially mirror what you’ve described?

  • Yes. The primary worry is that the app or website might leak data outside the scope of effective oversight, posing a risk. That’s the first layer.

  • Then, you have the second and third layers regarding built-in censorship triggers and ideological training data. But those relate to the model’s weights. The “risky product” designation addresses the first layer.

  • Could the ideological layer also create a security issue by steering user answers in a certain direction?

  • That’s certainly a risk. Additionally, if you read the R1 paper, it focuses on how to assist users and reduce harm to them. But it doesn’t say much about “honesty.”

  • Consequently, if a user asks DeepSeek R1 to generate harmful or malicious content—for example, personal attacks, disinformation scripts, or scam plots—the model doesn’t inherently refuse. It tries to comply with user requests.

  • That’s another angle: its training is heavily geared towards user satisfaction, without strong safeguards against malicious uses.

  • Understood. Has the censorship approach overshadowed its initial “wow factor”? Or would you say that if one simply downloads and retrains it locally, it’s not really affected?

  • If you look at Perplexity’s integration of R1, for example, it seems to have effectively removed the second layer. Moreover, if you rely on Perplexity to do web searches, it grounds its answers in actual web results, which mitigates the third-layer “hallucinations” or ideological partialities. They’re not entirely eradicated, but they’re largely contained. Of course, to fully address these issues, an entirely new open version—like Open-R1—would be ideal.

  • And, of course, there’s a very straightforward alternative: people can use Gemini Thinking or O3 Mini, or other AI solutions, many of which are free as well. In terms of programming support or reasoning capabilities, Gemini Thinking or O3 Mini aren’t inferior, so there’s no real sacrifice to switching.

  • I see, thank you. How do you, personally, feel about R1’s development, given you’ve used it for some time?

  • Well, as I said, I still mainly rely on O1 Pro.

  • When R1 was first released, maybe one or two days in, I downloaded it for local testing because I regularly check Hugging Face for new models. When Perplexity’s R1 came out, I tried it for a bit to see how extensively the censorship was removed. But at this point, my daily usage still revolves around O1 Pro and, to some extent, O3 Mini High.

  • I see. One last question: over the past week, people have been talking a lot about DeepSeek. It’s only been a short time, but do you have any thoughts on its future?

  • I think what’s most notable is that this week’s conversations have really centred on data rights, privacy rights, freedom of expression, and so on. That’s a very positive thing: people are increasingly conscious that fundamental rights aren’t limited to the analogue world but apply just as much online, and the more we discuss these issues, the broader the consensus will become about which rights are essential to uphold.

  • Furthermore, this increased focus on rights is also a positive development for the future of open source, because internationally there’s an ongoing debate about whether open source truly advances the rights we’ve been discussing, or whether it might, conversely, undermine those very rights by making things like scams and deepfakes easier to carry out.

  • But if people’s discussions lean towards “How can open source properly fulfil its social responsibilities? Could the community collaboratively design tools to guard against malicious uses?” and so forth, then we can steer open-source AI in a direction that benefits society and upholds human rights.

  • All right, that’s about all I wanted to ask today. Thank you so much for speaking with me, especially so late in the evening.

  • No problem at all. I’ll produce a verbatim transcript of this conversation later on, but I’ll wait until you’ve published your piece before releasing it.

  • Sure, no worries. I’ll send you the link once the article goes live this week.