Large Language Models Deepseek

Cryptopolitan on MSN

DeepSeek V4 rumored to outperform ChatGPT and Claude in long-context coding

February, is rumored to outperform ChatGPT and Claude in long-context coding, targeting elite-level coding tasks.

DeepSeek looks to offload simple LLM tasks to save billions of parameters

Detailed in a recently published technical paper, the Chinese startup’s Engram concept offloads static knowledge (simple ...

14d on MSN

DeepSeek pitches new route to scale AI, but researchers call for more testing

DeepSeek's proposed "mHC" architecture could transform the training of large language models (LLMs) - the technology behind artificial intelligence chatbots - as developers look for ways to scale ...

2d on MSN

New AI method lets models think harder while avoiding costly bandwidth

New AI memory method lets models think harder while avoiding costly high-bandwidth memory, which is the major driver for DRAM ...

Decrypt

Insiders Say DeepSeek V4 Will Beat Claude and ChatGPT at Coding, Launch Within Weeks

DeepSeek's upcoming V4 model could outperform Claude and ChatGPT in coding tasks, according to insiders—with its purported ...

14d on MSN

Another Chinese quant fund joins DeepSeek in AI race with model rivalling GPT-5.1, Claude

Beijing-based Ubiquant launches code-focused systems claiming benchmark wins over US peers despite using far fewer parameters ...

Analytics India Magazine

Decoding DeepSeek’s Solution to China’s Compute Shortage

DeepSeek's new research enables retrieval using computational memory, not neural computation, freeing up GPUs.

Centre for International Governance Innovation (Opinion)

Chinese AI Models and the High-Stakes Fight for AI Neutrality

Chinese and Western large language models are reshaping global information power, embedding political world views into the ...

DIGITIMES

DeepSeek V4 update: Conditional memory reshapes large-model efficiency

DeepSeek founder Liang Wenfeng has published a new paper with a research team from Peking University, outlining key technical ...

The Information

DeepSeek To Release Next Flagship AI Model With Strong Coding Ability

Chinese AI startup DeepSeek is expected to launch its next-generation AI model that features strong coding capabilities in ...

VentureBeat

DeepSeek's success shows why motivation is key to AI innovation

January 2025 shook the AI landscape. The seemingly unstoppable OpenAI and the powerful American tech giants were shocked by what we can certainly call an underdog in the area of large language models ...
