Cache Memory Optimization

11d

New KV cache compaction technique cuts LLM memory 50x without accuracy loss

MIT researchers developed Attention Matching, a KV cache compaction technique that compresses LLM memory by 50x in seconds — ...

Penguin Solutions Introduces Industry's First Production-Ready CXL-Based KV Cache Server

Penguin Solutions today announced its MemoryAI KV cache server, the industry's first production-ready KV cache server utilizing CXL memory technology.

Make Tech Easier

The Myth of Linux Optimization Tools, and Why You Really Don’t Need Them At All

Learn why Linux often doesn't need extra optimization tools and how simple, built-in utilities can keep your system running smoothly.

The agentic AI boom is here; operations will decide who wins

This shift matters because agentic AI doesn't scale like a chatbot or a single workload. A company-wide fleet of agents is a moving swarm of concurrent users, bringing with them bursts of demand, ...

Intel Core Ultra 200HX Plus CPUs Bring Faster Gaming And A New Optimization Tool

According to Intel, users upgrading from older platforms will see as much as a 62% gain in gaming and up to 30% faster single-threaded performances.

Nvidia BlueField-4 STX adds a context memory layer to storage to close the agentic AI throughput gap

Nvidia's BlueField-4 STX reference architecture inserts a dedicated context memory layer between GPUs and traditional storage, claiming 5x token throughput and 4x energy efficiency for agentic AI ...

44m

Micron (MU) Q2 2026 Earnings Call Transcript

Micron Technology, Inc. delivered an exceptional fiscal Q2. Quarterly revenue nearly tripled versus one year ago, and revenue ...

Sandisk: This Nvidia GTC 2026 Announcement Could Be A Game Changer

Sandisk stock is up 158% YTD. Explore AI data center NAND demand, BiCS8 QLC SSD ramp, and Nvidia GTC 2026 memory hierarchy ...

Morning Overview on MSN

Nanoengineered spintronic memory stores data in 4 resistance states

A magnetic tunnel junction engineered to produce four distinct resistance states instead of the standard two could double the ...

Network World

Nvidia targets inference as AI’s next battleground with Groq 3 LPX

The company says its new architecture marks a shift from training-focused infrastructure to systems optimized for continuous, ...

Intel launches Core Ultra 270K Plus and 250K Plus to revive its desktop gaming push

These chips arrive as Intel navigates a rocky recent history in the high-end desktop space. Past generations faced thermal woes and instability, the latest Arrow Lake ...

Storage vendors orbit the Nvidia sun at GTC

GTC Hitachi Vantara and Nutanix announced support for Nvidia’s new GPUs and software at GTC 2026, much like every other storage system vendor, while IBM integrated Watsonx and other offerings more ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results