Autoencoder Image Compression

SFTNet: Spatial-Frequency Transformer Network for Learned Image Compression

Abstract: In learned image compression (LIC), most existing methods process spatial domain information through convolutional neural networks (CNNs) or Transformers. However, they struggle to model ...

Scientific Research Publishing

A Spatio-Temporal Prediction Model for Wall Turbulence Based on Hybrid Neural Networks ()

The spatio-temporal evolution of wall-bounded turbulence is characterized by high nonlinearity, multi-scale dynamics, and chaotic nature, making its accurate prediction a significant challenge for ...

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

GitHub

SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model

Important Note: This repository implements SVG-T2I, a text-to-image diffusion framework that performs visual generation directly in Visual Foundation Model (VFM) representation space, rather than ...

IEEE

Multi-Scale Semantic Compression for Robust Collaborative CNN Inference in Low-SNR Environments: An Attention-Enhanced UNet Autoencoder

Abstract: In collaborative inference scenarios, semantic communication replaces raw data transmission by conveying task-oriented semantic features to improve bandwidth efficiency. However, under noisy ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results