Text Label Frame Image Classification Python

ML-Mamba: A Dual-Stream State Space Model with Atrous Scanning for Multi-Label Remote Sensing Image Classification

Abstract: Remote sensing images exhibit multi-semantic char acteristics with abundant geographical and contextual information. Although multi-label learning methods demonstrate remark able advantages ...

IEEE

AMITA: Attribute-Guided Masked Image-Text Alignment for Multi-Label Image Representation

Abstract: Multi-label image classification, which involves recognizing multiple objects within a single image, is a fundamental task in computer vision. Recently, Visual-Language Models (VLMs) have ...

GitHub

LAP-GAN: Label augmentation with perceptual loss for self-supervised text-to-image synthesis

This repository provides the pytorch code for the paper "LAP-GAN: Label augmentation with perceptual loss for self-supervised text-to-image synthesis" by Yong Xuan Tan, Jit Yan Lim, Kian Ming Lim, ...

GitHub

Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise

TL;DR: We propose CUFIT, a robust fine-tuning method for vision foundation models under noisy label conditions, based on the advantages of linear probing and adapters. Download the training data, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results