Multi-level Fusion of Multi-modal Semantic Embeddings for Zero Shot Learning.

AllImages Books Videos Maps News Shopping

Scholarly articles for Multi-level Fusion of Multi-modal Semantic Embeddings for Zero Shot Learning.

scholar.google.com › citations

MFF: Multi-modal feature fusion for zero-shot learning
Cao · Cited by 20

Hybrid transformer with multi-level fusion for …
Chen · Cited by 136

Multi-level Fusion of Multi-modal Semantic Embeddings for Zero ...

Nov 7, 2022 · We pioneer to propose a multi-level fusion model to effectively combine knowledge encoded in multi-modal semantic embeddings together.

Multi-level Fusion of Multi-modal Semantic Embeddings for Zero ...

dl.acm.org › doi › fullHtml

In this paper, we propose a multi-level fusion zero shot learning (MLF-ZSL) model to effectively fuse semantic embeddings from multiple modalities.

Multi-level Fusion of Multi-modal Semantic Embeddings for Zero ...

bohrium.dp.tech › paper › arxiv

Abstract:Zero shot learning aims to recognize objects whose instances may not be covered by the training data. To generalize knowledge from seen classes to ...

Multi-level Fusion of Multi-modal Semantic Embeddings for Zero ...

www.researchgate.net › ... › Embedding

Dec 10, 2024 · To address the challenges, we propose a Region-Wise Multi-View Representation Learning (ROMER) to capture multi-view dependencies and learn ...

MFF: Multi-modal feature fusion for zero-shot learning - ScienceDirect

www.sciencedirect.com › article › abs › pii

Oct 21, 2022 · A novel Multi-Modal Feature Fusion algorithm (MFF) is proposed to alleviate the domain shift problem of Zero-Shot Learning (ZSL).

Adaptive Multi-Scale Semantic Fusion Network For Zero-Shot Learning

ieeexplore.ieee.org › document

We propose a practical Adaptive Multi-scale Semantic Fusion (AMSF) framework to perform object-based multi-scale attribute attention for semantic disambiguation ...

MFF: Multi-modal feature fusion for zero-shot learning | Semantic ...

www.semanticscholar.org › paper › MFF...

A Vision Transformer-based GZSL method named Depth-Aware Multi-Modal ViT (DAM2ViT), which exploits multi-level features of ViT and incorporates a ...

Multi-level multilingual semantic alignment for zero-shot ... - Bohrium

bohrium.dp.tech › paper › arxiv

In this paper, we propose a simple and novel unsupervised method for cross-language entity alignment. We utilize the deep learning multi-language encoder ...

Multimodal Semantic Fusion for Zero-Shot Learning - ResearchGate

www.researchgate.net › ... › Multimodality

Oct 28, 2024 · The first is how to choose the best independent modalities.The second is how are a set of modalities optimally fused to map to the high-level ...

Multi-level multilingual semantic alignment for zero-shot cross ...

www.sciencedirect.com › article › abs › pii

In this paper, we propose a novel multi-level alignment framework, which hierarchically learns the semantic correlation between multiple levels.